324 Commits

Author SHA1 Message Date
Sun Junyi
7ce63e53b7 Merge pull request #197 from skyerown/master
修复带权重测试脚本输出结果是调用顺序错误
2014-11-07 11:07:19 +08:00
walkskyer
6772f0282e 修复带权重测试脚本输出结果是调用顺序错误 2014-11-06 22:24:43 +08:00
Sun Junyi
a5944bb88e Merge pull request #196 from qinwf/master
Add jiebaR in README
2014-11-04 12:29:42 +08:00
Qin Wenfeng
77a831b8c1 Add jiebaR in README 2014-11-04 11:59:40 +08:00
Sun Junyi
cf2aa88122 Merge pull request #195 from gumblex/master
统一获取关键词接口,优化缓存命名
2014-11-01 12:54:57 +08:00
Dingyuan Wang
751ff35eb5 improve extract_tags; unify extract_tags and testrank 2014-10-31 23:15:51 +08:00
Dingyuan Wang
e3f3dcccba improve the loading and caching process 2014-10-31 21:56:09 +08:00
Sun Junyi
d6ef07a472 Merge pull request #192 from gumblex/master
更新、完善说明;命令行加入自定义词典功能
2014-10-25 15:29:26 +08:00
Dingyuan Wang
9d2818b440 fix English part of README 2014-10-25 13:16:30 +08:00
Dingyuan Wang
31b7d11809 improve README 2014-10-25 13:12:19 +08:00
Dingyuan Wang
a6119cc995 add custom dictionary to __main__; update README; slightly optimize textrank 2014-10-25 12:59:36 +08:00
Sun Junyi
0049b0c5b4 Merge pull request #191 from sing1ee/master
add some introduction of textrank
2014-10-24 22:50:36 +08:00
zhangcheng
138d713e98 add some introduction of textrank 2014-10-24 22:41:51 +08:00
Sun Junyi
4030d8ed86 Merge pull request #190 from sing1ee/master
add a simple implementation of textrank
2014-10-24 22:20:05 +08:00
zhangcheng
6eb9f6149c add a simple implementation of textrank 2014-10-24 21:15:54 +08:00
Sun Junyi
1850bd6d37 Update README.md 2014-10-24 20:23:10 +08:00
fxsjy
f5ca87e088 merge change of @fukuball 2014-10-23 15:59:08 +08:00
Sun Junyi
10b86e90fb Update README.md 2014-10-21 12:53:37 +08:00
fxsjy
82bfffb6ed version update to 0.34 2014-10-20 13:35:13 +08:00
Sun Junyi
4a93f21918 Merge pull request #187 from gumblex/master
不用Trie,减少内存加快速度;优化代码细节
2014-10-19 19:43:30 +08:00
Dingyuan Wang
bb1e6000c6 fix version; fix spaces at end of line 2014-10-19 10:57:46 +08:00
Dingyuan Wang
51df77831b use prefix dict instead of trie, add a command line interface, and a few small improvements 2014-10-18 22:23:26 +08:00
fxsjy
eb98eb9248 fix performance problem of extrag_tags 2014-10-10 15:41:28 +08:00
Sun Junyi
7f965e0aa3 Merge pull request #184 from keroro520/master
fix issues 125 (https://github.com/fxsjy/jieba/issues/125)
2014-09-12 17:43:43 +08:00
keroro520
77b442fa88 fix issues (https://github.com/fxsjy/jieba/issues/125) 2014-09-12 13:42:05 +08:00
Sun Junyi
fc511de012 Merge pull request #176 from fukuball/master
更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
2014-09-01 14:11:00 +08:00
Sun Junyi
99ea59e88d Update README.md v0.33 2014-08-31 20:04:02 +08:00
fxsjy
40adb1c591 version 0.33 2014-08-31 19:26:26 +08:00
Fukuball Lin
d432789cb4 fix typo 2014-08-06 17:56:05 +08:00
Fukuball Lin
cf31a99bf6 將 Readme 中文和半形的英文、數字、符號之間插入空白
將 Readme 中文和半形的英文、數字、符號之間插入空白,增加可讀性
2014-08-06 15:53:57 +08:00
Fukuball Lin
e4d323c78b 更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
2014-08-06 15:00:07 +08:00
Sun Junyi
16d626d347 Merge pull request #174 from fukuball/master
讓 jieba 可以切換 idf 語料庫及 stop words 語料庫
2014-08-06 10:36:10 +08:00
Fukuball Lin
b658ee69cb 讓 jieba 可以自行增加 stop words 語料庫
1. 增加範例 stop words 語料庫
2. 為了讓 jieba 可以切換 stop words 語料庫,新增 set_stop_words 方法,並改寫 extract_tags
3. test 增加 extract_tags_stop_words.py 測試範例
2014-08-06 03:35:16 +08:00
Fukuball Lin
7198d562f1 讓 jieba 可以切換 idf 語料庫
1. 新增繁體中文 idf 語料庫
2. 為了讓 jieba 可以切換 iff 語料庫,新增 get_idf, set_idf_path 方法,並改寫 extract_tags
3. test 增加 extract_tags_idfpath
2014-08-05 22:55:13 +08:00
Sun Junyi
473ac1df75 Merge pull request #162 from ShuraChow/master
fix issue #161
2014-06-11 17:04:23 +08:00
ShuraChow
7583f7760a fix issue #161
posseg每次根据jieba.user_word_tag_tab的长度判断是否有新词载入,如果有,则更新word_tag_tab,然后清空jieba.user_word_tag_tab
2014-06-10 02:04:09 +08:00
Sun Junyi
2726a7c89b Merge pull request #158 from davidlihm/patch-1
Thanks
2014-05-15 10:11:03 +08:00
davidlihm
5b2ec920ed Update __init__.py 2014-05-15 07:55:11 +08:00
Sun Junyi
28621e8b00 Update README.md 2014-04-17 13:47:47 +08:00
fxsjy
2682e887b8 Merge branch 'master' of https://github.com/fxsjy/jieba 2014-03-02 17:52:52 +08:00
fxsjy
9d4ac26f16 fix the bug of issue#137 2014-03-02 17:52:19 +08:00
Sun Junyi
6942795fae Merge pull request #135 from aszxqw/patch-1
add nodejieba into README.md
2014-02-26 14:13:00 +08:00
Yanyi Wu
ccfa54530e add nodejieba into README.md
add nodejieba into README.md
2014-02-26 14:05:13 +08:00
Sun Junyi
3e430e9769 Update __init__.py v0.32 2014-02-16 20:09:57 +08:00
Sun Junyi
6946b00f14 Merge pull request #134 from Honghe/master
Fix a bug about can not import ChineseAnalyzer
2014-02-16 20:08:42 +08:00
Honghe Wu
7720fbc1d8 fix a bug about can not import ChineseAnalyzer with change tab to 4 wihte spaces under PEP8 2014-02-15 19:32:29 +08:00
fxsjy
cc708de40c version 0.32 released 2014-02-07 15:22:53 +08:00
fxsjy
dafc73425e fix a little problem of dict.txt 2014-02-07 14:35:38 +08:00
fxsjy
7cc7e70843 Merge branch 'master' of https://github.com/fxsjy/jieba 2014-01-28 13:48:35 +08:00
fxsjy
18678d50c6 fix bug issue #132 2014-01-28 13:48:03 +08:00