Sun Junyi
|
cf2aa88122
|
Merge pull request #195 from gumblex/master
Unify the keyword extraction interface; improve cache naming
|
2014-11-01 12:54:57 +08:00 |
|
Dingyuan Wang
|
751ff35eb5
|
improve extract_tags; unify extract_tags and textrank
|
2014-10-31 23:15:51 +08:00 |
|
Dingyuan Wang
|
e3f3dcccba
|
improve the loading and caching process
|
2014-10-31 21:56:09 +08:00 |
|
Sun Junyi
|
4cb1924d09
|
Merge pull request #193 from gumblex/jieba3k
Corresponding update for jieba3k (#192)
|
2014-10-25 15:29:49 +08:00 |
|
Sun Junyi
|
d6ef07a472
|
Merge pull request #192 from gumblex/master
Update and improve the documentation; add custom dictionary support to the command line
|
2014-10-25 15:29:26 +08:00 |
|
Dingyuan Wang
|
fd9f1f2c0e
|
update README, textrank, etc.
|
2014-10-25 14:23:37 +08:00 |
|
Dingyuan Wang
|
9d2818b440
|
fix English part of README
|
2014-10-25 13:16:30 +08:00 |
|
Dingyuan Wang
|
31b7d11809
|
improve README
|
2014-10-25 13:12:19 +08:00 |
|
Dingyuan Wang
|
a6119cc995
|
add custom dictionary to __main__; update README; slightly optimize textrank
|
2014-10-25 12:59:36 +08:00 |
|
Sun Junyi
|
0049b0c5b4
|
Merge pull request #191 from sing1ee/master
add some introduction of textrank
|
2014-10-24 22:50:36 +08:00 |
|
zhangcheng
|
138d713e98
|
add some introduction of textrank
|
2014-10-24 22:41:51 +08:00 |
|
Sun Junyi
|
4030d8ed86
|
Merge pull request #190 from sing1ee/master
add a simple implementation of textrank
|
2014-10-24 22:20:05 +08:00 |
|
zhangcheng
|
6eb9f6149c
|
add a simple implementation of textrank
|
2014-10-24 21:15:54 +08:00 |
|
Sun Junyi
|
1850bd6d37
|
Update README.md
|
2014-10-24 20:23:10 +08:00 |
|
fxsjy
|
f5ca87e088
|
merge change of @fukuball
|
2014-10-23 15:59:08 +08:00 |
|
Sun Junyi
|
10b86e90fb
|
Update README.md
|
2014-10-21 12:53:37 +08:00 |
|
fxsjy
|
ba87fcb01f
|
remove trie, use prefix set instead
|
2014-10-20 14:08:09 +08:00 |
|
fxsjy
|
82bfffb6ed
|
version update to 0.34
|
2014-10-20 13:35:13 +08:00 |
|
Sun Junyi
|
56e8336af1
|
Merge pull request #188 from gumblex/jieba3k
Drop the Trie, same as #187
|
2014-10-19 19:43:48 +08:00 |
|
Sun Junyi
|
4a93f21918
|
Merge pull request #187 from gumblex/master
Drop the Trie to reduce memory use and improve speed; clean up code details
|
2014-10-19 19:43:30 +08:00 |
|
Dingyuan Wang
|
bb1e6000c6
|
fix version; fix spaces at end of line
|
2014-10-19 10:57:46 +08:00 |
|
Dingyuan Wang
|
14671d4feb
|
fix __main__.py
|
2014-10-19 10:41:09 +08:00 |
|
Dingyuan Wang
|
b367690eeb
|
use prefix dict instead of trie, add a command line interface, and a few small improvements
|
2014-10-19 10:32:23 +08:00 |
|
Dingyuan Wang
|
51df77831b
|
use prefix dict instead of trie, add a command line interface, and a few small improvements
|
2014-10-18 22:23:26 +08:00 |
|
fxsjy
|
eb98eb9248
|
fix performance problem of extract_tags
|
2014-10-10 15:41:28 +08:00 |
|
Sun Junyi
|
7f965e0aa3
|
Merge pull request #184 from keroro520/master
fix issues 125 (https://github.com/fxsjy/jieba/issues/125)
|
2014-09-12 17:43:43 +08:00 |
|
keroro520
|
77b442fa88
|
fix issues (https://github.com/fxsjy/jieba/issues/125)
|
2014-09-12 13:42:05 +08:00 |
|
Sun Junyi
|
8f52419386
|
Merge pull request #183 from gumblex/jieba3k
Jieba3k update to v0.33
|
2014-09-09 10:52:31 +08:00 |
|
Dingyuan Wang
|
626b415152
|
fix dict.itervalues mistake
|
2014-09-07 19:21:13 +08:00 |
|
Dingyuan Wang
|
6a3f228c72
|
fix python3 stuff
|
2014-09-07 18:50:10 +08:00 |
|
Dingyuan Wang
|
b16cf0d63f
|
fix indent typo
|
2014-09-06 23:37:54 +08:00 |
|
Dingyuan Wang
|
6fad5fbb2c
|
update to v0.33
|
2014-09-06 23:28:47 +08:00 |
|
Sun Junyi
|
fc511de012
|
Merge pull request #176 from fukuball/master
Update the documentation on switching jieba's idf corpus and stop words corpus
|
2014-09-01 14:11:00 +08:00 |
|
Sun Junyi
|
99ea59e88d
|
Update README.md
v0.33
|
2014-08-31 20:04:02 +08:00 |
|
fxsjy
|
6eb43acc10
|
pip install jieba3k
|
2014-08-31 20:01:54 +08:00 |
|
fxsjy
|
40adb1c591
|
version 0.33
|
2014-08-31 19:26:26 +08:00 |
|
Fukuball Lin
|
d432789cb4
|
fix typo
|
2014-08-06 17:56:05 +08:00 |
|
Fukuball Lin
|
cf31a99bf6
|
Insert spaces between Chinese text and half-width English letters, digits, and symbols in the Readme to improve readability
|
2014-08-06 15:53:57 +08:00 |
|
Fukuball Lin
|
e4d323c78b
|
Update the documentation on switching jieba's idf corpus and stop words corpus
|
2014-08-06 15:00:07 +08:00 |
|
Sun Junyi
|
16d626d347
|
Merge pull request #174 from fukuball/master
Allow jieba to switch the idf corpus and the stop words corpus
|
2014-08-06 10:36:10 +08:00 |
|
Fukuball Lin
|
b658ee69cb
|
Allow jieba to add a custom stop words corpus
1. Add a sample stop words corpus
2. To let jieba switch the stop words corpus, add a set_stop_words method and rewrite extract_tags
3. Add the extract_tags_stop_words.py test example under test
|
2014-08-06 03:35:16 +08:00 |
|
Fukuball Lin
|
7198d562f1
|
Allow jieba to switch the idf corpus
1. Add a Traditional Chinese idf corpus
2. To let jieba switch the idf corpus, add the get_idf and set_idf_path methods and rewrite extract_tags
3. Add extract_tags_idfpath under test
|
2014-08-05 22:55:13 +08:00 |
|
Sun Junyi
|
91e5b26f5f
|
Merge pull request #165 from gumblex/jieba3k
fix the u'xxx' string.
|
2014-06-22 10:23:58 +08:00 |
|
Dingyuan Wang
|
8b07bce568
|
fix the u'xxx' string.
|
2014-06-21 23:30:06 +08:00 |
|
Sun Junyi
|
0d99ebce54
|
Merge pull request #164 from gumblex/jieba3k
Jieba3k v0.32 update
|
2014-06-15 19:14:28 +08:00 |
|
Dingyuan Wang
|
c04ccd0d12
|
Update to v0.32 according to the master branch.
|
2014-06-14 22:31:13 +08:00 |
|
Dingyuan Wang
|
81f77d7a08
|
Fix the re in enable_parallel.
|
2014-06-14 15:22:13 +08:00 |
|
Sun Junyi
|
473ac1df75
|
Merge pull request #162 from ShuraChow/master
fix issue #161
|
2014-06-11 17:04:23 +08:00 |
|
ShuraChow
|
7583f7760a
|
fix issue #161
On each call, posseg checks the length of jieba.user_word_tag_tab to determine whether new words have been loaded; if so, it updates word_tag_tab and then clears jieba.user_word_tag_tab
|
2014-06-10 02:04:09 +08:00 |
|
Sun Junyi
|
2726a7c89b
|
Merge pull request #158 from davidlihm/patch-1
Thanks
|
2014-05-15 10:11:03 +08:00 |
|