495 Commits

Author SHA1 Message Date
Nomaka
9cb76dd8b9 Update __init__.py
calc的idx参数没用
2014-11-18 16:00:49 +08:00
Sun Junyi
99748bfc17 Merge pull request #201 from skyerown/master
为关键字提取函数增加词性过滤功能
2014-11-18 10:27:52 +08:00
walkskyer
a336e26403 为函数textrank增加参数allowPOS,并修改extract_tags的参数allowPOS与textrank保持一致。 2014-11-15 18:36:09 +08:00
walkskyer
bab5f362ba 将exstract_tags参数allowPOS转换为frozenset以减少查找时间。 2014-11-15 18:14:47 +08:00
Dingyuan Wang
6b0da06481 merge from upstream 2014-11-15 14:06:03 +08:00
fxsjy
5c487dbcba update verson 2014-11-15 13:46:27 +08:00
fxsjy
447c1ded8c fix problem for python3.2 2014-11-15 13:44:30 +08:00
walkskyer
dd62477605 .gitignore中忽略pycharm项目文件 2014-11-15 13:33:13 +08:00
Dingyuan Wang
a5ecf70f71 update to v0.35 2014-11-14 20:59:54 +08:00
walkskyer
d82d2c18df 为关键字提取函数增加词性过滤功能 2014-11-13 22:26:22 +08:00
fxsjy
315a411e52 version update 2014-11-13 10:43:43 +08:00
fxsjy
ec68c21ea0 version update' 2014-11-13 10:27:50 +08:00
Sun Junyi
3eea28d6f4 Merge pull request #200 from skyerown/master
修复stop words处理未考虑"\r"导致不能正常匹配的问题。
2014-11-13 10:10:07 +08:00
walkskyer
5571a0337a 修复stop words处理未考虑"\r"导致不能正常匹配的问题。 2014-11-12 22:33:27 +08:00
Sun Junyi
40c0edfd99 Merge pull request #198 from gumblex/jieba3k
Jieba3k 对应更新;半自动转换脚本
2014-11-08 22:17:51 +08:00
Dingyuan Wang
4a6140081e fix problems in auto2to3 2014-11-07 23:47:57 +08:00
Dingyuan Wang
7a6caa0c3c port extract_tags, etc to jieba3k; add auto2to3 script 2014-11-07 23:33:31 +08:00
walkskyer
36bc9e18c6 Merge pull request #1 from fxsjy/master
pull
2014-11-07 21:35:22 +08:00
Sun Junyi
7ce63e53b7 Merge pull request #197 from skyerown/master
修复带权重测试脚本输出结果是调用顺序错误
2014-11-07 11:07:19 +08:00
walkskyer
6772f0282e 修复带权重测试脚本输出结果是调用顺序错误 2014-11-06 22:24:43 +08:00
Sun Junyi
a5944bb88e Merge pull request #196 from qinwf/master
Add jiebaR in README
2014-11-04 12:29:42 +08:00
Qin Wenfeng
77a831b8c1 Add jiebaR in README 2014-11-04 11:59:40 +08:00
Sun Junyi
cf2aa88122 Merge pull request #195 from gumblex/master
统一获取关键词接口,优化缓存命名
2014-11-01 12:54:57 +08:00
Dingyuan Wang
751ff35eb5 improve extract_tags; unify extract_tags and testrank 2014-10-31 23:15:51 +08:00
Dingyuan Wang
e3f3dcccba improve the loading and caching process 2014-10-31 21:56:09 +08:00
Sun Junyi
4cb1924d09 Merge pull request #193 from gumblex/jieba3k
jieba3k 对应更新 #192
2014-10-25 15:29:49 +08:00
Sun Junyi
d6ef07a472 Merge pull request #192 from gumblex/master
更新、完善说明;命令行加入自定义词典功能
2014-10-25 15:29:26 +08:00
Dingyuan Wang
fd9f1f2c0e update README, textrank, etc. 2014-10-25 14:23:37 +08:00
Dingyuan Wang
9d2818b440 fix English part of README 2014-10-25 13:16:30 +08:00
Dingyuan Wang
31b7d11809 improve README 2014-10-25 13:12:19 +08:00
Dingyuan Wang
a6119cc995 add custom dictionary to __main__; update README; slightly optimize textrank 2014-10-25 12:59:36 +08:00
Sun Junyi
0049b0c5b4 Merge pull request #191 from sing1ee/master
add some introduction of textrank
2014-10-24 22:50:36 +08:00
zhangcheng
138d713e98 add some introduction of textrank 2014-10-24 22:41:51 +08:00
Sun Junyi
4030d8ed86 Merge pull request #190 from sing1ee/master
add a simple implementation of textrank
2014-10-24 22:20:05 +08:00
zhangcheng
6eb9f6149c add a simple implementation of textrank 2014-10-24 21:15:54 +08:00
Sun Junyi
1850bd6d37 Update README.md 2014-10-24 20:23:10 +08:00
fxsjy
f5ca87e088 merge change of @fukuball 2014-10-23 15:59:08 +08:00
Sun Junyi
10b86e90fb Update README.md 2014-10-21 12:53:37 +08:00
fxsjy
ba87fcb01f remove trie, use prefix set instead 2014-10-20 14:08:09 +08:00
fxsjy
82bfffb6ed version update to 0.34 2014-10-20 13:35:13 +08:00
Sun Junyi
56e8336af1 Merge pull request #188 from gumblex/jieba3k
不用Trie,同#187
2014-10-19 19:43:48 +08:00
Sun Junyi
4a93f21918 Merge pull request #187 from gumblex/master
不用Trie,减少内存加快速度;优化代码细节
2014-10-19 19:43:30 +08:00
Dingyuan Wang
bb1e6000c6 fix version; fix spaces at end of line 2014-10-19 10:57:46 +08:00
Dingyuan Wang
14671d4feb fix __main__.py 2014-10-19 10:41:09 +08:00
Dingyuan Wang
b367690eeb use prefix dict instead of trie, add a command line interface, and a few small improvements 2014-10-19 10:32:23 +08:00
Dingyuan Wang
51df77831b use prefix dict instead of trie, add a command line interface, and a few small improvements 2014-10-18 22:23:26 +08:00
fxsjy
eb98eb9248 fix performance problem of extrag_tags 2014-10-10 15:41:28 +08:00
Sun Junyi
7f965e0aa3 Merge pull request #184 from keroro520/master
fix issues 125 (https://github.com/fxsjy/jieba/issues/125)
2014-09-12 17:43:43 +08:00
keroro520
77b442fa88 fix issues (https://github.com/fxsjy/jieba/issues/125) 2014-09-12 13:42:05 +08:00
Sun Junyi
8f52419386 Merge pull request #183 from gumblex/jieba3k
Jieba3k update to v0.33
2014-09-09 10:52:31 +08:00