Sun Junyi
|
caae26fbfa
|
Merge pull request #231 from gumblex/master
在 FREQ 中直接储存频数
|
2015-02-09 16:50:43 +08:00 |
|
Dingyuan Wang
|
4197dfb8fa
|
store int directly in FREQ; small improvements
|
2015-02-09 16:26:00 +08:00 |
|
Dingyuan Wang
|
765fd6b7f0
|
store int directly in FREQ; small improvements
|
2015-02-09 16:14:12 +08:00 |
|
Sun Junyi
|
c95f402e2b
|
Merge pull request #214 from aszxqw/master
add iosjieba
|
2014-12-25 10:09:35 +08:00 |
|
yanyiwu
|
1d91072498
|
add iosjieba
|
2014-12-24 23:02:06 +08:00 |
|
Sun Junyi
|
852a07c4f2
|
Merge pull request #211 from gumblex/jieba3k
修复 posseg 中 pair 类 repr 返回值 (jieba3k)
|
2014-12-20 18:35:43 +08:00 |
|
Dingyuan Wang
|
7bcb128f5f
|
fix textrank divided by zero; fix posseg.pair.__repr__
|
2014-12-20 00:12:42 +08:00 |
|
Sun Junyi
|
b08c3f8ed7
|
Merge pull request #205 from lynschinzer/master
Fix divided by zero issue in case of words are not found in dict.
|
2014-12-05 20:13:51 +08:00 |
|
Lin
|
fea3aec6bd
|
Fix divided by zero issue in case of words are not found in dict.
|
2014-12-05 17:13:12 +08:00 |
|
Sun Junyi
|
8be082017a
|
Merge pull request #204 from gumblex/jieba3k
完善setup.py等对应py3k更新
|
2014-11-29 18:28:48 +08:00 |
|
Sun Junyi
|
293dbbc390
|
Merge pull request #203 from gumblex/master
修复 posseg;完善 setup.py
|
2014-11-29 18:28:23 +08:00 |
|
Dingyuan Wang
|
3dad899ec8
|
backport 2to3 scripts and changelog
|
2014-11-29 16:12:25 +08:00 |
|
Dingyuan Wang
|
c6b386f65b
|
update jieba3k
|
2014-11-29 16:06:20 +08:00 |
|
Dingyuan Wang
|
7b7c6955a9
|
complete the setup.py, fix #202 problem in posseg
|
2014-11-29 15:33:42 +08:00 |
|
Sun Junyi
|
8a2e7f0e7e
|
Merge pull request #202 from nomaka/patch-1
Update __init__.py
|
2014-11-18 16:46:59 +08:00 |
|
Nomaka
|
9cb76dd8b9
|
Update __init__.py
calc的idx参数没用
|
2014-11-18 16:00:49 +08:00 |
|
Sun Junyi
|
99748bfc17
|
Merge pull request #201 from skyerown/master
为关键字提取函数增加词性过滤功能
|
2014-11-18 10:27:52 +08:00 |
|
walkskyer
|
a336e26403
|
为函数textrank增加参数allowPOS,并修改extract_tags的参数allowPOS与textrank保持一致。
|
2014-11-15 18:36:09 +08:00 |
|
walkskyer
|
bab5f362ba
|
将exstract_tags参数allowPOS转换为frozenset以减少查找时间。
|
2014-11-15 18:14:47 +08:00 |
|
Dingyuan Wang
|
6b0da06481
|
merge from upstream
|
2014-11-15 14:06:03 +08:00 |
|
fxsjy
|
5c487dbcba
|
update verson
|
2014-11-15 13:46:27 +08:00 |
|
fxsjy
|
447c1ded8c
|
fix problem for python3.2
|
2014-11-15 13:44:30 +08:00 |
|
walkskyer
|
dd62477605
|
.gitignore中忽略pycharm项目文件
|
2014-11-15 13:33:13 +08:00 |
|
Dingyuan Wang
|
a5ecf70f71
|
update to v0.35
|
2014-11-14 20:59:54 +08:00 |
|
walkskyer
|
d82d2c18df
|
为关键字提取函数增加词性过滤功能
|
2014-11-13 22:26:22 +08:00 |
|
fxsjy
|
315a411e52
|
version update
|
2014-11-13 10:43:43 +08:00 |
|
fxsjy
|
ec68c21ea0
|
version update'
|
2014-11-13 10:27:50 +08:00 |
|
Sun Junyi
|
3eea28d6f4
|
Merge pull request #200 from skyerown/master
修复stop words处理未考虑"\r"导致不能正常匹配的问题。
|
2014-11-13 10:10:07 +08:00 |
|
walkskyer
|
5571a0337a
|
修复stop words处理未考虑"\r"导致不能正常匹配的问题。
|
2014-11-12 22:33:27 +08:00 |
|
Sun Junyi
|
40c0edfd99
|
Merge pull request #198 from gumblex/jieba3k
Jieba3k 对应更新;半自动转换脚本
|
2014-11-08 22:17:51 +08:00 |
|
Dingyuan Wang
|
4a6140081e
|
fix problems in auto2to3
|
2014-11-07 23:47:57 +08:00 |
|
Dingyuan Wang
|
7a6caa0c3c
|
port extract_tags, etc to jieba3k; add auto2to3 script
|
2014-11-07 23:33:31 +08:00 |
|
walkskyer
|
36bc9e18c6
|
Merge pull request #1 from fxsjy/master
pull
|
2014-11-07 21:35:22 +08:00 |
|
Sun Junyi
|
7ce63e53b7
|
Merge pull request #197 from skyerown/master
修复带权重测试脚本输出结果是调用顺序错误
|
2014-11-07 11:07:19 +08:00 |
|
walkskyer
|
6772f0282e
|
修复带权重测试脚本输出结果是调用顺序错误
|
2014-11-06 22:24:43 +08:00 |
|
Sun Junyi
|
a5944bb88e
|
Merge pull request #196 from qinwf/master
Add jiebaR in README
|
2014-11-04 12:29:42 +08:00 |
|
Qin Wenfeng
|
77a831b8c1
|
Add jiebaR in README
|
2014-11-04 11:59:40 +08:00 |
|
Sun Junyi
|
cf2aa88122
|
Merge pull request #195 from gumblex/master
统一获取关键词接口,优化缓存命名
|
2014-11-01 12:54:57 +08:00 |
|
Dingyuan Wang
|
751ff35eb5
|
improve extract_tags; unify extract_tags and testrank
|
2014-10-31 23:15:51 +08:00 |
|
Dingyuan Wang
|
e3f3dcccba
|
improve the loading and caching process
|
2014-10-31 21:56:09 +08:00 |
|
Sun Junyi
|
4cb1924d09
|
Merge pull request #193 from gumblex/jieba3k
jieba3k 对应更新 #192
|
2014-10-25 15:29:49 +08:00 |
|
Sun Junyi
|
d6ef07a472
|
Merge pull request #192 from gumblex/master
更新、完善说明;命令行加入自定义词典功能
|
2014-10-25 15:29:26 +08:00 |
|
Dingyuan Wang
|
fd9f1f2c0e
|
update README, textrank, etc.
|
2014-10-25 14:23:37 +08:00 |
|
Dingyuan Wang
|
9d2818b440
|
fix English part of README
|
2014-10-25 13:16:30 +08:00 |
|
Dingyuan Wang
|
31b7d11809
|
improve README
|
2014-10-25 13:12:19 +08:00 |
|
Dingyuan Wang
|
a6119cc995
|
add custom dictionary to __main__; update README; slightly optimize textrank
|
2014-10-25 12:59:36 +08:00 |
|
Sun Junyi
|
0049b0c5b4
|
Merge pull request #191 from sing1ee/master
add some introduction of textrank
|
2014-10-24 22:50:36 +08:00 |
|
zhangcheng
|
138d713e98
|
add some introduction of textrank
|
2014-10-24 22:41:51 +08:00 |
|
Sun Junyi
|
4030d8ed86
|
Merge pull request #190 from sing1ee/master
add a simple implementation of textrank
|
2014-10-24 22:20:05 +08:00 |
|
zhangcheng
|
6eb9f6149c
|
add a simple implementation of textrank
|
2014-10-24 21:15:54 +08:00 |
|