433 Commits

Author SHA1 Message Date
Sun Junyi
a4fb439070 Merge pull request #241 from sing1ee/master
improve some details from other commiters' adivces
2015-02-16 20:41:06 +08:00
zhangcheng
01b7f6efcf improve some details from other commiters' adivces 2015-02-16 20:35:45 +08:00
Sun Junyi
4e05cde07e Merge pull request #240 from sing1ee/master
build stable sort for graph iteration
2015-02-16 20:28:22 +08:00
zhangcheng
8b8c6c85d0 remove unusage import 2015-02-16 15:51:05 +08:00
zhangcheng
a6d1b2479e build stable sort for graph iteration, then we can get stable result and adatpe details for python 3~ 2015-02-16 15:49:10 +08:00
zhangcheng
1152db7736 build stable sort for graph iteration, then we can get stable result. 2015-02-16 15:46:36 +08:00
fxsjy
49657c976d make extract_tags behavior compatiable with previous version 2015-02-14 21:23:58 +08:00
fxsjy
abcaf3e475 fix bug: load_userdict 2015-02-14 19:56:38 +08:00
Jack
a06b7d388e fix bug in __main__.py 2015-02-12 14:08:39 +08:00
Sun Junyi
9ca5b69907 Merge pull request #238 from gumblex/master
use str.splitlines to avoid losing line breaks
2015-02-12 13:35:52 +08:00
Dingyuan Wang
f2b7183a71 use str.splitlines to avoid losing line breaks 2015-02-12 12:39:14 +08:00
Sun Junyi
b14eb329e3 Merge pull request #237 from gumblex/master
直接将前缀储存在词频字典里
2015-02-12 11:27:25 +08:00
Dingyuan Wang
872a7039f2 Merge branch 'master' of https://github.com/fxsjy/jieba 2015-02-12 10:33:56 +08:00
Dingyuan Wang
f808ea0ebb use only one dict to store words and prefixes 2015-02-12 10:31:52 +08:00
fxsjy
4d7b515801 Merge branch 'master' of https://github.com/fxsjy/jieba 2015-02-11 20:57:35 +08:00
fxsjy
5bfa43a781 fix test scripts 2015-02-11 20:46:48 +08:00
Dingyuan Wang
f3a53dd2da fix print() in tests 2015-02-11 20:45:55 +08:00
Sun Junyi
a229041e58 Merge pull request #234 from yanyiwu/patch-2
Update README.md
2015-02-11 18:48:47 +08:00
Yanyi Wu
5d321cbccd Update README.md 2015-02-11 17:37:32 +08:00
fxsjy
8cbb26a7b6 fix test_file.py 2015-02-11 16:47:57 +08:00
Sun Junyi
41b47b0593 Merge pull request #233 from gumblex/master
合并 jieba3k,兼容 Python 2/3
2015-02-11 15:44:22 +08:00
Dingyuan Wang
32a0e92a09 don't compile re every time; autopep8 2015-02-10 21:22:34 +08:00
Dingyuan Wang
22bcf8be7a Merge master and jieba3k, make the code Python 2/3 compatible 2015-02-10 20:54:55 +08:00
Sun Junyi
caae26fbfa Merge pull request #231 from gumblex/master
在 FREQ 中直接储存频数
2015-02-09 16:50:43 +08:00
Dingyuan Wang
4197dfb8fa store int directly in FREQ; small improvements 2015-02-09 16:26:00 +08:00
Dingyuan Wang
765fd6b7f0 store int directly in FREQ; small improvements 2015-02-09 16:14:12 +08:00
Sun Junyi
c95f402e2b Merge pull request #214 from aszxqw/master
add iosjieba
2014-12-25 10:09:35 +08:00
yanyiwu
1d91072498 add iosjieba 2014-12-24 23:02:06 +08:00
Sun Junyi
852a07c4f2 Merge pull request #211 from gumblex/jieba3k
修复 posseg 中 pair 类 repr 返回值 (jieba3k)
2014-12-20 18:35:43 +08:00
Dingyuan Wang
7bcb128f5f fix textrank divided by zero; fix posseg.pair.__repr__ 2014-12-20 00:12:42 +08:00
Sun Junyi
b08c3f8ed7 Merge pull request #205 from lynschinzer/master
Fix divided by zero issue in case of words are not found in dict.
2014-12-05 20:13:51 +08:00
Lin
fea3aec6bd Fix divided by zero issue in case of words are not found in dict. 2014-12-05 17:13:12 +08:00
Sun Junyi
8be082017a Merge pull request #204 from gumblex/jieba3k
完善setup.py等对应py3k更新
2014-11-29 18:28:48 +08:00
Sun Junyi
293dbbc390 Merge pull request #203 from gumblex/master
修复 posseg;完善 setup.py
2014-11-29 18:28:23 +08:00
Dingyuan Wang
3dad899ec8 backport 2to3 scripts and changelog 2014-11-29 16:12:25 +08:00
Dingyuan Wang
c6b386f65b update jieba3k 2014-11-29 16:06:20 +08:00
Dingyuan Wang
7b7c6955a9 complete the setup.py, fix #202 problem in posseg 2014-11-29 15:33:42 +08:00
Sun Junyi
8a2e7f0e7e Merge pull request #202 from nomaka/patch-1
Update __init__.py
2014-11-18 16:46:59 +08:00
Nomaka
9cb76dd8b9 Update __init__.py
calc的idx参数没用
2014-11-18 16:00:49 +08:00
Sun Junyi
99748bfc17 Merge pull request #201 from skyerown/master
为关键字提取函数增加词性过滤功能
2014-11-18 10:27:52 +08:00
walkskyer
a336e26403 为函数textrank增加参数allowPOS,并修改extract_tags的参数allowPOS与textrank保持一致。 2014-11-15 18:36:09 +08:00
walkskyer
bab5f362ba 将exstract_tags参数allowPOS转换为frozenset以减少查找时间。 2014-11-15 18:14:47 +08:00
Dingyuan Wang
6b0da06481 merge from upstream 2014-11-15 14:06:03 +08:00
fxsjy
5c487dbcba update verson 2014-11-15 13:46:27 +08:00
fxsjy
447c1ded8c fix problem for python3.2 2014-11-15 13:44:30 +08:00
walkskyer
dd62477605 .gitignore中忽略pycharm项目文件 2014-11-15 13:33:13 +08:00
Dingyuan Wang
a5ecf70f71 update to v0.35 2014-11-14 20:59:54 +08:00
walkskyer
d82d2c18df 为关键字提取函数增加词性过滤功能 2014-11-13 22:26:22 +08:00
fxsjy
315a411e52 version update 2014-11-13 10:43:43 +08:00
fxsjy
ec68c21ea0 version update' 2014-11-13 10:27:50 +08:00