292 Commits

Author SHA1 Message Date
Fukuball Lin
b658ee69cb 讓 jieba 可以自行增加 stop words 語料庫
1. 增加範例 stop words 語料庫
2. 為了讓 jieba 可以切換 stop words 語料庫,新增 set_stop_words 方法,並改寫 extract_tags
3. test 增加 extract_tags_stop_words.py 測試範例
2014-08-06 03:35:16 +08:00
Fukuball Lin
7198d562f1 讓 jieba 可以切換 idf 語料庫
1. 新增繁體中文 idf 語料庫
2. 為了讓 jieba 可以切換 iff 語料庫,新增 get_idf, set_idf_path 方法,並改寫 extract_tags
3. test 增加 extract_tags_idfpath
2014-08-05 22:55:13 +08:00
Sun Junyi
473ac1df75 Merge pull request #162 from ShuraChow/master
fix issue #161
2014-06-11 17:04:23 +08:00
ShuraChow
7583f7760a fix issue #161
posseg每次根据jieba.user_word_tag_tab的长度判断是否有新词载入,如果有,则更新word_tag_tab,然后清空jieba.user_word_tag_tab
2014-06-10 02:04:09 +08:00
Sun Junyi
2726a7c89b Merge pull request #158 from davidlihm/patch-1
Thanks
2014-05-15 10:11:03 +08:00
davidlihm
5b2ec920ed Update __init__.py 2014-05-15 07:55:11 +08:00
Sun Junyi
28621e8b00 Update README.md 2014-04-17 13:47:47 +08:00
fxsjy
2682e887b8 Merge branch 'master' of https://github.com/fxsjy/jieba 2014-03-02 17:52:52 +08:00
fxsjy
9d4ac26f16 fix the bug of issue#137 2014-03-02 17:52:19 +08:00
Sun Junyi
6942795fae Merge pull request #135 from aszxqw/patch-1
add nodejieba into README.md
2014-02-26 14:13:00 +08:00
Yanyi Wu
ccfa54530e add nodejieba into README.md
add nodejieba into README.md
2014-02-26 14:05:13 +08:00
Sun Junyi
3e430e9769 Update __init__.py v0.32 2014-02-16 20:09:57 +08:00
Sun Junyi
6946b00f14 Merge pull request #134 from Honghe/master
Fix a bug about can not import ChineseAnalyzer
2014-02-16 20:08:42 +08:00
Honghe Wu
7720fbc1d8 fix a bug about can not import ChineseAnalyzer with change tab to 4 wihte spaces under PEP8 2014-02-15 19:32:29 +08:00
fxsjy
cc708de40c version 0.32 released 2014-02-07 15:22:53 +08:00
fxsjy
dafc73425e fix a little problem of dict.txt 2014-02-07 14:35:38 +08:00
fxsjy
7cc7e70843 Merge branch 'master' of https://github.com/fxsjy/jieba 2014-01-28 13:48:35 +08:00
fxsjy
18678d50c6 fix bug issue #132 2014-01-28 13:48:03 +08:00
Sun Junyi
62240c5add Merge pull request #131 from aholic/master
better indent
2014-01-25 18:17:50 -08:00
aholic
e2c796088f better indent 2014-01-24 00:43:48 +08:00
fxsjy
136676381a fix a bug of add_word 2013-12-05 13:33:24 +08:00
Sun Junyi
e79d54b380 Merge pull request #114 from hermanschaaf/patch-1
Fix typo in error message
2013-10-23 03:41:20 -07:00
Herman Schaaf
95286b8887 Fix typo in error message 2013-10-21 22:21:09 +09:00
fxsjy
14a0ab0466 fix a bug in issue #111 2013-10-11 13:05:59 +08:00
fxsjy
759e1029c8 add an API to control log level: jieba.setLogLevel 2013-09-22 10:26:33 +08:00
Sun Junyi
2ef9dd3a70 Merge pull request #107 from mozillazg/logging
use logging instead of print
2013-09-21 18:54:34 -07:00
Mozillazg
1cf3f0d00b use logging instead of print 2013-09-19 10:31:44 +08:00
Sun Junyi
fd96527f71 Merge pull request #106 from jannson/master
add better support for english for ChineseAnalyzer
2013-09-16 23:58:46 -07:00
Sun Junyi
6a66620088 Update README.md 2013-09-14 22:32:45 +08:00
Sun Junyi
00bc72c877 Update README.md 2013-09-14 22:31:38 +08:00
gan
31d5845535 add better support for english. like input: 'this is interesting and interested me'-->output:'this interest interest',which 'interest' match 'interesting interested' 2013-09-09 11:54:30 +08:00
Sun Junyi
7e7fcc1184 add an option to disable HMM 2013-09-05 17:09:27 +08:00
fxsjy
21f7da0ca4 conver tab to spaces 2013-08-30 18:31:25 +08:00
fxsjy
c5bd9773d1 fix bug in issue #103 2013-08-30 18:26:53 +08:00
Sun Junyi
66e334229b Merge pull request #99 from aszxqw/branch1
sed -i 's/not \(.*\) in/\1 not in/g' ...
2013-08-20 18:33:39 -07:00
gwdwyy
cc81135429 sed -i 's/not \(.*\) in/\1 not in/g' ... 2013-08-20 20:08:03 +08:00
Sun Junyi
efebf5371c Merge branch 'master' of https://github.com/fxsjy/jieba 2013-08-09 13:59:38 +08:00
Sun Junyi
90ab511deb fix the bug about issue: #92 2013-08-09 13:59:02 +08:00
Sun Junyi
92c6c3d9cd Update README.md 2013-08-06 13:26:53 +08:00
Sun Junyi
0bb2ddcc1b Update README.md 2013-08-06 11:05:19 +08:00
Sun Junyi
81390a2d23 test_file.py: close the file object 2013-08-02 15:51:33 +08:00
Sun Junyi
3667a4ab01 include Changelog & README.md in the distribution package 2013-07-29 13:19:39 +08:00
Sun Junyi
33089138fd Merge branch 'master' of https://github.com/fxsjy/jieba 2013-07-29 12:48:04 +08:00
Sun Junyi
d0578ad99b add a license file 2013-07-29 12:47:47 +08:00
Sun Junyi
d97c1d584c 0.31 released
pypi update
2013-07-29 10:31:52 +08:00
fxsjy
b77645b3aa modify test_file.py; use less memory 2013-07-29 10:17:39 +08:00
fxsjy
ed1fa64e27 fix a bug. use sys.version_info.major can't be used in Python2.5 v0.31 2013-07-29 10:07:55 +08:00
Sun Junyi
0f972df0ac raise exception in case of lower version 2013-07-29 10:01:47 +08:00
Sun Junyi
e68bb5a28e fix a compatibility problem;python2.5 has no 'multiprocessing'; 2013-07-29 09:57:09 +08:00
Sun Junyi
689e27280a Merge branch 'master' of https://github.com/fxsjy/jieba 2013-07-29 09:49:10 +08:00