Fukuball Lin
|
b658ee69cb
|
讓 jieba 可以自行增加 stop words 語料庫
1. 增加範例 stop words 語料庫
2. 為了讓 jieba 可以切換 stop words 語料庫,新增 set_stop_words 方法,並改寫 extract_tags
3. test 增加 extract_tags_stop_words.py 測試範例
|
2014-08-06 03:35:16 +08:00 |
|
Fukuball Lin
|
7198d562f1
|
讓 jieba 可以切換 idf 語料庫
1. 新增繁體中文 idf 語料庫
2. 為了讓 jieba 可以切換 iff 語料庫,新增 get_idf, set_idf_path 方法,並改寫 extract_tags
3. test 增加 extract_tags_idfpath
|
2014-08-05 22:55:13 +08:00 |
|
Sun Junyi
|
473ac1df75
|
Merge pull request #162 from ShuraChow/master
fix issue #161
|
2014-06-11 17:04:23 +08:00 |
|
ShuraChow
|
7583f7760a
|
fix issue #161
posseg每次根据jieba.user_word_tag_tab的长度判断是否有新词载入,如果有,则更新word_tag_tab,然后清空jieba.user_word_tag_tab
|
2014-06-10 02:04:09 +08:00 |
|
Sun Junyi
|
2726a7c89b
|
Merge pull request #158 from davidlihm/patch-1
Thanks
|
2014-05-15 10:11:03 +08:00 |
|
davidlihm
|
5b2ec920ed
|
Update __init__.py
|
2014-05-15 07:55:11 +08:00 |
|
Sun Junyi
|
28621e8b00
|
Update README.md
|
2014-04-17 13:47:47 +08:00 |
|
fxsjy
|
2682e887b8
|
Merge branch 'master' of https://github.com/fxsjy/jieba
|
2014-03-02 17:52:52 +08:00 |
|
fxsjy
|
9d4ac26f16
|
fix the bug of issue#137
|
2014-03-02 17:52:19 +08:00 |
|
Sun Junyi
|
6942795fae
|
Merge pull request #135 from aszxqw/patch-1
add nodejieba into README.md
|
2014-02-26 14:13:00 +08:00 |
|
Yanyi Wu
|
ccfa54530e
|
add nodejieba into README.md
add nodejieba into README.md
|
2014-02-26 14:05:13 +08:00 |
|
Sun Junyi
|
3e430e9769
|
Update __init__.py
v0.32
|
2014-02-16 20:09:57 +08:00 |
|
Sun Junyi
|
6946b00f14
|
Merge pull request #134 from Honghe/master
Fix a bug about can not import ChineseAnalyzer
|
2014-02-16 20:08:42 +08:00 |
|
Honghe Wu
|
7720fbc1d8
|
fix a bug about can not import ChineseAnalyzer with change tab to 4 wihte spaces under PEP8
|
2014-02-15 19:32:29 +08:00 |
|
fxsjy
|
cc708de40c
|
version 0.32 released
|
2014-02-07 15:22:53 +08:00 |
|
fxsjy
|
dafc73425e
|
fix a little problem of dict.txt
|
2014-02-07 14:35:38 +08:00 |
|
fxsjy
|
7cc7e70843
|
Merge branch 'master' of https://github.com/fxsjy/jieba
|
2014-01-28 13:48:35 +08:00 |
|
fxsjy
|
18678d50c6
|
fix bug issue #132
|
2014-01-28 13:48:03 +08:00 |
|
Sun Junyi
|
62240c5add
|
Merge pull request #131 from aholic/master
better indent
|
2014-01-25 18:17:50 -08:00 |
|
aholic
|
e2c796088f
|
better indent
|
2014-01-24 00:43:48 +08:00 |
|
fxsjy
|
136676381a
|
fix a bug of add_word
|
2013-12-05 13:33:24 +08:00 |
|
Sun Junyi
|
e79d54b380
|
Merge pull request #114 from hermanschaaf/patch-1
Fix typo in error message
|
2013-10-23 03:41:20 -07:00 |
|
Herman Schaaf
|
95286b8887
|
Fix typo in error message
|
2013-10-21 22:21:09 +09:00 |
|
fxsjy
|
14a0ab0466
|
fix a bug in issue #111
|
2013-10-11 13:05:59 +08:00 |
|
fxsjy
|
759e1029c8
|
add an API to control log level: jieba.setLogLevel
|
2013-09-22 10:26:33 +08:00 |
|
Sun Junyi
|
2ef9dd3a70
|
Merge pull request #107 from mozillazg/logging
use logging instead of print
|
2013-09-21 18:54:34 -07:00 |
|
Mozillazg
|
1cf3f0d00b
|
use logging instead of print
|
2013-09-19 10:31:44 +08:00 |
|
Sun Junyi
|
fd96527f71
|
Merge pull request #106 from jannson/master
add better support for english for ChineseAnalyzer
|
2013-09-16 23:58:46 -07:00 |
|
Sun Junyi
|
6a66620088
|
Update README.md
|
2013-09-14 22:32:45 +08:00 |
|
Sun Junyi
|
00bc72c877
|
Update README.md
|
2013-09-14 22:31:38 +08:00 |
|
gan
|
31d5845535
|
add better support for english. like input: 'this is interesting and interested me'-->output:'this interest interest',which 'interest' match 'interesting interested'
|
2013-09-09 11:54:30 +08:00 |
|
Sun Junyi
|
7e7fcc1184
|
add an option to disable HMM
|
2013-09-05 17:09:27 +08:00 |
|
fxsjy
|
21f7da0ca4
|
conver tab to spaces
|
2013-08-30 18:31:25 +08:00 |
|
fxsjy
|
c5bd9773d1
|
fix bug in issue #103
|
2013-08-30 18:26:53 +08:00 |
|
Sun Junyi
|
66e334229b
|
Merge pull request #99 from aszxqw/branch1
sed -i 's/not \(.*\) in/\1 not in/g' ...
|
2013-08-20 18:33:39 -07:00 |
|
gwdwyy
|
cc81135429
|
sed -i 's/not \(.*\) in/\1 not in/g' ...
|
2013-08-20 20:08:03 +08:00 |
|
Sun Junyi
|
efebf5371c
|
Merge branch 'master' of https://github.com/fxsjy/jieba
|
2013-08-09 13:59:38 +08:00 |
|
Sun Junyi
|
90ab511deb
|
fix the bug about issue: #92
|
2013-08-09 13:59:02 +08:00 |
|
Sun Junyi
|
92c6c3d9cd
|
Update README.md
|
2013-08-06 13:26:53 +08:00 |
|
Sun Junyi
|
0bb2ddcc1b
|
Update README.md
|
2013-08-06 11:05:19 +08:00 |
|
Sun Junyi
|
81390a2d23
|
test_file.py: close the file object
|
2013-08-02 15:51:33 +08:00 |
|
Sun Junyi
|
3667a4ab01
|
include Changelog & README.md in the distribution package
|
2013-07-29 13:19:39 +08:00 |
|
Sun Junyi
|
33089138fd
|
Merge branch 'master' of https://github.com/fxsjy/jieba
|
2013-07-29 12:48:04 +08:00 |
|
Sun Junyi
|
d0578ad99b
|
add a license file
|
2013-07-29 12:47:47 +08:00 |
|
Sun Junyi
|
d97c1d584c
|
0.31 released
pypi update
|
2013-07-29 10:31:52 +08:00 |
|
fxsjy
|
b77645b3aa
|
modify test_file.py; use less memory
|
2013-07-29 10:17:39 +08:00 |
|
fxsjy
|
ed1fa64e27
|
fix a bug. use sys.version_info.major can't be used in Python2.5
v0.31
|
2013-07-29 10:07:55 +08:00 |
|
Sun Junyi
|
0f972df0ac
|
raise exception in case of lower version
|
2013-07-29 10:01:47 +08:00 |
|
Sun Junyi
|
e68bb5a28e
|
fix a compatibility problem;python2.5 has no 'multiprocessing';
|
2013-07-29 09:57:09 +08:00 |
|
Sun Junyi
|
689e27280a
|
Merge branch 'master' of https://github.com/fxsjy/jieba
|
2013-07-29 09:49:10 +08:00 |
|