Neutrino
67fa2e36e7
Update README.md: update paddle link. ( #817 )
2020-02-15 16:33:35 +08:00
fxsjy
eb37e048da
update version to 0.41
2020-01-08 16:04:30 +08:00
JesseyXujin
381b0691ac
Add enable_paddle interface to install paddle and import packages ( #802 )
...
* enable_paddle_interface
* Add enable_paddle interface to install paddle and import packages
* add posseg lcut for paddle mode
* fix vocabulary
2020-01-08 15:26:12 +08:00
JesseyXujin
0b74b6c2de
add jieba upgrade note in README.md and change import imp to import importlib in _compat.py ( #794 )
2019-12-31 14:14:50 +08:00
Sun Junyi
2fdee89883
Update README.md
2019-12-30 17:11:22 +08:00
Sun Junyi
9dc8e6d992
Update README.md
2019-12-24 19:29:17 +08:00
JesseyXujin
5b3bb4b7f2
Add paddle word segmentation and POS tagging ( #788 )
...
* paddle cut release
* Update README.md to prompt users to install paddlepaddle.tiny
* Remove the utf header from the two init.py files
* Revise README details
2019-12-24 17:27:41 +08:00
Paul Meng
3645a5bb5d
Update README.md ( #745 )
2019-07-01 19:41:47 +08:00
Sun Junyi
8212b6c572
Update README.md
2018-12-03 16:29:32 +08:00
Sun Junyi
7653db2e33
Update README.md
2018-07-04 17:18:02 +08:00
OOC
b485ae916c
Update README.md
2017-04-04 11:45:53 +08:00
OOC
ee0ce32bbd
Update
2017-04-04 11:17:44 +08:00
Yanyi Wu
76ae798137
Update README.md
2016-06-10 22:48:01 +08:00
Dingyuan Wang
87734d3785
support POS tagging in __main__
2015-11-17 19:06:44 +08:00
Dingyuan Wang
99d0fb1a8a
use regex and fix encoding related issues in load_userdict
2015-11-09 20:54:50 +08:00
andersc
343bfe9783
Add introduction to jieba.NET port.
2015-09-22 23:16:02 +08:00
Dingyuan Wang
d0e68974bf
improved doc for tmp_dir and cache_file
2015-06-26 22:24:20 +08:00
Dingyuan Wang
ceb5c26be4
fix self.FREQ in cut_for_search; make pair object iterable
2015-06-01 14:36:38 +08:00
Dingyuan Wang
3b76328f2a
allow ignoring word frequency while providing pos tag
2015-05-23 21:51:00 +08:00
Dingyuan Wang
94840a734c
wraps most globals in classes
...
API changes:
* class jieba.Tokenizer, jieba.posseg.POSTokenizer
* class jieba.analyse.TFIDF, jieba.analyse.TextRank
* global functions are mapped to jieba.(posseg.)dt, the default (POS)Tokenizer
* multiprocessing only works with jieba.(posseg.)dt
* new lcut, lcut_for_search functions that return a list
* jieba.analyse.textrank now returns 20 items by default
Tests:
* added test_lock.py to test multithread locking
* demo.py now contains most of the examples in README
2015-05-09 21:29:05 +08:00
Fukuball Lin
e712a4de61
Update README
...
Add information about the PHP version of jieba word segmentation
2015-04-28 22:05:26 +08:00
Dingyuan Wang
eeaab012bf
update docs
2015-03-21 10:53:42 +08:00
Dingyuan Wang
4fa2728fb6
update README about new features
2015-03-14 12:44:49 +08:00
Dingyuan Wang
f29430f49e
details in textrank; update README
2015-02-16 21:25:55 +08:00
Yanyi Wu
5d321cbccd
Update README.md
2015-02-11 17:37:32 +08:00
Dingyuan Wang
22bcf8be7a
Merge master and jieba3k, make the code Python 2/3 compatible
2015-02-10 20:54:55 +08:00
Dingyuan Wang
4197dfb8fa
store int directly in FREQ; small improvements
2015-02-09 16:26:00 +08:00
yanyiwu
1d91072498
add iosjieba
2014-12-24 23:02:06 +08:00
Dingyuan Wang
7a6caa0c3c
port extract_tags, etc to jieba3k; add auto2to3 script
2014-11-07 23:33:31 +08:00
Qin Wenfeng
77a831b8c1
Add jiebaR in README
2014-11-04 11:59:40 +08:00
Dingyuan Wang
751ff35eb5
improve extract_tags; unify extract_tags and textrank
2014-10-31 23:15:51 +08:00
Dingyuan Wang
fd9f1f2c0e
update README, textrank, etc.
2014-10-25 14:23:37 +08:00
Dingyuan Wang
9d2818b440
fix English part of README
2014-10-25 13:16:30 +08:00
Dingyuan Wang
31b7d11809
improve README
2014-10-25 13:12:19 +08:00
Dingyuan Wang
a6119cc995
add custom dictionary to __main__; update README; slightly optimize textrank
2014-10-25 12:59:36 +08:00
Sun Junyi
0049b0c5b4
Merge pull request #191 from sing1ee/master
...
add some introduction of textrank
2014-10-24 22:50:36 +08:00
zhangcheng
138d713e98
add some introduction of textrank
2014-10-24 22:41:51 +08:00
Sun Junyi
1850bd6d37
Update README.md
2014-10-24 20:23:10 +08:00
fxsjy
f5ca87e088
merge change of @fukuball
2014-10-23 15:59:08 +08:00
Sun Junyi
10b86e90fb
Update README.md
2014-10-21 12:53:37 +08:00
Dingyuan Wang
6fad5fbb2c
update to v0.33
2014-09-06 23:28:47 +08:00
Sun Junyi
fc511de012
Merge pull request #176 from fukuball/master
...
Update the documentation on how jieba can switch the idf corpus and the stop words corpus
2014-09-01 14:11:00 +08:00
Sun Junyi
99ea59e88d
Update README.md
2014-08-31 20:04:02 +08:00
Fukuball Lin
d432789cb4
fix typo
2014-08-06 17:56:05 +08:00
Fukuball Lin
cf31a99bf6
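Insert spaces between Chinese text and half-width English letters, digits, and symbols in the Readme
...
Insert spaces between Chinese text and half-width English letters, digits, and symbols in the Readme to improve readability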
2014-08-06 15:53:57 +08:00
Fukuball Lin
e4d323c78b
Update the documentation on how jieba can switch the idf corpus and the stop words corpus
...
Update the documentation on how jieba can switch the idf corpus and the stop words corpus
2014-08-06 15:00:07 +08:00
Dingyuan Wang
c04ccd0d12
Update to v0.32 according to the master branch.
2014-06-14 22:31:13 +08:00
Sun Junyi
28621e8b00
Update README.md
2014-04-17 13:47:47 +08:00
Yanyi Wu
ccfa54530e
add nodejieba into README.md
...
add nodejieba into README.md
2014-02-26 14:05:13 +08:00
Sun Junyi
6a66620088
Update README.md
2013-09-14 22:32:45 +08:00