104 Commits

Author SHA1 Message Date
OOC
b485ae916c Update README.md 2017-04-04 11:45:53 +08:00
OOC
ee0ce32bbd Update 2017-04-04 11:17:44 +08:00
Yanyi Wu
76ae798137 Update README.md 2016-06-10 22:48:01 +08:00
Dingyuan Wang
87734d3785 support POS tagging in __main__ 2015-11-17 19:06:44 +08:00
Dingyuan Wang
99d0fb1a8a use regex and fix encoding related issues in load_userdict 2015-11-09 20:54:50 +08:00
andersc
343bfe9783 Add introduction to jieba.NET port. 2015-09-22 23:16:02 +08:00
Dingyuan Wang
d0e68974bf improved doc for tmp_dir and cache_file 2015-06-26 22:24:20 +08:00
Dingyuan Wang
ceb5c26be4 fix self.FREQ in cut_for_search; make pair object iterable 2015-06-01 14:36:38 +08:00
Dingyuan Wang
3b76328f2a allow ignoring word frequency while providing pos tag 2015-05-23 21:51:00 +08:00
Dingyuan Wang
94840a734c wraps most globals in classes
API changes:
* class jieba.Tokenizer, jieba.posseg.POSTokenizer
* class jieba.analyse.TFIDF, jieba.analyse.TextRank
* global functions are mapped to jieba.(posseg.)dt, the default (POS)Tokenizer
* multiprocessing only works with jieba.(posseg.)dt
* new lcut, lcut_for_search functions that returns a list
* jieba.analyse.textrank now returns 20 items by default

Tests:
* added test_lock.py to test multithread locking
* demo.py now contains most of the examples in README
2015-05-09 21:29:05 +08:00
Fukuball Lin
e712a4de61 更新 README
增加结巴分词 PHP 版本相關資訊
2015-04-28 22:05:26 +08:00
Dingyuan Wang
eeaab012bf update docs 2015-03-21 10:53:42 +08:00
Dingyuan Wang
4fa2728fb6 update README about new features 2015-03-14 12:44:49 +08:00
Dingyuan Wang
f29430f49e details in textrank; update README 2015-02-16 21:25:55 +08:00
Yanyi Wu
5d321cbccd Update README.md 2015-02-11 17:37:32 +08:00
Dingyuan Wang
22bcf8be7a Merge master and jieba3k, make the code Python 2/3 compatible 2015-02-10 20:54:55 +08:00
Dingyuan Wang
4197dfb8fa store int directly in FREQ; small improvements 2015-02-09 16:26:00 +08:00
yanyiwu
1d91072498 add iosjieba 2014-12-24 23:02:06 +08:00
Dingyuan Wang
7a6caa0c3c port extract_tags, etc to jieba3k; add auto2to3 script 2014-11-07 23:33:31 +08:00
Qin Wenfeng
77a831b8c1 Add jiebaR in README 2014-11-04 11:59:40 +08:00
Dingyuan Wang
751ff35eb5 improve extract_tags; unify extract_tags and testrank 2014-10-31 23:15:51 +08:00
Dingyuan Wang
fd9f1f2c0e update README, textrank, etc. 2014-10-25 14:23:37 +08:00
Dingyuan Wang
9d2818b440 fix English part of README 2014-10-25 13:16:30 +08:00
Dingyuan Wang
31b7d11809 improve README 2014-10-25 13:12:19 +08:00
Dingyuan Wang
a6119cc995 add custom dictionary to __main__; update README; slightly optimize textrank 2014-10-25 12:59:36 +08:00
Sun Junyi
0049b0c5b4 Merge pull request #191 from sing1ee/master
add some introduction of textrank
2014-10-24 22:50:36 +08:00
zhangcheng
138d713e98 add some introduction of textrank 2014-10-24 22:41:51 +08:00
Sun Junyi
1850bd6d37 Update README.md 2014-10-24 20:23:10 +08:00
fxsjy
f5ca87e088 merge change of @fukuball 2014-10-23 15:59:08 +08:00
Sun Junyi
10b86e90fb Update README.md 2014-10-21 12:53:37 +08:00
Dingyuan Wang
6fad5fbb2c update to v0.33 2014-09-06 23:28:47 +08:00
Sun Junyi
fc511de012 Merge pull request #176 from fukuball/master
更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
2014-09-01 14:11:00 +08:00
Sun Junyi
99ea59e88d Update README.md 2014-08-31 20:04:02 +08:00
Fukuball Lin
d432789cb4 fix typo 2014-08-06 17:56:05 +08:00
Fukuball Lin
cf31a99bf6 將 Readme 中文和半形的英文、數字、符號之間插入空白
將 Readme 中文和半形的英文、數字、符號之間插入空白,增加可讀性
2014-08-06 15:53:57 +08:00
Fukuball Lin
e4d323c78b 更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明
2014-08-06 15:00:07 +08:00
Dingyuan Wang
c04ccd0d12 Update to v0.32 according to the master branch. 2014-06-14 22:31:13 +08:00
Sun Junyi
28621e8b00 Update README.md 2014-04-17 13:47:47 +08:00
Yanyi Wu
ccfa54530e add nodejieba into README.md
add nodejieba into README.md
2014-02-26 14:05:13 +08:00
Sun Junyi
6a66620088 Update README.md 2013-09-14 22:32:45 +08:00
Sun Junyi
00bc72c877 Update README.md 2013-09-14 22:31:38 +08:00
ZoeyYoung
dce353f88b merge from master 2013-08-21 15:32:46 +08:00
Sun Junyi
92c6c3d9cd Update README.md 2013-08-06 13:26:53 +08:00
Sun Junyi
0bb2ddcc1b Update README.md 2013-08-06 11:05:19 +08:00
Sun Junyi
e1c1d46324 Update README.md 2013-07-01 12:43:33 +08:00
Sun Junyi
915b3164b0 Update README.md 2013-07-01 11:47:15 +08:00
Sun Junyi
45e6594a09 Update README.md 2013-07-01 11:46:16 +08:00
Sun Junyi
465e475460 Update README.md 2013-06-24 12:24:50 +09:00
Sun Junyi
ca97b19951 merge change from master 2013-06-23 22:28:32 +08:00
Sun Junyi
b1238a2306 Update README.md 2013-06-14 13:01:07 +09:00