279 Commits

Author SHA1 Message Date
Richard Wong
99d2492d67 Add re.U flag to re variable. 2013-07-10 16:22:17 +08:00
Richard Wong
fbfaac2eaa Reindent function
* jieba/__init__.py (require_initialized):
2013-07-08 13:54:36 +08:00
Richard Wong
7bfd432fc5 Remove the unused imports. 2013-07-08 13:51:39 +08:00
Sun Junyi
9d0ea771a5 fix bug; decimals & digit-english mixed 2013-07-05 16:16:49 +08:00
Sun Junyi
ba5114dc95 update whoosh example 2013-07-04 09:31:09 +08:00
Sun Junyi
4b237f79fa add test/tmp/* into git ignore 2013-07-03 17:56:15 +08:00
Sun Junyi
f424862222 clean the files in tmp 2013-07-03 17:55:01 +08:00
Sun Junyi
b18d56d2a3 Merge pull request #72 from linkerlin/master
添加一个tmp目录,好让test_whoosh.py可以运行。
2013-07-03 02:52:46 -07:00
miao.lin
becd32b178 made test_whoosh.py happy.
添加一个tmp目录,好让test_whoosh.py可以运行。
2013-07-03 17:32:35 +08:00
Sun Junyi
9ea14a8a54 merge chage from chao78787 2013-07-03 17:07:16 +08:00
Sun Junyi
45daf561c7 follow PEP8: change tab to 4 white spaces 2013-07-03 16:58:22 +08:00
Sun Junyi
632a086035 Merge pull request #71 from chao787/feature_richard
Separate cal and IO process.
2013-07-03 01:57:34 -07:00
Richard Wong
3246236133 Separate cal and IO process. 2013-07-03 15:03:45 +08:00
Sun Junyi
e1c1d46324 Update README.md 2013-07-01 12:43:33 +08:00
Sun Junyi
915b3164b0 Update README.md 2013-07-01 11:47:15 +08:00
Sun Junyi
45e6594a09 Update README.md 2013-07-01 11:46:16 +08:00
Sun Junyi
0886875af3 0.3 released v0.3 2013-07-01 11:36:16 +08:00
Sun Junyi
dbec3ad9df add some comments 2013-07-01 11:20:56 +08:00
Sun Junyi
efc784312c add ChineseAnalyzer for whoosh search engine 2013-07-01 10:53:39 +08:00
Sun Junyi
f08690a2df add 'search mode' for jieba.tokenize 2013-06-28 12:04:16 +08:00
Sun Junyi
237dc6625e add mix words to extra_dict/dict.txt.big 2013-06-26 09:36:41 +08:00
Sun Junyi
cb1b0499f7 unittest for jieba.tokenize 2013-06-24 16:20:04 +08:00
Sun Junyi
11a3b10755 new method: jieba.tokenize 2013-06-24 16:14:11 +08:00
Sun Junyi
8eab1cdb6d Merge branch 'master' of https://github.com/fxsjy/jieba 2013-06-24 13:48:30 +08:00
Sun Junyi
1a3be67691 make cache dumping more robust 2013-06-24 13:48:16 +08:00
Sun Junyi
465e475460 Update README.md 2013-06-24 12:24:50 +09:00
Sun Junyi
38b6bcd54e remove some words 2013-06-23 21:52:22 +08:00
fxsjy
e1afafe353 fix a bug of cxfree support 2013-06-23 12:50:28 +08:00
fxsjy
a9f53e9c85 don't seprate CRLF 2013-06-22 21:56:39 +08:00
fxsjy
c015f4e297 support cxfree py2exe; keep white space 2013-06-22 21:24:45 +08:00
fxsjy
7343679ba8 fix a bug in parallel mode 2013-06-21 15:09:27 +08:00
Sun Junyi
c0816b9bb0 more mixed words 2013-06-18 18:09:55 +08:00
Sun Junyi
c9e8da9e63 add more mix words to dict.txt 2013-06-18 14:10:36 +08:00
Sun Junyi
322e8e48b6 Update Changelog 2013-06-17 10:31:31 +09:00
Sun Junyi
1d06f124d6 Update Changelog 2013-06-17 09:31:09 +08:00
Sun Junyi
dbfd0e0f63 minor version v0.29.1 2013-06-17 09:24:10 +08:00
Sun Junyi
cfcfb26792 Merge branch 'master' of https://github.com/fxsjy/jieba 2013-06-16 13:22:02 +08:00
Sun Junyi
9d1e23ce6f speed up the viterbi 2013-06-16 13:21:43 +08:00
Sun Junyi
b1238a2306 Update README.md 2013-06-14 13:01:07 +09:00
Sun Junyi
02e9a0328d Update README.md 2013-06-14 09:06:15 +08:00
Sun Junyi
b050bfe946 remove some useless words 2013-06-08 15:40:01 +08:00
fxsjy
69e584677a Merge branch 'master' of https://github.com/fxsjy/jieba 2013-06-08 10:48:11 +08:00
Sun Junyi
7993a3ea73 version 0.29 v0.29 2013-06-07 18:23:19 +08:00
fxsjy
bdfaaa4eea Merge branch 'master' of https://code.csdn.net/fxsjy/jieba 2013-06-07 18:11:58 +08:00
fxsjy
1febdf847f clear 2013-06-07 18:11:11 +08:00
fxsjy
ffea881a46 second commit 2013-06-07 18:03:21 +08:00
979a9177ae first commit 2013-06-07 17:47:16 +08:00
fxsjy
e12e176d17 rollback, seems no abvious speed up by the previous change 2013-06-07 15:51:48 +08:00
fxsjy
d3531f197d rollback, seems no abvious speed up by the previous change 2013-06-07 15:51:13 +08:00
fxsjy
f2d6abf063 speed up of viterbi 2013-06-07 14:41:55 +08:00