Dingyuan Wang
|
7a6caa0c3c
|
port extract_tags, etc to jieba3k; add auto2to3 script
|
2014-11-07 23:33:31 +08:00 |
|
fxsjy
|
ba87fcb01f
|
remove trie, use prefix set instead
|
2014-10-20 14:08:09 +08:00 |
|
Dingyuan Wang
|
b367690eeb
|
use prefix dict instead of trie, add a command line interface, and a few small improvements
|
2014-10-19 10:32:23 +08:00 |
|
Dingyuan Wang
|
626b415152
|
fix dict.itervalues mistake
|
2014-09-07 19:21:13 +08:00 |
|
Dingyuan Wang
|
6a3f228c72
|
fix python3 stuff
|
2014-09-07 18:50:10 +08:00 |
|
Dingyuan Wang
|
6fad5fbb2c
|
update to v0.33
|
2014-09-06 23:28:47 +08:00 |
|
Dingyuan Wang
|
c04ccd0d12
|
Update to v0.32 according to the master branch.
|
2014-06-14 22:31:13 +08:00 |
|
Dingyuan Wang
|
81f77d7a08
|
Fix the re in enable_parallel.
|
2014-06-14 15:22:13 +08:00 |
|
jagt
|
7f3513edb7
|
close cache file to avoid warning message.
|
2014-04-24 00:35:09 +08:00 |
|
wind
|
7488b114e7
|
use logging instead of print in init file
|
2014-03-20 13:48:33 +13:00 |
|
fxsjy
|
5e6a2c4661
|
fix a bug of add_word
|
2013-12-05 13:35:40 +08:00 |
|
ZoeyYoung
|
dce353f88b
|
merge from master
|
2013-08-21 15:32:46 +08:00 |
|
ZoeyYoung
|
2857ae45cc
|
Merge branch 'master' into jieba3k
Conflicts:
Changelog
jieba/__init__.py
jieba/finalseg/__init__.py
jieba/posseg/__init__.py
setup.py
test/parallel/test_file.py
test/test_file.py
|
2013-08-21 13:55:21 +08:00 |
|
fxsjy
|
b77645b3aa
|
modify test_file.py; use less memory
|
2013-07-29 10:17:39 +08:00 |
|
fxsjy
|
ed1fa64e27
|
fix a bug. use sys.version_info.major can't be used in Python2.5
|
2013-07-29 10:07:55 +08:00 |
|
Sun Junyi
|
0f972df0ac
|
raise exception in case of lower version
|
2013-07-29 10:01:47 +08:00 |
|
Sun Junyi
|
e68bb5a28e
|
fix a compatibility problem;python2.5 has no 'multiprocessing';
|
2013-07-29 09:57:09 +08:00 |
|
Sun Junyi
|
689e27280a
|
Merge branch 'master' of https://github.com/fxsjy/jieba
|
2013-07-29 09:49:10 +08:00 |
|
Sun Junyi
|
9d87e798fd
|
0.31 release
|
2013-07-29 09:48:53 +08:00 |
|
Linker Lin
|
1dbc525dff
|
自动检测CPU数目,启动合适数目的进程。
|
2013-07-28 00:10:27 +08:00 |
|
Sun Junyi
|
6549deabbd
|
merge change from master
|
2013-07-16 11:06:41 +08:00 |
|
Sun Junyi
|
d63140fe5e
|
make a serial white spaces seperated
|
2013-07-10 17:27:47 +08:00 |
|
Richard Wong
|
c2ded83ead
|
Refactor: fix line indent to 4.
* jieba/__init__.py (cut):
|
2013-07-10 16:22:49 +08:00 |
|
Richard Wong
|
99d2492d67
|
Add re.U flag to re variable.
|
2013-07-10 16:22:17 +08:00 |
|
Richard Wong
|
fbfaac2eaa
|
Reindent function
* jieba/__init__.py (require_initialized):
|
2013-07-08 13:54:36 +08:00 |
|
Richard Wong
|
7bfd432fc5
|
Remove the unused imports.
|
2013-07-08 13:51:39 +08:00 |
|
Sun Junyi
|
b9b1f1a418
|
fix conflict of merging
|
2013-07-03 17:47:45 +08:00 |
|
Sun Junyi
|
b62f052927
|
PEP8
|
2013-07-03 17:21:21 +08:00 |
|
Sun Junyi
|
9ea14a8a54
|
merge chage from chao78787
|
2013-07-03 17:07:16 +08:00 |
|
Sun Junyi
|
45daf561c7
|
follow PEP8: change tab to 4 white spaces
|
2013-07-03 16:58:22 +08:00 |
|
Richard Wong
|
3246236133
|
Separate cal and IO process.
|
2013-07-03 15:03:45 +08:00 |
|
Sun Junyi
|
f08690a2df
|
add 'search mode' for jieba.tokenize
|
2013-06-28 12:04:16 +08:00 |
|
Sun Junyi
|
11a3b10755
|
new method: jieba.tokenize
|
2013-06-24 16:14:11 +08:00 |
|
Sun Junyi
|
1a3be67691
|
make cache dumping more robust
|
2013-06-24 13:48:16 +08:00 |
|
Sun Junyi
|
ca97b19951
|
merge change from master
|
2013-06-23 22:28:32 +08:00 |
|
fxsjy
|
e1afafe353
|
fix a bug of cxfree support
|
2013-06-23 12:50:28 +08:00 |
|
fxsjy
|
a9f53e9c85
|
don't seprate CRLF
|
2013-06-22 21:56:39 +08:00 |
|
fxsjy
|
c015f4e297
|
support cxfree py2exe; keep white space
|
2013-06-22 21:24:45 +08:00 |
|
fxsjy
|
7343679ba8
|
fix a bug in parallel mode
|
2013-06-21 15:09:27 +08:00 |
|
fxsjy
|
be1686654d
|
merge master to jieba3k
|
2013-06-08 11:18:56 +08:00 |
|
cloudaice
|
dfc807e65b
|
Don't lose nformation about a function when using a decorator
|
2013-05-23 00:25:45 +02:00 |
|
Sun Junyi
|
a8f902545c
|
fix some bad cases
|
2013-05-15 18:21:08 +08:00 |
|
cloudaice
|
9b0f60df93
|
Catch明确的错误
|
2013-05-10 11:26:27 +02:00 |
|
cloudaice
|
8ba8735f46
|
使用更明确的表达
|
2013-05-10 11:09:41 +02:00 |
|
Sun Junyi
|
ff4ea5d882
|
fix a bug of file leak
|
2013-05-02 11:24:22 +08:00 |
|
Sun Junyi
|
35aa38ed12
|
fix a bug caused by default argument binding
|
2013-04-28 12:04:16 +08:00 |
|
fxsjy
|
aae91b6fb6
|
merge change from master to jieba3k
|
2013-04-27 16:04:16 +08:00 |
|
Sun Junyi
|
94d455b079
|
hot fix of cut_all=True
|
2013-04-27 10:23:01 +08:00 |
|
Sun Junyi
|
59d5d3b811
|
fix bug and change version
|
2013-04-27 09:45:39 +08:00 |
|
fxsjy
|
c8df565981
|
more log trace for trouble shooting
|
2013-04-26 17:43:24 +08:00 |
|