Dingyuan Wang
|
8814e08f9b
|
load default dictionary from pkg_resources and improve the loading method;
change the serialized models from marshal to pickle
|
2015-11-12 20:18:09 +08:00 |
|
Dingyuan Wang
|
1c33252fce
|
change the recognized Chinese character range to [\u4E00-\u9FD5]
|
2015-11-09 20:23:43 +08:00 |
|
Dingyuan Wang
|
32a0e92a09
|
don't compile re every time; autopep8
|
2015-02-10 21:22:34 +08:00 |
|
Dingyuan Wang
|
22bcf8be7a
|
Merge master and jieba3k, make the code Python 2/3 compatible
|
2015-02-10 20:54:55 +08:00 |
|
Dingyuan Wang
|
4197dfb8fa
|
store int directly in FREQ; small improvements
|
2015-02-09 16:26:00 +08:00 |
|
Dingyuan Wang
|
765fd6b7f0
|
store int directly in FREQ; small improvements
|
2015-02-09 16:14:12 +08:00 |
|
fxsjy
|
447c1ded8c
|
fix problem for python3.2
|
2014-11-15 13:44:30 +08:00 |
|
Dingyuan Wang
|
7a6caa0c3c
|
port extract_tags, etc to jieba3k; add auto2to3 script
|
2014-11-07 23:33:31 +08:00 |
|
Dingyuan Wang
|
751ff35eb5
|
improve extract_tags; unify extract_tags and testrank
|
2014-10-31 23:15:51 +08:00 |
|
Dingyuan Wang
|
bb1e6000c6
|
fix version; fix spaces at end of line
|
2014-10-19 10:57:46 +08:00 |
|
Dingyuan Wang
|
b367690eeb
|
use prefix dict instead of trie, add a command line interface, and a few small improvements
|
2014-10-19 10:32:23 +08:00 |
|
Dingyuan Wang
|
51df77831b
|
use prefix dict instead of trie, add a command line interface, and a few small improvements
|
2014-10-18 22:23:26 +08:00 |
|
Dingyuan Wang
|
c04ccd0d12
|
Update to v0.32 according to the master branch.
|
2014-06-14 22:31:13 +08:00 |
|
ZoeyYoung
|
25839b5127
|
fix bug
|
2013-08-21 19:46:14 +08:00 |
|
ZoeyYoung
|
d49542c06e
|
fix bug
|
2013-08-21 19:31:12 +08:00 |
|
ZoeyYoung
|
dce353f88b
|
merge from master
|
2013-08-21 15:32:46 +08:00 |
|
ZoeyYoung
|
2857ae45cc
|
Merge branch 'master' into jieba3k
Conflicts:
Changelog
jieba/__init__.py
jieba/finalseg/__init__.py
jieba/posseg/__init__.py
setup.py
test/parallel/test_file.py
test/test_file.py
|
2013-08-21 13:55:21 +08:00 |
|
fxsjy
|
8e9b4bbe72
|
fix the compatibility with Python2.5
|
2013-07-25 10:25:24 +08:00 |
|
Sun Junyi
|
d4ede0fee6
|
hold the backward compatibility, let jython use a special loading workflow
|
2013-07-25 10:08:58 +08:00 |
|
piaolignxue
|
aea8496b1f
|
serialize model to file so that it can support jython.
|
2013-07-24 22:50:48 +08:00 |
|
Sun Junyi
|
6549deabbd
|
merge change from master
|
2013-07-16 11:06:41 +08:00 |
|
Sun Junyi
|
9d0ea771a5
|
fix bug; decimals & digit-english mixed
|
2013-07-05 16:16:49 +08:00 |
|
Sun Junyi
|
b62f052927
|
PEP8
|
2013-07-03 17:21:21 +08:00 |
|
Sun Junyi
|
45daf561c7
|
follow PEP8: change tab to 4 white spaces
|
2013-07-03 16:58:22 +08:00 |
|
Sun Junyi
|
ca97b19951
|
merge change from master
|
2013-06-23 22:28:32 +08:00 |
|
fxsjy
|
c015f4e297
|
support cxfree py2exe; keep white space
|
2013-06-22 21:24:45 +08:00 |
|
Sun Junyi
|
9d1e23ce6f
|
speed up the viterbi
|
2013-06-16 13:21:43 +08:00 |
|
fxsjy
|
d3531f197d
|
rollback, seems no abvious speed up by the previous change
|
2013-06-07 15:51:13 +08:00 |
|
fxsjy
|
f2d6abf063
|
speed up of viterbi
|
2013-06-07 14:41:55 +08:00 |
|
Sun Junyi
|
c77823aa1d
|
merge improvement to Py3k branch
|
2013-04-12 14:58:25 +08:00 |
|
Sun Junyi
|
a383f035ba
|
support decimal point: example PI=3.141569 = > PI / = / 3.14159
|
2013-04-08 09:38:49 +08:00 |
|
Sun Junyi
|
8e49199993
|
keep punctuation marks
|
2013-04-05 21:48:36 +08:00 |
|
Sun Junyi
|
0f4f9067c3
|
fix bugs in jieba for py3k
|
2013-03-21 11:10:57 +08:00 |
|
Sun Junyi
|
fd20cbbd4b
|
use logarithmic addition instead of multiplication, to avoid bad case in issue19
|
2012-12-28 11:29:51 +08:00 |
|
Sun Junyi
|
9c07d80edb
|
first py3k version of jieba
|
2012-11-28 10:50:40 +08:00 |
|
fxsjy
|
cd94e69241
|
fix a bug
|
2012-10-08 21:27:01 +08:00 |
|
fxsjy
|
c8b1cb0c88
|
remove a bug prone role
|
2012-10-08 20:52:35 +08:00 |
|
fxsjy
|
51765aa6dd
|
first commit
|
2012-10-01 15:25:06 +08:00 |
|