aholic
|
e2c796088f
|
better indent
|
2014-01-24 00:43:48 +08:00 |
|
fxsjy
|
5e6a2c4661
|
fix a bug of add_word
|
2013-12-05 13:35:40 +08:00 |
|
fxsjy
|
136676381a
|
fix a bug of add_word
|
2013-12-05 13:33:24 +08:00 |
|
Herman Schaaf
|
95286b8887
|
Fix typo in error message
|
2013-10-21 22:21:09 +09:00 |
|
fxsjy
|
14a0ab0466
|
fix a bug in issue #111
|
2013-10-11 13:05:59 +08:00 |
|
fxsjy
|
759e1029c8
|
add an API to control log level: jieba.setLogLevel
|
2013-09-22 10:26:33 +08:00 |
|
Mozillazg
|
1cf3f0d00b
|
use logging instead of print
|
2013-09-19 10:31:44 +08:00 |
|
gan
|
31d5845535
|
add better support for English, e.g. input 'this is interesting and interested me' --> output 'this interest interest', where 'interest' matches both 'interesting' and 'interested'
|
2013-09-09 11:54:30 +08:00 |
|
Sun Junyi
|
7e7fcc1184
|
add an option to disable HMM
|
2013-09-05 17:09:27 +08:00 |
|
fxsjy
|
21f7da0ca4
|
convert tabs to spaces
|
2013-08-30 18:31:25 +08:00 |
|
fxsjy
|
c5bd9773d1
|
fix bug in issue #103
|
2013-08-30 18:26:53 +08:00 |
|
ZoeyYoung
|
25839b5127
|
fix bug
|
2013-08-21 19:46:14 +08:00 |
|
ZoeyYoung
|
d49542c06e
|
fix bug
|
2013-08-21 19:31:12 +08:00 |
|
ZoeyYoung
|
dce353f88b
|
merge from master
|
2013-08-21 15:32:46 +08:00 |
|
ZoeyYoung
|
2857ae45cc
|
Merge branch 'master' into jieba3k
Conflicts:
Changelog
jieba/__init__.py
jieba/finalseg/__init__.py
jieba/posseg/__init__.py
setup.py
test/parallel/test_file.py
test/test_file.py
|
2013-08-21 13:55:21 +08:00 |
|
gwdwyy
|
cc81135429
|
sed -i 's/not \(.*\) in/\1 not in/g' ...
|
2013-08-20 20:08:03 +08:00 |
|
Sun Junyi
|
90ab511deb
|
fix the bug about issue: #92
|
2013-08-09 13:59:02 +08:00 |
|
fxsjy
|
b77645b3aa
|
modify test_file.py; use less memory
|
2013-07-29 10:17:39 +08:00 |
|
fxsjy
|
ed1fa64e27
|
fix a bug: sys.version_info.major can't be used in Python 2.5
|
2013-07-29 10:07:55 +08:00 |
|
Sun Junyi
|
0f972df0ac
|
raise exception in case of lower version
|
2013-07-29 10:01:47 +08:00 |
|
Sun Junyi
|
e68bb5a28e
|
fix a compatibility problem: Python 2.5 has no 'multiprocessing'
|
2013-07-29 09:57:09 +08:00 |
|
Sun Junyi
|
689e27280a
|
Merge branch 'master' of https://github.com/fxsjy/jieba
|
2013-07-29 09:49:10 +08:00 |
|
Sun Junyi
|
9d87e798fd
|
0.31 release
|
2013-07-29 09:48:53 +08:00 |
|
Linker Lin
|
1dbc525dff
|
Automatically detect the number of CPUs and start an appropriate number of processes.
|
2013-07-28 00:10:27 +08:00 |
|
fxsjy
|
8e9b4bbe72
|
fix the compatibility with Python2.5
|
2013-07-25 10:25:24 +08:00 |
|
Sun Junyi
|
d4ede0fee6
|
hold the backward compatibility, let jython use a special loading workflow
|
2013-07-25 10:08:58 +08:00 |
|
piaolignxue
|
aea8496b1f
|
serialize model to file so that it can support jython.
|
2013-07-24 22:50:48 +08:00 |
|
Sun Junyi
|
6549deabbd
|
merge change from master
|
2013-07-16 11:06:41 +08:00 |
|
Sun Junyi
|
d691d91674
|
fix a bug about ImportError
|
2013-07-15 09:32:52 +08:00 |
|
Sun Junyi
|
d63140fe5e
|
make a series of white spaces separated
|
2013-07-10 17:27:47 +08:00 |
|
Richard Wong
|
c2ded83ead
|
Refactor: fix line indent to 4.
* jieba/__init__.py (cut):
|
2013-07-10 16:22:49 +08:00 |
|
Richard Wong
|
99d2492d67
|
Add re.U flag to re variable.
|
2013-07-10 16:22:17 +08:00 |
|
Richard Wong
|
fbfaac2eaa
|
Reindent function
* jieba/__init__.py (require_initialized):
|
2013-07-08 13:54:36 +08:00 |
|
Richard Wong
|
7bfd432fc5
|
Remove the unused imports.
|
2013-07-08 13:51:39 +08:00 |
|
Cheng wei
|
27cf9cfd62
|
fix invalid syntax
* Python 3.2 does not support unicode literals
* unicode regex as normal
|
2013-07-06 02:51:13 +08:00 |
|
Sun Junyi
|
9d0ea771a5
|
fix bug; decimals & digit-english mixed
|
2013-07-05 16:16:49 +08:00 |
|
Sun Junyi
|
b9b1f1a418
|
fix conflict of merging
|
2013-07-03 17:47:45 +08:00 |
|
Sun Junyi
|
c01680c6a8
|
merge the new file
|
2013-07-03 17:29:33 +08:00 |
|
Sun Junyi
|
b62f052927
|
PEP8
|
2013-07-03 17:21:21 +08:00 |
|
Sun Junyi
|
9ea14a8a54
|
merge change from chao78787
|
2013-07-03 17:07:16 +08:00 |
|
Sun Junyi
|
45daf561c7
|
follow PEP8: change tab to 4 white spaces
|
2013-07-03 16:58:22 +08:00 |
|
Richard Wong
|
3246236133
|
Separate cal and IO process.
|
2013-07-03 15:03:45 +08:00 |
|
Sun Junyi
|
efc784312c
|
add ChineseAnalyzer for whoosh search engine
|
2013-07-01 10:53:39 +08:00 |
|
Sun Junyi
|
f08690a2df
|
add 'search mode' for jieba.tokenize
|
2013-06-28 12:04:16 +08:00 |
|
Sun Junyi
|
237dc6625e
|
add mix words to extra_dict/dict.txt.big
|
2013-06-26 09:36:41 +08:00 |
|
Sun Junyi
|
11a3b10755
|
new method: jieba.tokenize
|
2013-06-24 16:14:11 +08:00 |
|
Sun Junyi
|
1a3be67691
|
make cache dumping more robust
|
2013-06-24 13:48:16 +08:00 |
|
Sun Junyi
|
ca97b19951
|
merge change from master
|
2013-06-23 22:28:32 +08:00 |
|
Sun Junyi
|
38b6bcd54e
|
remove some words
|
2013-06-23 21:52:22 +08:00 |
|
fxsjy
|
e1afafe353
|
fix a bug of cxfree support
|
2013-06-23 12:50:28 +08:00 |
|