336 Commits

Author SHA1 Message Date
fxsjy
c8df565981 more log trace for trouble shooting 2013-04-26 17:43:24 +08:00
fxsjy
04eb4f08cf fix a bug of changing dictionary 2013-04-26 16:48:46 +08:00
fxsjy
8666428fb0 fix a bug of changing dictionary 2013-04-26 16:47:00 +08:00
fxsjy
9bebe6120b utf-8 output is more friendly to Linux 2013-04-26 16:19:00 +08:00
Sun Junyi
d3339633d5 in the speed test: initialize first to ignore the time of dict loading 2013-04-26 14:51:58 +08:00
fxsjy
bc049090a5 make lazy load thread safe 2013-04-26 12:54:05 +08:00
fxsjy
d2460029d5 merge lazy load 2013-04-26 09:57:06 +08:00
Herman Schaaf
7342a18534 Update readme in both languages with new functions 2013-04-25 21:46:15 +09:00
Herman Schaaf
c6098a8657 Add initialize function and lazy initialization 2013-04-25 21:04:56 +09:00
fxsjy
47d94a13e6 log(1)==0, since we have changed from PRODUCT to sum of LOG 2013-04-25 10:11:04 +08:00
fxsjy
c350fab2b9 fix wrong line number 2013-04-25 09:28:00 +08:00
fxsjy
65b78b2b4d read() and then split -- faster; from __future__ import with 2013-04-24 22:14:10 +08:00
Sun Junyi
966532b462 Merge pull request #39 from neuront/master
auto close file; locate error when failing to parse
2013-04-24 07:00:50 -07:00
Neuron Teckid
166c2ca7a5 auto close file; locate error when failing to parse 2013-04-24 19:01:08 +08:00
Sun Junyi
5f8435ce58 Update README.md 2013-04-22 15:57:36 +08:00
Sun Junyi
7337c6d420 Merge branch 'master' of https://github.com/fxsjy/jieba 2013-04-22 13:27:00 +08:00
Sun Junyi
ceae5c56d8 add changelog 2013-04-22 13:26:40 +08:00
Sun Junyi
604e6910e2 Update README.md 2013-04-22 13:08:23 +08:00
Sun Junyi
9af4d0a9d9 Update README.md 2013-04-22 12:49:54 +08:00
Sun Junyi
b06d6de174 Update README.md 2013-04-22 12:49:22 +08:00
Sun Junyi
f2fa585f3a Update README.md 2013-04-22 12:48:49 +08:00
Sun Junyi
825da757d0 Update README.md 2013-04-22 12:47:31 +08:00
Sun Junyi
1bb497ac09 version change v0.27 2013-04-22 12:37:02 +08:00
fxsjy
3f003e2f29 new method: jieba.disable_parallel, which is the inverse operation of jieba.enable_parallel 2013-04-22 12:35:17 +08:00
fxsjy
b46166f768 use CRLF as seperator to make chunks in parallel mode 2013-04-20 18:46:04 +08:00
fxsjy
6b83593b5a rm stub.log 2013-04-20 14:13:10 +08:00
fxsjy
62cf22121f new feature: parallel segment with multiprocessing 2013-04-20 14:11:31 +08:00
Sun Junyi
8d89e8afda handle 的 2013-04-19 10:02:33 +08:00
Sun Junyi
012fddf13f ignore white space 2013-04-12 22:37:53 +08:00
fxsjy
45591bb9ab support flag '_'; ignore white space 2013-04-12 21:53:03 +08:00
Sun Junyi
afdcb8a77d Update README.md 2013-04-08 09:56:41 +08:00
Sun Junyi
94ad7e7035 support decimal point 2013-04-08 09:53:04 +08:00
Sun Junyi
72fff6c8e2 support decimal point 2013-04-08 09:40:32 +08:00
Sun Junyi
a383f035ba support decimal point: example PI=3.141569 = > PI / = / 3.14159 2013-04-08 09:38:49 +08:00
Sun Junyi
7ce3433316 fix bug: python2.6 does not support CRLF in eval(astring) 2013-04-07 22:55:06 +08:00
fxsjy
600a7fc285 CRLF to LF 2013-04-07 22:30:18 +08:00
fxsjy
ddeb766202 CRLF to LF 2013-04-07 22:29:39 +08:00
fxsjy
6632bb80ec CRLF to LF 2013-04-07 22:27:58 +08:00
fxsjy
f1d5d90ae6 CRLF to LF 2013-04-07 22:27:17 +08:00
Sun Junyi
fcb3747814 Update README.md 2013-04-07 11:03:54 +08:00
Sun Junyi
9fd2b38293 Update README.md 2013-04-07 11:02:49 +08:00
Sun Junyi
4a9193de4f Update README.md 2013-04-07 11:00:30 +08:00
Sun Junyi
a600868363 version change v0.26 2013-04-07 09:36:04 +08:00
Sun Junyi
659326c4e1 punctuation; improve keywords extraction 2013-04-06 14:02:11 +08:00
Sun Junyi
7d227da5c4 punctuation 2013-04-05 22:49:16 +08:00
Sun Junyi
8e49199993 keep punctuation marks 2013-04-05 21:48:36 +08:00
Sun Junyi
58c363655c support user defined word tag 2013-03-25 17:28:37 +08:00
Sun Junyi
44e19a2e27 fix bug in pypy 2013-03-22 15:20:19 +08:00
Sun Junyi
6cc0e95759 rm 1.log 2013-03-22 15:19:57 +08:00
Sun Junyi
d2634a049b fix a bug in pypy 2013-03-22 15:16:47 +08:00