25 Commits

Author SHA1 Message Date
yanyiwu
0984c9ed3f update user dict loading method about word weight, and add unit tests 2016-07-22 23:53:49 +08:00
yanyiwu
c3fd357a6d [QuerySegment] add SetMaxWordLen,GetMaxWordLen, and filter the english sentence in secondary Cut 2015-10-29 14:23:01 +08:00
yanyiwu
5bf7454ad2 add multi user dict unittest 2015-09-25 16:07:01 +08:00
yanyiwu
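3528b6296a modify the cjserver service so that different segmentation algorithms can be selected via HTTP parameters.
change the make install target directory so that everything is installed into a single directory, /usr/local/cppjieba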
2015-06-05 21:59:16 +08:00
yanyiwu
e5d1ac7bc8 move dict/{extra_dict,gbk_dict} into test/testdata 2015-06-05 16:31:43 +08:00
yanyiwu
269bc0fd0d make QuerySegment support user.dict.utf8 2015-01-23 01:10:12 +08:00
wyy
e9cbec02c2 add two part-of-speech tagging rules for consecutive English letters and digits. 2014-11-29 12:45:11 +08:00
wyy
107638f7d8 modify test data, etc. 2014-11-03 11:19:00 +08:00
wyy
b68a76e63a improve some tests 2014-10-26 12:21:10 +08:00
wyy
fbbcfbdec7 update limonp and husky for threadpool using 2014-09-15 17:52:33 +08:00
wyy
8df0a1c89e fix max probability segmentor's bug: result is incomplete when the sentence contains special symbols 2014-07-08 23:38:06 -07:00
wyy
bc6ed2368d use vector's reserve 2014-05-24 15:37:31 +08:00
wyy
f0a0731b74 add server.conf into testdata for testing 2014-05-17 21:20:09 -05:00
wyy
5b654f66db make sure a single-character Chinese word in userdict is not ignored in mixsegment.hpp 2014-05-17 16:22:54 +08:00
wyy
57ef504d9b modify test/segment_demo.cpp 2014-04-25 22:09:55 +08:00
wyy
566187a49c add userdict.utf8 2014-04-25 19:22:32 +08:00
wyy
884aa89009 add test case 2014-04-19 13:01:31 +08:00
wyy
e225c8c722 and modify some test case 2014-04-19 12:35:19 +08:00
wyy
76d640b26e use filterSpecialChars in segmentbase.hpp 2014-04-14 22:21:09 +08:00
wyy
a4b0a6c762 rm TrieManager.hpp 2014-03-15 22:48:29 +08:00
wyy
df9f47eb47 add ut case 2014-03-09 04:49:47 -07:00
wyy
1db13168ff add servertest 2013-12-24 02:58:34 -08:00
aholic
4f21617180 add test data for TrieManager 2013-12-17 01:05:46 +08:00
wyy
3545eef281 modify test 2013-12-14 22:18:39 -08:00
wyy
7744e7c36c add weicheng.utf8 2013-12-14 19:37:17 -08:00