311 Commits

Author SHA1 Message Date
wyy
a63fe809b1 rm unused file 2014-11-30 00:34:17 +08:00
Qin Wenfeng
2b522b20ff use uint8_t to pass UBSAN tests 2014-11-29 19:41:12 +08:00
Qin Wenfeng
61f2031e4b remove duplicate header includes in MPSegment.hpp 2014-11-29 19:36:55 +08:00
wyy
e9cbec02c2 add two POS tagging rules for consecutive English letters and digits 2014-11-29 12:45:11 +08:00
wyy
7868f7cdff remove some template code 2014-11-13 01:16:38 +08:00
wyy
c119dc0a93 use localvector in dag 2014-11-12 21:18:30 +08:00
wyy
99c3405e13 move flag 2014-11-12 20:03:32 +08:00
wyy
75367a20c9 little modification 2014-11-12 19:45:20 +08:00
wyy
3ced451212 use automation 2014-11-12 18:55:17 +08:00
wyy
b9736ee132 update trie and dag , make cut faster . see details in changelog.md 2014-11-05 15:31:09 +08:00
wyy
fbae0f6075 add two segmentation rules 2014-11-03 10:54:53 +08:00
wyy
ef5766904a change the custom POS tag format to: word tag 2014-09-28 13:43:30 +08:00
wyy
6a8ebae344 support custom POS tags 2014-09-28 13:22:37 +08:00
wyy
28246fba5d remove some currently unused parameters from the PosTagger constructor and add unit tests for PosTagger 2014-09-28 11:59:30 +08:00
wyy
da1b9e0c1c update limonp 2014-09-18 00:05:43 +08:00
wyy
fbbcfbdec7 update limonp and husky for threadpool using 2014-09-15 17:52:33 +08:00
wyy
40eb40288d compatible with -std=c++0x 2014-08-15 22:09:21 +08:00
wyy
9571a4d0d5 remove InitOnOff to make code lighter 2014-08-12 00:34:37 +08:00
wyy
5bfd3d0c49 update fullsegment for reducing memory cost 2014-08-11 23:34:29 +08:00
wyy
8df0a1c89e fix max probability segmentor's bug: result is incomplete when the sentence contains special symbols 2014-07-08 23:38:06 -07:00
wyy
007649494d avoid warning in cmake about Loggger.hpp 2014-07-05 19:18:39 +08:00
wyy
4a8f63fcd2 make segments NonCopyable 2014-06-11 16:18:09 +08:00
wyy
12d3741562 avoid warning in g++ 2014-06-05 19:29:57 +08:00
wyy
a8f83dd6f0 update localvector 2014-06-05 18:30:08 +08:00
wyy
189b2725a0 add localvector 2014-06-05 01:00:17 +08:00
wyy
76dd93051e add localvector 2014-06-05 00:48:49 +08:00
wyy
c46980c17c minor change 2014-05-30 00:21:11 +08:00
wyy
e96885c38e update limonp/codeconverter.hpp 2014-05-29 23:57:32 +08:00
wyy
059f05c25d update limonp : add CodeConverter and delete some unused files 2014-05-29 22:39:22 +08:00
wyy
fb608627c9 update limonp 2014-05-26 17:15:52 +08:00
wyy
75581495b4 use vector's reserve 2014-05-24 16:09:00 +08:00
wyy
bc6ed2368d use vector's reserve 2014-05-24 15:37:31 +08:00
wyy
1a314d4b4c use vector's reserve 2014-05-24 13:44:55 +08:00
wyy
5b654f66db make single one chinese word in userdict will not be ignored in mixsegment.hpp 2014-05-17 16:22:54 +08:00
wyy
f8487fd9cf remove src/segment and mv server.cpp into server/server.cpp and modify readme.md 2014-04-25 21:48:29 +08:00
wyy
3e0aaf73a5 adding user dict interface and test ok 2014-04-25 19:30:26 +08:00
wyy
2937985243 adding user dict interface 2014-04-25 18:47:22 +08:00
wyy
dc96bb3795 add userdict loader 2014-04-25 17:29:42 +08:00
wyy
ae3e0a1b6a make keywordextractor faster 2014-04-20 00:20:25 +08:00
wyy
e225c8c722 and modify some test case 2014-04-19 12:35:19 +08:00
wyy
a585471e76 rewrite cut for chinese special symbol 2014-04-19 11:25:13 +08:00
wyy
d61d694ac7 do some rename 2014-04-16 19:12:24 +08:00
wyy
76d640b26e use filterSpecialChars in segmentbase.hpp 2014-04-14 22:21:09 +08:00
wyy
bb6c3f9e78 add shrink for vector in DictTrie.hpp 2014-04-11 15:25:03 +08:00
wyy
0af9ae3de3 rewrite server script for cjserver 2014-04-11 15:08:24 +08:00
wyy
cae1503725 split Trie.hpp into (Trie.hpp & DictTrie.hpp) test ok 2014-04-11 12:08:46 +08:00
wyy
24120c92b1 compile ok 2014-04-10 09:16:35 -07:00
wyy
776191b375 ci 2014-04-10 22:32:39 +08:00
wyy
abd23a4d79 rename Trie -> DictTrie 2014-04-10 21:07:11 +08:00
wyy
f70b654b66 split Trie.hpp into (Trie.hpp & DictTrie.hpp) 2014-04-10 21:05:01 +08:00