59 Commits

Author SHA1 Message Date
yanyiwu
b28d6db574 code style 2015-10-08 17:08:57 +08:00
yanyiwu
14974d51b4 abondom ISegment 2015-09-13 17:02:04 +08:00
yanyiwu
001a69d8c6 增加MPSegment的细粒度分词功能。 2015-08-30 01:04:30 +08:00
yanyiwu
fae951a95d 统一私有函数的命名风格 2015-08-28 11:17:38 +08:00
yanyiwu
41e4300c9a LevelSegment 2015-08-11 00:53:06 +08:00
yanyiwu
efd029c20b namespace husky; namespace limonp; 2015-08-08 12:30:14 +08:00
yanyiwu
8a3ced2b27 去掉一些没必要的返回值判断,精简代码 2015-07-24 14:39:03 +08:00
yanyiwu
0f79fa6c24 统一在SegmentBase搞定所有Unicode和string的转码事情 2015-07-24 13:42:24 +08:00
yanyiwu
78e41e5fd0 规范Unicode的相关命名,使用Rune代表一个中文字符 2015-07-21 14:54:50 +08:00
yanyiwu
0e16e000ea 解决一些历史遗留问题 2015-07-21 14:32:05 +08:00
yanyiwu
620d276887 底层常用结构修整 2015-07-21 12:11:43 +08:00
yanyiwu
d56bf2cc68 重构:增加让各个分词类的构造函数,为后面的憋大招做准备。 2015-06-04 22:38:55 +08:00
yanyiwu
aed1c8f4a6 删除一些无必要的错误检查 2015-05-21 16:04:41 +08:00
yanyiwu
6e3bb7d057 use reverse_iterator 2015-05-18 23:57:13 +08:00
yanyiwu
c04b2dd0d4 增加更详细的错误日志,在初始化过程中合理使用LogFatal。 2015-05-07 20:03:19 +08:00
yanyiwu
2b18a582fc code style 2015-05-06 23:02:03 +08:00
yanyiwu
bb32234654 astyle --style=google --indent=spaces=2 2015-05-06 17:53:20 +08:00
yanyiwu
10e9b32258 little adjustment 2015-01-31 12:58:49 +08:00
Qin Wenfeng
61f2031e4b 删除 MPSegment.hpp 中的重复头文件 2014-11-29 19:36:55 +08:00
wyy
75367a20c9 little modification 2014-11-12 19:45:20 +08:00
wyy
3ced451212 use automation 2014-11-12 18:55:17 +08:00
wyy
b9736ee132 update trie and dag , make cut faster . see details in changelog.md 2014-11-05 15:31:09 +08:00
wyy
6a8ebae344 支持自定义词性 2014-09-28 13:22:37 +08:00
wyy
40eb40288d compatiable with -std=c++0x 2014-08-15 22:09:21 +08:00
wyy
9571a4d0d5 remove InitOnOff to make code lighter 2014-08-12 00:34:37 +08:00
wyy
5bfd3d0c49 update fullsegment for reducing memory cost 2014-08-11 23:34:29 +08:00
wyy
8df0a1c89e fix max probability segmentor's bug : result is imcomplete while speical symbol in sentence 2014-07-08 23:38:06 -07:00
wyy
fb608627c9 update limonp 2014-05-26 17:15:52 +08:00
wyy
bc6ed2368d use vector's reserve 2014-05-24 15:37:31 +08:00
wyy
1a314d4b4c use vector's reserve 2014-05-24 13:44:55 +08:00
wyy
5b654f66db make single one chinese word in userdict will not be ignored in mixsegment.hpp 2014-05-17 16:22:54 +08:00
wyy
2937985243 adding user dict interface 2014-04-25 18:47:22 +08:00
wyy
dc96bb3795 add userdict loader 2014-04-25 17:29:42 +08:00
wyy
d61d694ac7 do some rename 2014-04-16 19:12:24 +08:00
wyy
24120c92b1 compile ok 2014-04-10 09:16:35 -07:00
wyy
abd23a4d79 rename Trie -> DictTrie 2014-04-10 21:07:11 +08:00
wyy
61f542a6b1 little modify MPSegment 2014-04-08 09:05:09 -07:00
wyy
45a7cac784 change MPSegment's cut(..., vector<TrieNodeInfo>) -> cut(..., vector<Unicode>) 2014-04-08 08:43:32 -07:00
wyy
a3e0db22e8 change trie.find args 2014-04-08 19:59:02 +08:00
wyy
f254691e53 ci MPSegment.hpp 2014-04-07 23:05:09 -07:00
wyy
fe7e3ff807 prettify Trie.hpp ing 2014-03-16 20:20:37 +08:00
wyy
ddaa5589f1 rm TrieManager.hpp 2014-03-15 22:02:48 +08:00
wyy
3eb0470c2f update husky and limonp 2013-12-23 23:58:54 -08:00
wyy
679179859e add some log debug & info 2013-12-21 21:47:01 -08:00
wyy
fa75f0f319 modify construction and init for segments 2013-12-21 09:37:12 -08:00
wyy
335a7eff47 add THMMSegment.cpp and TMPSegment.cpp for fix little error in using hmmsegment and mpsegment. 2013-12-18 22:42:46 -08:00
aholic
7add684a8a change algorithm for QuerySegment(now is mix+full) | use TrieManager to get a trie for all Segment 2013-12-16 14:18:44 +08:00
wyy
1e29d25855 use assert for getinitflag 2013-12-11 04:52:33 -08:00
wyy
acb4150e3c remove some unused code 2013-12-08 03:29:28 -08:00
wyy
313e05da1b ci for lunch 2013-12-07 20:25:28 -08:00