75 Commits

Author SHA1 Message Date
yanyiwu
a3d9b40c2a 修改QuerySegment的构造函数参数顺序 2015-06-05 16:23:51 +08:00
yanyiwu
45588b75cc 增加 Application 这个类,整合了所有CppJieba的功能进去,以后用户只需要使用这个类即可。 2015-06-05 16:00:32 +08:00
yanyiwu
c04b2dd0d4 增加更详细的错误日志,在初始化过程中合理使用LogFatal。 2015-05-07 20:03:19 +08:00
yanyiwu
bb32234654 astyle --style=google --indent=spaces=2 2015-05-06 17:53:20 +08:00
qinwf
c0bdef74fb 添加英文+数字分词规则 qinwf/jiebaR#7 2015-02-06 10:19:43 +08:00
yanyiwu
2488738b55 update unittest 2015-01-24 15:51:24 +08:00
yanyiwu
4e72d4a06f KeywordExtractor 支持自定义词典(可选参数)。 2015-01-24 15:34:34 +08:00
yanyiwu
269bc0fd0d make QuerySegment support user.dict.utf8 2015-01-23 01:10:12 +08:00
wyy
e9cbec02c2 增加两条词性标注的规则,针对连续英文和数字。 2014-11-29 12:45:11 +08:00
wyy
c119dc0a93 use localvector in dag 2014-11-12 21:18:30 +08:00
wyy
3ced451212 use automation 2014-11-12 18:55:17 +08:00
wyy
b9736ee132 update trie and dag , make cut faster . see details in changelog.md 2014-11-05 15:31:09 +08:00
wyy
471a68e08e 增加测试 2014-11-03 11:30:45 +08:00
wyy
107638f7d8 修改测试数据等 2014-11-03 11:19:00 +08:00
wyy
fbae0f6075 增加两条分词规则 2014-11-03 10:54:53 +08:00
wyy
6a8ebae344 支持自定义词性 2014-09-28 13:22:37 +08:00
wyy
28246fba5d 去除 PosTagger 构造函数里一些暂时无用的参数,和增加 PosTagger 的单元测试。 2014-09-28 11:59:30 +08:00
wyy
4d686edb7f update unittest for compiling ok in mac 2014-08-15 22:30:52 +08:00
wyy
9571a4d0d5 remove InitOnOff to make code lighter 2014-08-12 00:34:37 +08:00
wyy
8df0a1c89e fix max probability segmentor's bug : result is imcomplete while speical symbol in sentence 2014-07-08 23:38:06 -07:00
wyy
5b0ac64bc2 add unittest 2014-07-08 23:07:27 -07:00
wyy
fb608627c9 update limonp 2014-05-26 17:15:52 +08:00
wyy
3e0aaf73a5 adding user dict interface and test ok 2014-04-25 19:30:26 +08:00
wyy
2937985243 adding user dict interface 2014-04-25 18:47:22 +08:00
wyy
dc96bb3795 add userdict loader 2014-04-25 17:29:42 +08:00
wyy
884aa89009 add test case 2014-04-19 13:01:31 +08:00
wyy
e225c8c722 and modify some test case 2014-04-19 12:35:19 +08:00
wyy
cae1503725 split Trie.hpp into (Trie.hpp & DictTrie.hpp) test ok 2014-04-11 12:08:46 +08:00
wyy
24120c92b1 compile ok 2014-04-10 09:16:35 -07:00
wyy
c04ab76afb fix potential bug in Trie.hpp 2014-04-10 02:58:04 -07:00
wyy
a3e0db22e8 change trie.find args 2014-04-08 19:59:02 +08:00
wyy
bfbd63f3e8 remove trie.find(xx,xx, vector) 2014-04-08 19:51:49 +08:00
wyy
8382828e48 little modification 2014-03-30 23:19:10 +08:00
wyy
abe4be255f modify some stuff to adapter lower version cmake & g++ 2014-03-27 01:41:05 -07:00
wyy
d2d6868b75 merge some testfile into one testfile to reduce compiler cost 2014-03-21 11:18:34 +08:00
wyy
fe7e3ff807 prettify Trie.hpp ing 2014-03-16 20:20:37 +08:00
wyy
6de292a56d add stopword in KeywordExtractor 2014-03-15 23:31:59 +08:00
wyy
a4b0a6c762 rm TrieManager.hpp 2014-03-15 22:48:29 +08:00
wyy
ddaa5589f1 rm TrieManager.hpp 2014-03-15 22:02:48 +08:00
wyy
90d2280002 use map as DagType to fix a unordered bug in different environment , by the way, it improves 1/6 speed 2014-03-11 10:28:10 +08:00
wyy
df9f47eb47 add ut case 2014-03-09 04:49:47 -07:00
wyy
a7f4e18027 ci TKeywordExtractor.cpp to fix bug which test in x64 and x86 not the same 2014-03-07 18:35:14 +08:00
wyy
664ded109a modify cmakelist 2014-02-27 12:13:08 +08:00
wyy
eff8d45267 fix bug: cmp function pair<string, uint> -> pair<string, double> 2014-02-10 11:16:24 +08:00
wyy
5cf310f445 modify test for keywordextractor 2014-02-10 00:38:38 +08:00
wyy
5f96dcf09a add filter singword in keywordextractor. 2014-02-07 17:51:08 +08:00
wyy
f64c11c57e add blacklist 2014-01-31 17:37:40 +08:00
wyy
259b296b71 int -> uint for avoid warning 2014-01-29 20:20:24 +08:00
aholic
680399efdc merge upstream 2014-01-12 18:12:22 +08:00
Richard Lee
af7fedd3ef Fix OS X 10.9 compiling issues 2014-01-17 19:11:39 +08:00