mayunyun
|
0f66a923b3
|
1.增加单元测试
2.增加了构造函数的重载,增加了提取函数的重载
|
2016-05-03 18:06:14 +08:00 |
|
yanyiwu
|
81cd435f2a
|
prettify demo output
|
2016-03-28 01:22:24 +08:00 |
|
yanyiwu
|
c19736995c
|
Add KeywordExtractor::Word and add more overrided KeywordExtractor::Extract
|
2016-03-26 22:12:40 +08:00 |
|
yanyiwu
|
e6a2b47b87
|
hange the return value of KeywordExtractor::Extract from bool to void
|
2016-03-26 01:16:44 +08:00 |
|
yanyiwu
|
0a7b6e62f3
|
add Unicode32 cases for cut testing
|
2016-02-18 15:18:35 +08:00 |
|
yanyiwu
|
e6454fef77
|
use HashMap in Trie, and remove the base array of trie root node, see details in Changelog
|
2016-02-12 01:37:39 +08:00 |
|
yanyiwu
|
34668aa379
|
remove server, see details in ChangeLog.md
|
2016-01-21 01:07:31 +08:00 |
|
yanyiwu
|
193e717d22
|
override constructor in KeywordExtractor
|
2016-01-13 00:40:46 +08:00 |
|
yanyiwu
|
b41cb0e2ee
|
fix compile error
|
2016-01-11 14:50:14 +08:00 |
|
yanyiwu
|
3c5ad24260
|
source code layout change:
1. src/ -> include/cppjieba/
2. src/limonp/ -> deps/limonp/
3. server/husky -> deps/husky/
4. test/unittest/gtest -> deps/gtest
|
2016-01-11 14:25:02 +08:00 |
|
yanyiwu
|
bcb112a4b1
|
upgrade basic functions
|
2015-12-12 21:25:57 +08:00 |
|
yanyiwu
|
8482bef442
|
change multi user dicts seperator from ':' to '|;'
|
2015-12-09 00:01:27 +08:00 |
|
yanyiwu
|
8dc01ae614
|
add Jieba::Locate function to get word location of cutted sentence
|
2015-12-02 01:19:23 +08:00 |
|
yanyiwu
|
60ca5093a9
|
add Jieba::Tag
|
2015-11-26 00:47:16 +08:00 |
|
yanyiwu
|
c3fd357a6d
|
[QuerySegment] add SetMaxWordLen,GetMaxWordLen, and filter the english sentence in secondary Cut
|
2015-10-29 14:23:01 +08:00 |
|
yanyiwu
|
83cc67cb15
|
[code style] uppercase function name
|
2015-10-29 12:39:10 +08:00 |
|
yanyiwu
|
6f51373280
|
support optional user word freq weight
|
2015-10-09 11:20:06 +08:00 |
|
yanyiwu
|
ecacf118e6
|
[code style] lower case namespace
|
2015-10-08 21:13:11 +08:00 |
|
yanyiwu
|
16b69e35c1
|
delete Application.hpp, use Jieba.hpp instead
|
2015-10-08 21:03:09 +08:00 |
|
yanyiwu
|
4d56be920b
|
support optional user word freq weight
|
2015-10-08 20:05:27 +08:00 |
|
yanyiwu
|
b28d6db574
|
code style
|
2015-10-08 17:08:57 +08:00 |
|
yanyiwu
|
5bf7454ad2
|
add multi user dict unittest
|
2015-09-25 16:07:01 +08:00 |
|
yanyiwu
|
ea4d81cde7
|
add segment cut case
|
2015-09-18 14:28:34 +08:00 |
|
yanyiwu
|
eb6f47b6b0
|
refactor unittest
|
2015-09-13 18:09:56 +08:00 |
|
yanyiwu
|
8eef9a13a8
|
fix bug about optional argument hmm
|
2015-09-13 18:06:44 +08:00 |
|
yanyiwu
|
14974d51b4
|
abondom ISegment
|
2015-09-13 17:02:04 +08:00 |
|
yanyiwu
|
e9241d9025
|
fixed the bug in the last commit
|
2015-09-13 16:18:48 +08:00 |
|
yanyiwu
|
28bcb3bf57
|
use PreFilter in SegmentBase
|
2015-09-13 16:05:17 +08:00 |
|
yanyiwu
|
0542dd1cfd
|
add PreFilter
|
2015-09-13 15:10:10 +08:00 |
|
yanyiwu
|
1babe57ebc
|
细粒度分词功能
|
2015-08-30 16:35:21 +08:00 |
|
yanyiwu
|
3c60c35906
|
修复FullSegment对于有些单字没有输出的bug
|
2015-08-30 13:09:37 +08:00 |
|
yanyiwu
|
001a69d8c6
|
增加MPSegment的细粒度分词功能。
|
2015-08-30 01:04:30 +08:00 |
|
yanyiwu
|
0e0318f6ad
|
集成LevelSegment进Application
|
2015-08-11 11:57:58 +08:00 |
|
yanyiwu
|
41e4300c9a
|
LevelSegment
|
2015-08-11 00:53:06 +08:00 |
|
yanyiwu
|
efd029c20b
|
namespace husky; namespace limonp;
|
2015-08-08 12:30:14 +08:00 |
|
yanyiwu
|
8a3ced2b27
|
去掉一些没必要的返回值判断,精简代码
|
2015-07-24 14:39:03 +08:00 |
|
yanyiwu
|
4d86abb001
|
新增findByLimit函数
|
2015-07-23 21:10:56 +08:00 |
|
yanyiwu
|
0e16e000ea
|
解决一些历史遗留问题
|
2015-07-21 14:32:05 +08:00 |
|
yanyiwu
|
620d276887
|
底层常用结构修整
|
2015-07-21 12:11:43 +08:00 |
|
yanyiwu
|
2ae6eba3a7
|
更新insertUserWord的示例程序
|
2015-06-27 13:16:25 +08:00 |
|
yanyiwu
|
d33c09d74a
|
增加单元测试
|
2015-06-27 12:34:27 +08:00 |
|
yanyiwu
|
64d073d194
|
支持insertUserWord接口
|
2015-06-27 11:39:43 +08:00 |
|
yanyiwu
|
c5f7d4d670
|
重构trie前先ci一下
|
2015-06-26 14:29:44 +08:00 |
|
yanyiwu
|
e0db070529
|
开放insertUserWord接口;增加cut的默认参数,默认切词算法为Mix
|
2015-06-26 12:22:11 +08:00 |
|
yanyiwu
|
67cc5941be
|
update demo
|
2015-06-07 11:13:33 +08:00 |
|
yanyiwu
|
3528b6296a
|
修改 cjserver 服务,可以通过http参数使用不同切词算法进行切词。
修改 make install 的安装目录,统一安装到同一个目录 /usr/local/cppjieba
|
2015-06-05 21:59:16 +08:00 |
|
yanyiwu
|
8ce2af9706
|
更新Demo示例文件,demo只使用一个Application实例即可。
|
2015-06-05 18:12:27 +08:00 |
|
yanyiwu
|
e5d1ac7bc8
|
把dict/{extra_dict,gbk_dict} 挪进 test/testdata
|
2015-06-05 16:31:43 +08:00 |
|
yanyiwu
|
a3d9b40c2a
|
修改QuerySegment的构造函数参数顺序
|
2015-06-05 16:23:51 +08:00 |
|
yanyiwu
|
45588b75cc
|
增加 Application 这个类,整合了所有CppJieba的功能进去,以后用户只需要使用这个类即可。
|
2015-06-05 16:00:32 +08:00 |
|