yanyiwu
|
53bc279dea
|
fix compiler warning
|
2016-07-23 20:49:27 +08:00 |
|
yanyiwu
|
0984c9ed3f
|
update user dict loading method about word weight, and add unit tests
|
2016-07-22 23:53:49 +08:00 |
|
npes87184
|
0c3cf04b43
|
fix second element parse error in dict
|
2016-07-22 10:19:28 +08:00 |
|
bigelephant29
|
986106a553
|
change stoi to atoi
|
2016-07-21 10:54:08 +08:00 |
|
bigelephant29
|
2e1b6e0443
|
user dict support user weight and user tag
|
2016-07-21 10:38:46 +08:00 |
|
bigelephant29
|
b82acaf71e
|
fix user dict tag bug : wrong buf index assigned
|
2016-07-21 10:06:24 +08:00 |
|
t-k-
|
5775a40bee
|
Add LookupTag function for single token tag lookup.
|
2016-07-06 02:44:56 -06:00 |
|
Jaimin Pan
|
ce8cafe54a
|
add tag capbility for each segments
|
2016-06-27 18:10:42 +08:00 |
|
yanyiwu
|
c425bcc49f
|
add Jieba::ResetSeparators api and unittest
|
2016-05-09 22:49:51 +08:00 |
|
yanyiwu
|
6e3ecec599
|
improve readability
|
2016-05-09 22:09:57 +08:00 |
|
yanyiwu
|
0a23d6b268
|
merge questionfish/master
|
2016-05-04 19:27:05 +08:00 |
|
mayunyun
|
d5a52a8e7b
|
1. remove stopword from span windows
2. update unittest
|
2016-05-04 17:52:30 +08:00 |
|
yanyiwu
|
5c739484ae
|
merge the latest codes in master branch, and update unittest cases to pass ci
|
2016-05-03 23:20:03 +08:00 |
|
yanyiwu
|
f253db0133
|
use map/set instead of unordered_map/unordered_set to make result stable
|
2016-05-03 21:24:40 +08:00 |
|
Yanyi Wu
|
6d105a864d
|
Update TextRankExtractor.hpp
remove unused function which using c++11 keyword `auto`
|
2016-05-03 19:53:40 +08:00 |
|
mayunyun
|
0f66a923b3
|
1.增加单元测试
2.增加了构造函数的重载,增加了提取函数的重载
|
2016-05-03 18:06:14 +08:00 |
|
mayunyun
|
f2de41c15e
|
code layout change: tab -> space
|
2016-05-03 09:03:16 +08:00 |
|
yanyiwu
|
5ac9e48eb0
|
rewrite QuerySegment, make Jieba::CutForSearch behaves the same as [jieba] cut_for_search api
remove Jieba::SetQuerySegmentThreshold
|
2016-05-02 16:18:36 +08:00 |
|
mayunyun
|
1aa0a32d90
|
code format check
|
2016-04-25 20:28:47 +08:00 |
|
mayunyun
|
669e971e3e
|
new file: include/cppjieba/TextRankExtractor.hpp
Add TextRank Keyword Extractor to JiebaCpp
新增TextRank关键词提取
|
2016-04-25 20:20:50 +08:00 |
|
yanyiwu
|
3befc42697
|
update KeywordExtractor::Word's printing format to json format
|
2016-04-19 16:00:53 +08:00 |
|
yanyiwu
|
29e085904d
|
add log and unittest
|
2016-04-18 14:55:42 +08:00 |
|
yanyiwu
|
63e9c94fb7
|
add unicode decoding unittest
|
2016-04-18 14:37:17 +08:00 |
|
yanyiwu
|
6fa843b527
|
override Cut functions, add location information into Word results;
|
2016-04-17 23:39:57 +08:00 |
|
yanyiwu
|
b6703aba90
|
use offset instead of str in RuneStr
|
2016-04-17 22:50:32 +08:00 |
|
yanyiwu
|
e7a45d2dde
|
remove LevelSegment
|
2016-04-17 22:23:00 +08:00 |
|
yanyiwu
|
42a73eeb64
|
make compiler happy
|
2016-04-17 22:11:58 +08:00 |
|
yanyiwu
|
dcced8561e
|
remove namespace unicode
|
2016-04-17 21:59:10 +08:00 |
|
yanyiwu
|
6ff6fe1430
|
WordRange construct
|
2016-04-17 21:57:36 +08:00 |
|
yanyiwu
|
339e3ca772
|
big change: add RuneStr for the position of word in string
|
2016-04-17 17:30:05 +08:00 |
|
yanyiwu
|
c19736995c
|
Add KeywordExtractor::Word and add more overrided KeywordExtractor::Extract
|
2016-03-26 22:12:40 +08:00 |
|
yanyiwu
|
e6a2b47b87
|
hange the return value of KeywordExtractor::Extract from bool to void
|
2016-03-26 01:16:44 +08:00 |
|
yanyiwu
|
5102b8a5c3
|
Change Jieba::Locate to be static function.
|
2016-03-26 01:14:48 +08:00 |
|
yanyiwu
|
7db3f87b5f
|
remove info log for dict loading
|
2016-03-22 10:45:20 +08:00 |
|
yanyiwu
|
3e28b4bcb1
|
adjust code for limonp v0.5.5 to solve macro name conflicts
|
2016-02-19 16:15:23 +08:00 |
|
yanyiwu
|
14e09290c2
|
change Rune type from uint16_t to uint32_t to support more chinese word
|
2016-02-18 14:54:03 +08:00 |
|
yanyiwu
|
239d025cd8
|
delete HashMap, use unordered_map instead
|
2016-02-16 20:24:28 +08:00 |
|
yanyiwu
|
e6454fef77
|
use HashMap in Trie, and remove the base array of trie root node, see details in Changelog
|
2016-02-12 01:37:39 +08:00 |
|
yanyiwu
|
2d3c51dba7
|
upgrade limonp and use limonp::HashMap in Trie
|
2016-02-04 23:43:26 +08:00 |
|
yanyiwu
|
721b34f1bd
|
fix bug, see details in ChangeLog.md
|
2016-01-29 00:30:38 +08:00 |
|
yanyiwu
|
193e717d22
|
override constructor in KeywordExtractor
|
2016-01-13 00:40:46 +08:00 |
|
yanyiwu
|
3c5ad24260
|
source code layout change:
1. src/ -> include/cppjieba/
2. src/limonp/ -> deps/limonp/
3. server/husky -> deps/husky/
4. test/unittest/gtest -> deps/gtest
|
2016-01-11 14:25:02 +08:00 |
|