56 Commits

Author SHA1 Message Date
wuyanyi
03cc7c39ff feature: add RemoveWord api from https://github.com/yanyiwu/gojieba/pull/99 2022-10-16 13:17:19 +08:00
Yanyi Wu
8a258dfaf4
Merge pull request #127 from byronhe/patch-2
remove duplicate #include
2019-09-15 16:54:42 +08:00
byronhe
55a94b417c
fix typo 2019-09-04 20:50:11 +08:00
byronhe
6444f4b226
fix compile warning 2019-04-29 12:18:03 +08:00
byronhe
798b7b81c9
remove duplicate #include
2019-03-15 15:48:09 +08:00
zhoupeng
111fb007cf expose InsertUserWord and Find 2018-06-09 16:21:13 +08:00
zhoupeng
1e1e585194 LoadUserDict by set, vector 2018-06-08 14:23:01 +08:00
zhoupeng
1066bc085e fix input type, expose to Jieba 2018-06-08 01:32:47 +08:00
zhoupeng
d56e5c0659 InsertUserWord with freq arg, expose InserUserDictNode with vector<string> arg 2018-06-08 00:44:33 +08:00
Wangzhe
e7602afaac Reduce Visual Studio compiler warnings 2017-06-27 23:00:31 +08:00
Roy Guo
f74d716570 Add Unicode offset/length support for Word 2016-10-16 13:05:56 +08:00
Roy Guo
a2f75a00d3 Add Unicode offset/length support for Word 2016-10-16 12:52:50 +08:00
yanyiwu
74c70c70cd create keyword_extract in Jieba 2016-09-11 21:42:53 +08:00
yanyiwu
4a755dff6a may be more friendly for compiler 2016-08-11 00:00:20 +08:00
yanyiwu
53bc279dea fix compiler warning 2016-07-23 20:49:27 +08:00
yanyiwu
0984c9ed3f update user dict loading method about word weight, and add unit tests 2016-07-22 23:53:49 +08:00
npes87184
0c3cf04b43 fix second element parse error in dict 2016-07-22 10:19:28 +08:00
bigelephant29
986106a553 change stoi to atoi 2016-07-21 10:54:08 +08:00
bigelephant29
2e1b6e0443 user dict support user weight and user tag 2016-07-21 10:38:46 +08:00
bigelephant29
b82acaf71e fix user dict tag bug : wrong buf index assigned 2016-07-21 10:06:24 +08:00
t-k-
5775a40bee Add LookupTag function for single token tag lookup. 2016-07-06 02:44:56 -06:00
Jaimin Pan
ce8cafe54a add tag capability for each segment 2016-06-27 18:10:42 +08:00
yanyiwu
c425bcc49f add Jieba::ResetSeparators api and unittest 2016-05-09 22:49:51 +08:00
yanyiwu
6e3ecec599 improve readability 2016-05-09 22:09:57 +08:00
yanyiwu
0a23d6b268 merge questionfish/master 2016-05-04 19:27:05 +08:00
mayunyun
d5a52a8e7b 1. remove stopword from span windows
2. update unittest
2016-05-04 17:52:30 +08:00
yanyiwu
5c739484ae merge the latest code from the master branch, and update unit test cases to pass CI 2016-05-03 23:20:03 +08:00
yanyiwu
f253db0133 use map/set instead of unordered_map/unordered_set to make result stable 2016-05-03 21:24:40 +08:00
Yanyi Wu
6d105a864d Update TextRankExtractor.hpp
remove unused function that uses the C++11 keyword `auto`
2016-05-03 19:53:40 +08:00
mayunyun
0f66a923b3 1. Add unit tests
2. Add constructor overloads and extraction function overloads
2016-05-03 18:06:14 +08:00
mayunyun
f2de41c15e code layout change: tab -> space 2016-05-03 09:03:16 +08:00
yanyiwu
5ac9e48eb0 rewrite QuerySegment, make Jieba::CutForSearch behave the same as [jieba] cut_for_search api
remove Jieba::SetQuerySegmentThreshold
2016-05-02 16:18:36 +08:00
mayunyun
1aa0a32d90 code format check 2016-04-25 20:28:47 +08:00
mayunyun
669e971e3e new file: include/cppjieba/TextRankExtractor.hpp
Add TextRank Keyword Extractor to JiebaCpp
Add TextRank keyword extraction
2016-04-25 20:20:50 +08:00
yanyiwu
3befc42697 update KeywordExtractor::Word's printing format to json format 2016-04-19 16:00:53 +08:00
yanyiwu
29e085904d add log and unittest 2016-04-18 14:55:42 +08:00
yanyiwu
63e9c94fb7 add unicode decoding unittest 2016-04-18 14:37:17 +08:00
yanyiwu
6fa843b527 overload Cut functions, add location information into Word results 2016-04-17 23:39:57 +08:00
yanyiwu
b6703aba90 use offset instead of str in RuneStr 2016-04-17 22:50:32 +08:00
yanyiwu
e7a45d2dde remove LevelSegment 2016-04-17 22:23:00 +08:00
yanyiwu
42a73eeb64 make compiler happy 2016-04-17 22:11:58 +08:00
yanyiwu
dcced8561e remove namespace unicode 2016-04-17 21:59:10 +08:00
yanyiwu
6ff6fe1430 WordRange construct 2016-04-17 21:57:36 +08:00
yanyiwu
339e3ca772 big change: add RuneStr for the position of word in string 2016-04-17 17:30:05 +08:00
yanyiwu
c19736995c Add KeywordExtractor::Word and add more overloads of KeywordExtractor::Extract 2016-03-26 22:12:40 +08:00
yanyiwu
e6a2b47b87 change the return value of KeywordExtractor::Extract from bool to void 2016-03-26 01:16:44 +08:00
yanyiwu
5102b8a5c3 Change Jieba::Locate to be static function. 2016-03-26 01:14:48 +08:00
yanyiwu
7db3f87b5f remove info log for dict loading 2016-03-22 10:45:20 +08:00
yanyiwu
3e28b4bcb1 adjust code for limonp v0.5.5 to solve macro name conflicts 2016-02-19 16:15:23 +08:00
yanyiwu
14e09290c2 change Rune type from uint16_t to uint32_t to support more Chinese words 2016-02-18 14:54:03 +08:00