mirror of https://github.com/yanyiwu/cppjieba.git
synced 2025-07-18 00:00:12 +08:00
add stop_words.utf8
parent d96c37d372
commit 752ae03b34
@@ -19,11 +19,13 @@ __For MixSegment (a mix of MPSegment and HMMSegment), both of the above two

 ## Keyword Extraction

-## idf.utf8
+### idf.utf8

 IDF (Inverse Document Frequency)
 KeywordExtractor uses the classic TF-IDF algorithm, so it needs a dictionary like this one to supply the IDF values.

+### stop_words.utf8

+Stop-word dictionary

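To illustrate how an IDF dictionary and a stop-word dictionary typically fit into TF-IDF keyword scoring, here is a minimal sketch. It is not cppjieba's implementation: the function name `ExtractKeywordsSketch`, the `default_idf` fallback for words missing from the IDF table, and the in-memory maps standing in for `idf.utf8` and `stop_words.utf8` are all assumptions made for the example. The input is assumed to be already segmented into words.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_map>
#include <unordered_set>
#include <utility>
#include <vector>

// A scored keyword: (word, TF-IDF weight).
using ScoredWord = std::pair<std::string, double>;

// Hypothetical helper, not part of cppjieba.
// `idf` stands in for the contents of idf.utf8 and `stop_words` for
// stop_words.utf8; `default_idf` is an assumed fallback for unknown words.
std::vector<ScoredWord> ExtractKeywordsSketch(
    const std::vector<std::string>& words,
    const std::unordered_map<std::string, double>& idf,
    const std::unordered_set<std::string>& stop_words,
    double default_idf,
    std::size_t top_n) {
  assert(default_idf >= 0.0);

  // Term frequency: count each word, skipping stop words entirely.
  std::unordered_map<std::string, double> tf;
  for (const std::string& w : words) {
    if (stop_words.count(w) == 0) {
      tf[w] += 1.0;
    }
  }

  // Weight each term frequency by the word's IDF value.
  std::vector<ScoredWord> scored;
  scored.reserve(tf.size());
  for (const auto& entry : tf) {
    auto it = idf.find(entry.first);
    const double weight = (it != idf.end()) ? it->second : default_idf;
    scored.emplace_back(entry.first, entry.second * weight);
  }

  // Highest score first; ties are broken alphabetically for stable output.
  std::sort(scored.begin(), scored.end(),
            [](const ScoredWord& a, const ScoredWord& b) {
              if (a.second != b.second) return a.second > b.second;
              return a.first < b.first;
            });

  if (scored.size() > top_n) {
    scored.resize(top_n);
  }
  return scored;
}
```

Filtering stop words before counting keeps very frequent function words out of the ranking even when the IDF table has no entry for them, which is the usual motivation for shipping a stop-word list alongside the IDF data.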