From 752ae03b34384380f8ff61a790e3a34ac61a75a2 Mon Sep 17 00:00:00 2001 From: wyy Date: Sat, 15 Mar 2014 23:11:22 +0800 Subject: [PATCH] add stop_words.utf8 --- dict/README.md | 6 ++++-- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/dict/README.md b/dict/README.md index 614e071..8879118 100644 --- a/dict/README.md +++ b/dict/README.md @@ -19,11 +19,13 @@ __对于MixSegment(混合MPSegment和HMMSegment两者)则同时使用以上两 ## 关键词抽取 -## idf.utf8 +### idf.utf8 IDF(Inverse Document Frequency) 在KeywordExtractor中,使用的是经典的TF-IDF算法,所以需要这么一个词典提供IDF信息。 - +### stop_words.utf8 + +停用词词典