Commit Graph

  • f2b7183a71 use str.splitlines to avoid losing line breaks Dingyuan Wang 2015-02-12 12:39:14 +08:00
  • b14eb329e3 Merge pull request #237 from gumblex/master Sun Junyi 2015-02-12 11:27:25 +08:00
  • 872a7039f2 Merge branch 'master' of https://github.com/fxsjy/jieba Dingyuan Wang 2015-02-12 10:33:56 +08:00
  • f808ea0ebb use only one dict to store words and prefixes Dingyuan Wang 2015-02-12 10:31:52 +08:00
  • 4d7b515801 Merge branch 'master' of https://github.com/fxsjy/jieba fxsjy 2015-02-11 20:57:35 +08:00
  • 5bfa43a781 fix test scripts fxsjy 2015-02-11 20:46:48 +08:00
  • f3a53dd2da fix print() in tests Dingyuan Wang 2015-02-11 20:45:55 +08:00
  • a229041e58 Merge pull request #234 from yanyiwu/patch-2 Sun Junyi 2015-02-11 18:48:47 +08:00
  • 5d321cbccd Update README.md Yanyi Wu 2015-02-11 17:37:32 +08:00
  • 8cbb26a7b6 fix test_file.py fxsjy 2015-02-11 16:47:57 +08:00
  • 41b47b0593 Merge pull request #233 from gumblex/master Sun Junyi 2015-02-11 15:44:22 +08:00
  • 32a0e92a09 don't compile re every time; autopep8 Dingyuan Wang 2015-02-10 21:22:34 +08:00
  • 22bcf8be7a Merge master and jieba3k, make the code Python 2/3 compatible Dingyuan Wang 2015-02-10 20:54:55 +08:00
  • 1b0400aeda Merge pull request #232 from gumblex/jieba3k jieba3k Sun Junyi 2015-02-09 17:09:14 +08:00
  • caae26fbfa Merge pull request #231 from gumblex/master Sun Junyi 2015-02-09 16:50:43 +08:00
  • 4197dfb8fa store int directly in FREQ; small improvements Dingyuan Wang 2015-02-09 16:26:00 +08:00
  • 765fd6b7f0 store int directly in FREQ; small improvements Dingyuan Wang 2015-02-09 16:14:12 +08:00
  • c95f402e2b Merge pull request #214 from aszxqw/master Sun Junyi 2014-12-25 10:09:35 +08:00
  • 1d91072498 add iosjieba yanyiwu 2014-12-24 23:02:06 +08:00
  • 852a07c4f2 Merge pull request #211 from gumblex/jieba3k Sun Junyi 2014-12-20 18:35:43 +08:00
  • 7bcb128f5f fix textrank divided by zero; fix posseg.pair.__repr__ Dingyuan Wang 2014-12-20 00:12:42 +08:00
  • b08c3f8ed7 Merge pull request #205 from lynschinzer/master Sun Junyi 2014-12-05 20:13:51 +08:00
  • fea3aec6bd Fix divided by zero issue in case of words are not found in dict. Lin 2014-12-05 17:13:12 +08:00
  • 8be082017a Merge pull request #204 from gumblex/jieba3k Sun Junyi 2014-11-29 18:28:48 +08:00
  • 293dbbc390 Merge pull request #203 from gumblex/master Sun Junyi 2014-11-29 18:28:23 +08:00
  • 3dad899ec8 backport 2to3 scripts and changelog Dingyuan Wang 2014-11-29 16:12:25 +08:00
  • c6b386f65b update jieba3k Dingyuan Wang 2014-11-29 16:06:20 +08:00
  • 7b7c6955a9 complete the setup.py, fix #202 problem in posseg Dingyuan Wang 2014-11-29 15:33:42 +08:00
  • 8a2e7f0e7e Merge pull request #202 from nomaka/patch-1 Sun Junyi 2014-11-18 16:46:59 +08:00
  • 9cb76dd8b9 Update __init__.py Nomaka 2014-11-18 16:00:49 +08:00
  • 99748bfc17 Merge pull request #201 from skyerown/master Sun Junyi 2014-11-18 10:27:52 +08:00
  • a336e26403 为函数textrank增加参数allowPOS,并修改extract_tags的参数allowPOS与textrank保持一致。 walkskyer 2014-11-15 18:36:09 +08:00
  • bab5f362ba 将exstract_tags参数allowPOS转换为frozenset以减少查找时间。 walkskyer 2014-11-15 18:14:47 +08:00
  • 6b0da06481 merge from upstream Dingyuan Wang 2014-11-15 14:06:03 +08:00
  • 5c487dbcba update verson fxsjy 2014-11-15 13:46:27 +08:00
  • 447c1ded8c fix problem for python3.2 fxsjy 2014-11-15 13:44:30 +08:00
  • dd62477605 .gitignore中忽略pycharm项目文件 walkskyer 2014-11-15 13:33:13 +08:00
  • a5ecf70f71 update to v0.35 Dingyuan Wang 2014-11-14 20:58:29 +08:00
  • d82d2c18df 为关键字提取函数增加词性过滤功能 walkskyer 2014-11-13 22:26:22 +08:00
  • 315a411e52 version update fxsjy 2014-11-13 10:43:43 +08:00
  • ec68c21ea0 version update' fxsjy 2014-11-13 10:27:50 +08:00
  • 3eea28d6f4 Merge pull request #200 from skyerown/master Sun Junyi 2014-11-13 10:10:07 +08:00
  • 5571a0337a 修复stop words处理未考虑"\r"导致不能正常匹配的问题。 walkskyer 2014-11-12 22:33:27 +08:00
  • 40c0edfd99 Merge pull request #198 from gumblex/jieba3k Sun Junyi 2014-11-08 22:17:51 +08:00
  • 4a6140081e fix problems in auto2to3 Dingyuan Wang 2014-11-07 23:47:57 +08:00
  • 7a6caa0c3c port extract_tags, etc to jieba3k; add auto2to3 script Dingyuan Wang 2014-11-07 23:33:31 +08:00
  • 36bc9e18c6 Merge pull request #1 from fxsjy/master walkskyer 2014-11-07 21:35:22 +08:00
  • 7ce63e53b7 Merge pull request #197 from skyerown/master Sun Junyi 2014-11-07 11:07:19 +08:00
  • 6772f0282e 修复带权重测试脚本输出结果是调用顺序错误 walkskyer 2014-11-06 22:24:43 +08:00
  • a5944bb88e Merge pull request #196 from qinwf/master Sun Junyi 2014-11-04 12:29:42 +08:00
  • 77a831b8c1 Add jiebaR in README Qin Wenfeng 2014-11-04 11:59:40 +08:00
  • cf2aa88122 Merge pull request #195 from gumblex/master Sun Junyi 2014-11-01 12:54:57 +08:00
  • 751ff35eb5 improve extract_tags; unify extract_tags and testrank Dingyuan Wang 2014-10-31 23:15:51 +08:00
  • e3f3dcccba improve the loading and caching process Dingyuan Wang 2014-10-31 21:56:09 +08:00
  • 4cb1924d09 Merge pull request #193 from gumblex/jieba3k Sun Junyi 2014-10-25 15:29:49 +08:00
  • d6ef07a472 Merge pull request #192 from gumblex/master Sun Junyi 2014-10-25 15:29:26 +08:00
  • fd9f1f2c0e update README, textrank, etc. Dingyuan Wang 2014-10-25 14:23:37 +08:00
  • 9d2818b440 fix English part of README Dingyuan Wang 2014-10-25 13:16:30 +08:00
  • 31b7d11809 improve README Dingyuan Wang 2014-10-25 13:12:19 +08:00
  • a6119cc995 add custom dictionary to __main__; update README; slightly optimize textrank Dingyuan Wang 2014-10-25 12:59:36 +08:00
  • 0049b0c5b4 Merge pull request #191 from sing1ee/master Sun Junyi 2014-10-24 22:50:36 +08:00
  • 138d713e98 add some introduction of textrank zhangcheng 2014-10-24 22:41:51 +08:00
  • 4030d8ed86 Merge pull request #190 from sing1ee/master Sun Junyi 2014-10-24 22:20:05 +08:00
  • 6eb9f6149c add a simple implementation of textrank zhangcheng 2014-10-24 21:15:54 +08:00
  • 1850bd6d37 Update README.md Sun Junyi 2014-10-24 20:23:10 +08:00
  • f5ca87e088 merge change of @fukuball fxsjy 2014-10-23 15:59:08 +08:00
  • 10b86e90fb Update README.md Sun Junyi 2014-10-21 12:53:37 +08:00
  • ba87fcb01f remove trie, use prefix set instead fxsjy 2014-10-20 14:08:09 +08:00
  • 82bfffb6ed version update to 0.34 fxsjy 2014-10-20 13:35:13 +08:00
  • 56e8336af1 Merge pull request #188 from gumblex/jieba3k Sun Junyi 2014-10-19 19:43:48 +08:00
  • 4a93f21918 Merge pull request #187 from gumblex/master Sun Junyi 2014-10-19 19:43:30 +08:00
  • bb1e6000c6 fix version; fix spaces at end of line Dingyuan Wang 2014-10-19 10:57:46 +08:00
  • 14671d4feb fix __main__.py Dingyuan Wang 2014-10-19 10:41:09 +08:00
  • b367690eeb use prefix dict instead of trie, add a command line interface, and a few small improvements Dingyuan Wang 2014-10-19 10:32:23 +08:00
  • 51df77831b use prefix dict instead of trie, add a command line interface, and a few small improvements Dingyuan Wang 2014-10-18 22:22:14 +08:00
  • eb98eb9248 fix performance problem of extrag_tags fxsjy 2014-10-10 15:41:28 +08:00
  • 7f965e0aa3 Merge pull request #184 from keroro520/master Sun Junyi 2014-09-12 17:43:43 +08:00
  • 77b442fa88 fix issues (https://github.com/fxsjy/jieba/issues/125) keroro520 2014-09-12 13:42:05 +08:00
  • 8f52419386 Merge pull request #183 from gumblex/jieba3k Sun Junyi 2014-09-09 10:52:31 +08:00
  • 626b415152 fix dict.itervalues mistake Dingyuan Wang 2014-09-07 19:21:13 +08:00
  • 6a3f228c72 fix python3 stuff Dingyuan Wang 2014-09-07 18:50:10 +08:00
  • b16cf0d63f fix indent typo Dingyuan Wang 2014-09-06 23:37:54 +08:00
  • 6fad5fbb2c update to v0.33 Dingyuan Wang 2014-09-06 23:28:47 +08:00
  • fc511de012 Merge pull request #176 from fukuball/master Sun Junyi 2014-09-01 14:11:00 +08:00
  • 99ea59e88d Update README.md v0.33 Sun Junyi 2014-08-31 20:04:02 +08:00
  • 6eb43acc10 pip install jieba3k fxsjy 2014-08-31 20:01:54 +08:00
  • 40adb1c591 version 0.33 fxsjy 2014-08-31 19:26:26 +08:00
  • d432789cb4 fix typo Fukuball Lin 2014-08-06 17:56:05 +08:00
  • cf31a99bf6 將 Readme 中文和半形的英文、數字、符號之間插入空白 Fukuball Lin 2014-08-06 15:53:57 +08:00
  • e4d323c78b 更新 jieba 可以切換 idf 語料庫及 stop words 語料庫的說明 Fukuball Lin 2014-08-06 15:00:07 +08:00
  • 16d626d347 Merge pull request #174 from fukuball/master Sun Junyi 2014-08-06 10:36:10 +08:00
  • b658ee69cb 讓 jieba 可以自行增加 stop words 語料庫 Fukuball Lin 2014-08-06 03:35:16 +08:00
  • 7198d562f1 讓 jieba 可以切換 idf 語料庫 Fukuball Lin 2014-08-05 22:55:13 +08:00
  • 91e5b26f5f Merge pull request #165 from gumblex/jieba3k Sun Junyi 2014-06-22 10:23:58 +08:00
  • 8b07bce568 fix the u'xxx' string. Dingyuan Wang 2014-06-21 23:30:06 +08:00
  • 0d99ebce54 Merge pull request #164 from gumblex/jieba3k Sun Junyi 2014-06-15 19:14:28 +08:00
  • c04ccd0d12 Update to v0.32 according to the master branch. Dingyuan Wang 2014-06-14 22:31:13 +08:00
  • 81f77d7a08 Fix the re in enable_parallel. Dingyuan Wang 2014-06-14 15:22:13 +08:00
  • 473ac1df75 Merge pull request #162 from ShuraChow/master Sun Junyi 2014-06-11 17:04:23 +08:00
  • 7583f7760a fix issue #161 ShuraChow 2014-06-10 02:04:09 +08:00