摘要: 直接上源码吧tokenizer类:#_*_encoding:utf-8_*_from ctypes import *class tokenizer: def __init__(self): self._stext=['、','“','”',',','。','《','》',':',';','!','‘','’','?','?','!','·& 阅读全文
posted @ 2012-01-07 11:36 app_ 阅读(2619) 评论(1) 推荐(0) 编辑