jieba库词频分析

利用jieba库对《聊斋志异》进行词频分析
import jieba
jieba.setLogLevel(jieba.logging.INFO)
txt=open('聊斋志异.txt','r',encoding='utf-8').read()
words=jieba.lcut(txt)
counts={}
for word in words:
if len(word)==1:
continue

else:
counts[word]=counts.get(word,0)+1
items=list(counts.items())
items.sort(key=lambda x:x[1],reverse=True)
for i in range(20):
word,count=items[i]
print('{0:<10}{1:>5}'.format(word,count))

 

posted @ 2021-11-13 22:03  仰望半月的夜  阅读(175)  评论(0)    收藏  举报