NLP tools: Chinese word2vec open-source projects, tutorials, and datasets

word2vec

Binary word2vec, GloVe, and Swivel vector files trained on a Chinese corpus

word2vec: https://code.google.com/p/word2vec/

glove: http://nlp.stanford.edu/projects/glove/

swivel: https://github.com/tensorflow/models/tree/master/swivel
http://arxiv.org/abs/1602.02215

Open-source projects

wordvectors

Pre-trained word vectors for 30+ languages

https://github.com/Kyubyong/wordvectors

chinese-word2vec

Binary word2vec, GloVe, and Swivel vector files trained on a Chinese corpus

https://github.com/to-shimo/chinese-word2vec
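As a rough illustration of how such a binary file could be read, here is a minimal sketch using only the Python standard library. It assumes the file follows the layout written by the original word2vec C tool (an ASCII header line with the vocabulary size and dimension, then for each word the word itself, a space, the float32 values, and a newline) and that the floats are little-endian. Whether the files in the repository above use exactly this layout is an assumption; the function names `read_word2vec_binary` and `cosine` are made up for this example.

```python
import math
import struct


def read_word2vec_binary(path, encoding="utf-8"):
    """Read {word: vector} from a file in the word2vec C tool's binary layout.

    Assumption: little-endian float32 values, UTF-8 encoded words.
    """
    vectors = {}
    with open(path, "rb") as f:
        # Header line: "<vocab_size> <dim>\n" in ASCII.
        vocab_size, dim = (int(x) for x in f.readline().split())
        for _ in range(vocab_size):
            # Read the word byte by byte until the separating space.
            chars = bytearray()
            while True:
                ch = f.read(1)
                if not ch:
                    raise EOFError("file ended inside a word")
                if ch == b" ":
                    break
                if ch != b"\n":  # skip the newline left after the previous vector
                    chars.extend(ch)
            word = chars.decode(encoding, errors="replace")
            # Read the vector as one block of dim float32 values.
            data = f.read(4 * dim)
            if len(data) != 4 * dim:
                raise EOFError("file ended inside a vector")
            vectors[word] = struct.unpack("<%df" % dim, data)
    return vectors


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)
```

With a downloaded file, usage would look like `vectors = read_word2vec_binary("path/to/model.bin")` (a placeholder path), after which `cosine(vectors[w1], vectors[w2])` gives a similarity score for two words present in the vocabulary. For large models a library loader is likely to be much faster than this byte-by-byte reader.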

Tutorials

Exploring word similarity in a Wikipedia corpus

http://www.52nlp.cn/tag/gensim

Clustering keywords with word2vec

http://blog.csdn.net/zhaoxinfan/article/details/11069485

Training Word2Vec Model on English Wikipedia by Gensim

http://textminingonline.com/training-word2vec-model-on-english-wikipedia-by-gensim

Datasets

Chinese Wikipedia dump

https://dumps.wikimedia.org/zhwiki/latest/zhwiki-latest-pages-articles.xml.bz2

Sogou Labs news corpus

http://www.sogou.com/labs/resource/list_news.php

More machine learning tutorials: http://www.tensorflownews.com/

Posted on 2017-10-01 16:08 by TensorFlowNews