【NLP】Resources of Coursera-NLP

Written By: stackupdown

I. Background
This repo is learning notes of the open course **Natural Language Processing** of Dan Jurafsky and Christopher Manning in Coursera.

- Course Videos:
You can watch the videos from youtube or coursera.The link can be reached from https://nlp.stanford.edu/manning/, which is currently [NLP Course in Youtube](https://www.youtube.com/playlist?list=PLoROMvodv4rOFZnDyrlW3-nI7tMLtmiJZ).
- The course slides is as follow:
https://web.stanford.edu/~jurafsky/NLPCourseraSlides.html

II. Resources

The following is some resources for researching NLP. It does not include some newly published resources like SQuAD or

Bert or XLNet.

N-grams

https://books.google.com/ngrams

Sentiment Analysis

http://sentiwordnet.isti.cnr.it/ SentiWordNet assigns to each synset of WordNet three sentiment scores: positivity, negativity, objectivity.

Extracting Relations

DBPedia: 1 billion RDF triples, 385 from English Wikipedia

20 Semantics

Q. What's the comparision between different corpuses?

 

III. Some Papers

A Primer on Neural Network Models for Natural Language Processing
A neural probabilistic language model
Efficient Estimation of Word Representations in Vector Space
Distributed Representations of Words and Phrases and their Compositionality
 

III. Opensource Software/Books

Speech and Language Processing

https://web.stanford.edu/~jurafsky/slp3/ 

nltk

It's written in python and now used mainly for research and teaching.

HanLP

https://github.com/hankcs/HanLP A series of toolkit for Chinese language processing(mainly), which is aimed

at production environment and now used in many opensource projects as a basic component. It is written in Java.

IV. Researchers

Sebstian Ruder (A well-known blog author)

http://ruder.io/

Kimiyoung

http://kimiyoung.github.io/

posted @   stackupdown  阅读(254)  评论(0编辑  收藏  举报
编辑推荐:
· 基于Microsoft.Extensions.AI核心库实现RAG应用
· Linux系列:如何用heaptrack跟踪.NET程序的非托管内存泄露
· 开发者必知的日志记录最佳实践
· SQL Server 2025 AI相关能力初探
· Linux系列:如何用 C#调用 C方法造成内存泄露
阅读排行:
· 无需6万激活码!GitHub神秘组织3小时极速复刻Manus,手把手教你使用OpenManus搭建本
· Manus爆火,是硬核还是营销?
· 终于写完轮子一部分:tcp代理 了,记录一下
· 别再用vector<bool>了!Google高级工程师:这可能是STL最大的设计失误
· 单元测试从入门到精通
点击右上角即可分享
微信分享提示