2014年2月15日

摘要: IntroductionThis article is the first in a short series in which I intend to explain how CPython[1]implements and uses symbol tables in its quest to compile Python source code into bytecode. In this part I will explain what a symbol table is and show how the general concepts apply to Python. In the 阅读全文

posted @ 2014-02-15 23:42 wintor12 阅读(375) 评论(0) 推荐(0) 编辑

摘要: >>> x = y = z = 0 # Zero x, y and za, b = 0, 1>>> tax = 12.5 / 100>>> price = 100.50>>> price * tax12.5625>>> price + _113.0625>>> range(10)[0, 1, 2,... 阅读全文

posted @ 2014-02-15 14:16 wintor12 阅读(101) 评论(0) 推荐(0) 编辑


2014年1月30日

摘要: 定义来自维基百科Tokenizationis the process of breaking a stream of text up into words, phrases, symbols, or other meaningful elements called tokens. The list of tokens becomes input for further processing such asparsingortext mining.Parsingorsyntactic analysisis the process of analysing astringof symbols, e 阅读全文

posted @ 2014-01-30 01:00 wintor12 阅读(294) 评论(0) 推荐(0) 编辑


2014年1月29日

摘要: 昨天实验室一位刚进组的同学发邮件来问我如何查找学术论文,这让我想起自己刚读研究生时茫然四顾的情形:看着学长们高谈阔论领域动态,却不知如何入门。经过研究生几年的耳濡目染,现在终于能自信地知道去哪儿了解最新科研动态了。我想这可能是初学者们共通的困惑,与其只告诉一个人知道,不如将这些Folk Knowledge写下来,来减少更多人的麻烦吧。当然,这个总结不过是一家之谈,只盼有人能从中获得一点点益处,受个人认知所限,难免挂一漏万,还望大家海涵指正。1.国际学术组织、学术会议与学术论文自然语言处理(natural language processing,NLP)在很大程度上与计算语言学(computat 阅读全文

posted @ 2014-01-29 04:00 wintor12 阅读(380) 评论(0) 推荐(0) 编辑


2014年1月28日

摘要: As well as providing background information for research, scientific publications can be processed to transform textual information into database content or complex networks and can be integrated with existing knowledge resources to suggest novel hypotheses.The latest developments in text-mining sol 阅读全文

posted @ 2014-01-28 23:50 wintor12 阅读(294) 评论(0) 推荐(0) 编辑

摘要: -cp 如果包不在系统环境变量里,要用此来导入包 (注意在windows下包之间用;隔开!)-Xmx 设置虚拟机大小,如果程序占内存,需要用这个来设 阅读全文

posted @ 2014-01-28 23:50 wintor12 阅读(94) 评论(0) 推荐(0) 编辑


2013年11月9日

摘要: In Git, it basically takes a picture of what all your files look like at that moment and stores a reference to that snapshot. To be efficient, if files have not changed, Git doesn’t store the file again—just a link to the previous identical file it has already stored.Everything in Git is check-summe 阅读全文

posted @ 2013-11-09 03:50 wintor12 阅读(220) 评论(0) 推荐(0) 编辑


2013年11月1日

摘要: Answering natural-language questions may often involve identifying hidden associations and implicit relationships.The unifying theme for answering common-bond questions and missing-link questions is the need to identify concepts that are closely related to those given in the question.In IBM Watson, 阅读全文

posted @ 2013-11-01 03:40 wintor12 阅读(168) 评论(0) 推荐(0) 编辑


2013年10月30日

摘要: 文章出自http://homepages.inf.ed.ac.uk/lzhang10/slm.htmlThe goal of Statistical Language Modeling is to build a statistical language model that can estimate the distribution of natural language as accurate as possible. A statistical language model (SLM) is a probability distribution P(s) over strings S t 阅读全文

posted @ 2013-10-30 05:26 wintor12 阅读(308) 评论(0) 推荐(0) 编辑


Copyright © 2024 wintor12
Powered by .NET 8.0 on Kubernetes