摘要: 1、项目选则 Content schema definition & Content Pipeline 要求:定义这个网站需要的内容结构, 并从爬到的内容中抽取元数据 (meta data), 并支持标签, 翻译等功能。 a.Define a schema of "online education Q&A", find out entities and their relationships. b.Input new content into pipeline, then merge it into existing content, under schem 阅读全文
posted @ 2012-10-20 23:56 teamshit 阅读(281) 评论(0) 推荐(0) 编辑