Ted

  博客园 :: 首页 :: 博问 :: 闪存 :: 新随笔 :: 联系 :: 订阅 订阅 :: 管理 ::

2012年5月11日

摘要: The original article is an excellent one to explain How Google chrome use bloom filtes, so I just quote the article below.Nice Bloom filter applicationhttp://blog.alexyakunin.com/2010/03/nice-bloom-filter-application.htmlToday I accidentally found a couple of interesting files in one ofGoogle Chrome 阅读全文
posted @ 2012-05-11 00:30 wufawei 阅读(443) 评论(0) 推荐(0) 编辑

摘要: Bloom Filters by Examplehttp://llimllib.github.com/bloomfilter-tutorial/A Bloom filter is a data structure designed to tell you, rapidly and memory-efficiently, whether an element is present in a set.The price paid for this efficiency is that a Bloom filter is aprobabilistic data structure: it tells 阅读全文
posted @ 2012-05-11 00:27 wufawei 阅读(295) 评论(0) 推荐(0) 编辑

摘要: 1 Bloom filterWhen we write crawler, if we have to crawler millions websites, we need to check whether a website is crawled or not,So we need both space and time efficient algorithoms to achieve this goal, Bloom filter is our choice.Bloom filter is aprobabilistic data structure: it tells us that the 阅读全文
posted @ 2012-05-11 00:21 wufawei 阅读(180) 评论(0) 推荐(0) 编辑

摘要: 1 Scrapy, Google Scholar and MongoDB contains code about how to use Scrapy in bitbucket.2 Writing a spider in 10 mins using Scrapy This is a simple example. 阅读全文
posted @ 2012-05-11 00:09 wufawei 阅读(288) 评论(0) 推荐(0) 编辑