摘要:
The original article is an excellent one to explain How Google chrome use bloom filtes, so I just quote the article below.Nice Bloom filter applicationhttp://blog.alexyakunin.com/2010/03/nice-bloom-filter-application.htmlToday I accidentally found a couple of interesting files in one ofGoogle Chrome 阅读全文
摘要:
Bloom Filters by Examplehttp://llimllib.github.com/bloomfilter-tutorial/A Bloom filter is a data structure designed to tell you, rapidly and memory-efficiently, whether an element is present in a set.The price paid for this efficiency is that a Bloom filter is aprobabilistic data structure: it tells 阅读全文
摘要:
1 Bloom filterWhen we write crawler, if we have to crawler millions websites, we need to check whether a website is crawled or not,So we need both space and time efficient algorithoms to achieve this goal, Bloom filter is our choice.Bloom filter is aprobabilistic data structure: it tells us that the 阅读全文
摘要:
1 Scrapy, Google Scholar and MongoDB contains code about how to use Scrapy in bitbucket.2 Writing a spider in 10 mins using Scrapy This is a simple example. 阅读全文