摘要: The components that are involved in a crawl.1.Content Host This is the server that hosts/stores the content that your indexer is crawling. For example, if you have a content source that crawls a SharePoint site, the content host would be the web front end server that hosts the site. If you are crawl 阅读全文
posted @ 2012-05-20 21:00 l'oiseau 阅读(406) 评论(0) 推荐(0) 编辑
摘要: The indexing processContent Source(Start Addresses)->Protocol Handler(ie.HTTP)->IFilters(Office Docs, HTML files PDFs, etc.)->Word Breakers->Stemmers->Noise Word->index Catalog.Indexing process start with the content source and the start address(es). Once the protocol handler and I 阅读全文
posted @ 2012-05-20 20:58 l'oiseau 阅读(168) 评论(0) 推荐(0) 编辑