摘要:
The components that are involved in a crawl.1.Content Host This is the server that hosts/stores the content that your indexer is crawling. For example, if you have a content source that crawls a SharePoint site, the content host would be the web front end server that hosts the site. If you are crawl 阅读全文
摘要:
The indexing processContent Source(Start Addresses)->Protocol Handler(ie.HTTP)->IFilters(Office Docs, HTML files PDFs, etc.)->Word Breakers->Stemmers->Noise Word->index Catalog.Indexing process start with the content source and the start address(es). Once the protocol handler and I 阅读全文