浙江省高等学校教师教育理论培训

微信搜索“毛凌志岗前心得”小程序

  博客园  :: 首页  :: 新随笔  :: 联系 :: 订阅 订阅  :: 管理

INDRI - Language modeling meets inference networks


INDRI
Language modeling meets inference networks

 

 

Indri is a new search engine from the Lemur project; a
cooperative effort between the University of Massachusetts
and Carnegie Mellon University
to build language modeling information retrieval tools.

Effective

  • Best-in-class ad hoc retrieval performance

Flexible

  • Supports popular structured query operators from INQUERY
  • Open source, with a flexible BSD-inspired license
  • Parses PDF, HTML, XML, and TREC documents
  • Word and PowerPoint parsing (Windows only)

Usable

  • Supports UTF-8 encoded text
  • Language independent tokenization of UTF-8 encoded documents.
  • Includes both command line tools and a Java user interface
  • API can be used from Java, PHP, or C++
  • Works on Windows, Linux, Solaris and Mac OS X

Powerful

  • Can be used on a cluster of machines for faster indexing and retrieval
  • Suffix-based wildcard term matching
  • Field retrieval
  • Passage retrieval
  • Scales to terabyte-sized collections

Related Links

posted on   lexus  阅读(267)  评论(0编辑  收藏  举报
努力加载评论中...
点击右上角即可分享
微信分享提示