Nutch - an open-source web search engine

Rating of this project in the Search Engine category: 9

Nutch is a nascent effort to implement an open-source web search engine.

Web search is a basic requirement for internet navigation, yet the number of web search engines is decreasing. Today's oligopoly could soon be a monopoly, with a single company controlling nearly all web search for its commercial gain. That would not be good for users of the internet.

Nutch provides a transparent alternative to commercial web search engines. Only open-source search results can be fully trusted to be without bias. (Or at least their bias is public.) All existing major search engines have proprietary ranking formulas, and will not explain why a given page ranks as it does. Additionally, some search engines determine which sites to index based on payments, rather than on the merits of the sites themselves. Nutch, on the other hand, has nothing to hide and no motive to bias its results or its crawler in any way other than to try to give each user the best results possible.

Nutch aims to enable anyone to easily and cost-effectively deploy a world-class web search engine. This is a substantial challenge. To succeed, Nutch software must be able to:

  • fetch several billion pages per month
  • maintain an index of these pages
  • search that index up to 1000 times per second
  • provide very high quality search results
  • operate at minimal cost
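
To get a feel for the scale of the first and third targets, the short sketch below converts them into sustained rates. The figure of 3 billion pages for "several billion", the 30-day month, and the function names are assumptions made for illustration only; none of them come from the Nutch project.

```python
# Back-of-envelope conversion of the stated throughput targets.
# Assumptions (not from the Nutch project): "several billion" is taken
# as 3 billion pages, and a month is taken as 30 days.

SECONDS_PER_DAY = 24 * 60 * 60
DAYS_PER_MONTH = 30  # assumed month length


def required_fetch_rate(pages_per_month: int) -> float:
    """Average pages per second needed to fetch pages_per_month."""
    return pages_per_month / (DAYS_PER_MONTH * SECONDS_PER_DAY)


def peak_queries_per_day(queries_per_second: int) -> int:
    """Queries served in one day if the peak rate were sustained."""
    return queries_per_second * SECONDS_PER_DAY


if __name__ == "__main__":
    fetch_rate = required_fetch_rate(3_000_000_000)
    print(f"Sustained fetch rate: about {fetch_rate:.0f} pages/second")
    print(f"Queries per day at 1000/s: {peak_queries_per_day(1000):,}")
```

Under these assumptions the crawler has to sustain roughly a thousand page fetches per second around the clock, which is the same order of magnitude as the stated peak search load.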


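The score (PR, Project Rank) that this site assigns to each listed open-source project, software package, source code, or library is a subjective evaluation based on factors such as the project's scale, complexity, number of users, number of developers, activity, documentation, and demo sites. It is provided for your reference only.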
Copyright © 2007 - 2012 Why and How