Web-Harvest

本项目在 Web爬虫工具 中的评分:   8
Web-Harvest is Open Source Web Data Extraction tool written in Java. It offers a way to collect desired Web pages and extract useful data from them. In order to do that, it leverages well established techniques and technologies for text/xml manipulation such as XSLT, XQuery and Regular Expressions. Web-Harvest mainly focuses on HTML/XML based web sites which still make vast majority of the Web content. On the other hand, it could be easily supplemented by custom Java libraries in order to augment its extraction capabilities.

其他Java Web爬虫工具 开源项目资源:

本网站对列举的开源项目、软件、源码、类库所评定的分值(PR, Progject Rank),是根据该项目的规模、复杂度、采用人数、开发人数、活跃度、说明文档、演示网站等诸多因素综合所给出主观评价,仅供你参考之用。
List of Companies, Suppliers, Distributors, Importers & Exporters
收藏本网站 | 联系我们 | 英文图书网 | 十万个为什么 | Sitemap生成器 | 国际商贸
Copyright © 2007 - 2012 Why and How