HtmlCleaner

本项目在 HTML解析器 中的评分:   7

HtmlCleaner is open-source HTML parser written in Java. HTML found on Web is usually dirty, ill-formed and unsuitable for further processing. For any serious consumption of such documents, it is necessary to first clean up the mess and bring the order to tags, attributes and ordinary text. For the given HTML document, HtmlCleaner reorders individual elements and produces well-formed XML. By default, it follows similar rules that the most of web-browsers use in order to create document object model. However, user may provide custom tag and rule set for tag filtering and balancing.



其他Java HTML解析器 开源项目资源:

本网站对列举的开源项目、软件、源码、类库所评定的分值(PR, Progject Rank),是根据该项目的规模、复杂度、采用人数、开发人数、活跃度、说明文档、演示网站等诸多因素综合所给出主观评价,仅供你参考之用。
List of Companies, Suppliers, Distributors, Importers & Exporters
收藏本网站 | 联系我们 | 英文图书网 | 十万个为什么 | Sitemap生成器 | 国际商贸
Copyright © 2007 - 2012 Why and How