全部搜索项
buster  ] [  bullseye  ] [  bookworm  ] [  trixie  ] [  sid  ]
[ 源代码: tagsoup  ]

软件包:libtagsoup-java(1.2.1+-1)

libtagsoup-java 的相关链接

Screenshot

Debian 的资源:

下载源码包 tagsoup

维护小组:

外部的资源:

相似软件包:

SAX-compliant parser for real-life HTML

TagSoup, a SAX-compliant parser written in Java that, instead of parsing well-formed or valid XML, parses HTML as it is found in the wild: poor, nasty and brutish, though quite often far from short. TagSoup is designed for people who have to process this stuff using some semblance of a rational application design.

By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML. TagSoup also includes a command-line processor that reads HTML files and can generate either clean HTML or well-formed XML that is a close approximation to XHTML.

TagSoup is designed as a parser, not a whole application; it isn't intended to permanently clean up bad HTML, as HTML Tidy does, only to parse it on the fly. Therefore, it does not convert presentation HTML to CSS or anything similar. It does guarantee well-structured results: tags will wind up properly nested, default attributes will appear appropriately, and so on.

标签: 实做语言: Java, 支持的格式: HTML, 超本文标记语言, works-with-format::xml, works-with::text

其他与 libtagsoup-java 有关的软件包

  • 依赖
  • 推荐
  • 建议
  • 增强

下载 libtagsoup-java

下载可用于所有硬件架构的
硬件架构 软件包大小 安装后大小 文件
all 98.3 kB170.0 kB [文件列表]