Version 14 of htmlparse

Updated 2005-08-09 11:15:21 by schlenk

htmlparse is a module in the tcllib library of Tcl code.

The htmlparse package provides commands that allow libraries and applications to parse HTML in a string into a representation of their choice. (From the man page [L1 ])

Documentation can be found at http://tcllib.sourceforge.net/doc/htmlparse.html


escargo 8 Aug 2005 - Once you have parsed the HTML file and have it in a tree (thanks to htmlparse::2tree), is there a convenient way to write the resulting tree back out as HTML? Or is that supposed to be obvious?

schlenk - As the tree is implemented via the struct::tree datastructure you should be able to simply its walk method with a simple formatting proc to serialize the tree back to html. The html package may be helpful there.


MSW Either it's me or htmlparse gets the structure of a HTML doc wrong.

 (description deleted)

schlenk Put a bug report on tcllib at SF for this.

MSW Done, #1008619.


Anybody know where to find an online document for the HTML DTD ?

Try the W3C: http://www.w3.org/TR/html4/sgml/dtd.html


Category Package, subset Tcllib