Simon St Laurent pointed me at Tag Soup, a Java solution to the messy HTML problem. I can't find a similar system for Perl, unfortunately.
--Nat
HTML::TreeBuilder
pudge on 2003-07-08T23:17:40
It works. And not just because Sean Burke wrote it. OK, maybe that is why.
Also, in Slash, I have some code that adds in missing HTML tags and drops stray closing ones, if you are interested. So when you post a comment in Slash that has:
<B>fooey!</I> <I>you suck!
It converts it to:
<B>fooey! <I>you suck!</I></B>
etc.
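
For anyone who wants to see the general idea spelled out, here is a rough sketch of that kind of tag balancing. It is not the Slash code and not HTML::TreeBuilder; it is a Python illustration built on the standard library's html.parser, and the names TagBalancer and balance_tags are made up for this example. It assumes a simple rule: drop a closing tag that has no matching open tag, close any tags left open inside an element when that element is closed, and close whatever is still open at the end of the input.

```python
import re
from html import escape
from html.parser import HTMLParser

# Elements that never take a closing tag, so they are never pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TagBalancer(HTMLParser):
    """Hypothetical helper: drops stray close tags, closes unclosed ones."""

    def __init__(self):
        super().__init__()
        self.open_tags = []  # stack of (lowercased name, name as written)
        self.out = []        # output fragments, joined in result()

    def handle_starttag(self, tag, attrs):
        text = self.get_starttag_text()
        self.out.append(text)  # keep the start tag exactly as written
        if tag not in VOID_TAGS:
            # HTMLParser lowercases tag names; recover the original spelling
            # so the closing tag we generate matches the author's case.
            original = re.match(r"<([^\s/>]+)", text).group(1)
            self.open_tags.append((tag, original))

    def handle_endtag(self, tag):
        if not any(name == tag for name, _ in self.open_tags):
            return  # stray closing tag with nothing to close: drop it
        # Close everything opened since the matching start tag, then the tag itself.
        while self.open_tags:
            name, original = self.open_tags.pop()
            self.out.append("</%s>" % original)
            if name == tag:
                break

    def handle_data(self, data):
        # Character references arrive decoded, so re-escape the text.
        self.out.append(escape(data, quote=False))

    def result(self):
        # Close whatever is still open, innermost first.
        closers = ["</%s>" % original for _, original in reversed(self.open_tags)]
        return "".join(self.out) + "".join(closers)


def balance_tags(markup):
    parser = TagBalancer()
    parser.feed(markup)
    parser.close()
    return parser.result()


print(balance_tags("<B>fooey!</I> <I>you suck!"))
# prints: <B>fooey! <I>you suck!</I></B>
```

This sketch silently discards comments and doctype declarations and makes no attempt to whitelist tags, so treat it as an illustration of the stack idea rather than something to put in front of untrusted input.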