Screen scraping. The bane of web programming. Thankfully, its hideousness has spurred a bevy of alternatives to transmitting information over the web - XML-RPC, SOAP, RDDL, RDF, RSS, and various task-specific XML document types and other bizarre file formats.
I'm working on an article at the moment that deals with grabbing information out of an HTML document using XSLT of all things. It's convinced me that what's been lacking is a good way to concisely say I want this out of an HTML document, and do it reliably, interactively and robustly. Alternatives in Perl are a little more painful.
Watch this space for more details.
Re:TIMTOWTDI
ziggy on 2001-11-27T14:01:55
That's step two.:-) The idea here is to support quick turnaround prototyping with XSLT (presuming you know XSLT). Once that's done, copying the XPath expressions back to Perl is trivial (and leads back to single-language programming).