All the Perl that's Practical to Extract and Report
meeting page http://wiki.perlchina.org/BJPW-20090212>Time: Feb 12, 7pm - 9pm Location: Flow cafe & bar - Chengfu Street - Haidian - Beijing Map: Mobile: 158 1088 0868Title: A Firefox cluster driven by JavaScript, Perl, and PL/PgSQLSummary: In this talk, agentzh will present a Firefox cluster for extracting deep information from web pages even with AJAX contents, which is already being used in production. Various popular software like Firefox, Apache, PostgreSQL have been glued together using JavaScript, Perl, and OpenResty's web services. And Firefox's performance has been greatly improved by content prefetching and "hard caching". It will be shown that, this solution not only offers great opportunities for automated data extraction based on vision information (from the Gecko rendering engine), but also provide a way for scaling Firefox extensions on the cluster level. It's now the time to put frontend technologies like Firefox and JavaScript programming into very backend things like search engine crawling and content indexing. Everyone is welcome!