If you've ever used the strings command on *nix systems, you know how useful it can be at times. I've found it relatively easy to modify my code which uses PPI to search for SQL injection attacks to be pstrings. Just added caching and hope this speeds things up quite a bit.
Er, the caching doesn't seem to work. Must have misread the docs.
Update: OK, caching works now. I did misread the docs.
Update 2: Processing 854 documents without caching, 19 minutes and 32 seconds. With caching is only 6 minutes and 49 seconds.