Now that I've got the code working to separate suspicious email from benign email, I have to figure out a way to store data about the benign email so that there's not 19 extra megs of nothing taking up space on the server. (Mo' code, mo' problems, if you will.)
So the next step is the one I had prepared for originally: Parsing email files. NBD, I've got most of the code for this written (specifically one beefy little sub that scans the file, snips out the from/to/subj/body, etc.) Just need to modify the sub so that it only returns from/to/subj/time/length and then make the script dump this in a CSV database.
"Que?" you say. "CSV? Greg, isn't there already some database server in place?"
Yes, there is. Do I know how to get to it? Not yet. I want a working solution ASAP, and I can build something to drop the CSV data into an actual database later in twenty minutes. Right now I need to get cracking on three major things.