The nilsimsa alogrithm came up on the Ruby mailing list. It's a hashing algorithm that allows you to determine how similar two strings are.
I never saw it before but it looked interesting. A clever teacher could use this to determine if students are ripping off each other's homework, no?
Perl! CPAN! Now!
bart on 2002-06-16T12:04:53
Apparently, somebody recently uploaded
Digest::Nilsimsa on CPAN. Who's "VIPUL" AKA "chad" anyway? Oh, the archive includes a picture of him. Now we know.
;-)
Google also turned up
this mailing list message, so the idea isn't *that* brandnew. Still, I see no sign of Matt Sergeant's module on CPAN... But
SpamAssasin is.