nilsimsa algorithm

djberg96 on 2002-06-06T17:42:17

The nilsimsa algorithm came up on the Ruby mailing list. It's a hashing algorithm that allows you to determine how similar two strings are.

I had never seen it before, but it looked interesting. A clever teacher could use this to determine if students are ripping off each other's homework, no?
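
To make the "how similar are two strings" idea concrete, here is a rough Python sketch. It is NOT the real nilsimsa: the trigram bucketing via `zlib.crc32`, the threshold rule, and the names `sketch_digest` and `similarity` are all assumptions made up for illustration. It only shows the general shape of a similarity-preserving hash, where each text is boiled down to a 256-bit digest and two digests are scored by how many bits they share.

```python
import zlib


def sketch_digest(text: str) -> int:
    """Return a 256-bit similarity-preserving digest as an int.

    Simplified stand-in, not the real nilsimsa: every 3-byte window
    is hashed with CRC32 into one of 256 buckets, and a bit is set
    for each bucket whose count is above the average count.
    """
    counts = [0] * 256
    data = text.lower().encode("utf-8")
    for i in range(len(data) - 2):
        bucket = zlib.crc32(data[i:i + 3]) % 256
        counts[bucket] += 1

    threshold = sum(counts) / 256
    digest = 0
    for bucket, count in enumerate(counts):
        if count > threshold:
            digest |= 1 << bucket
    return digest


def similarity(a: str, b: str) -> int:
    """Score two texts from -128 (all bits differ) to 128 (all bits match)."""
    differing_bits = bin(sketch_digest(a) ^ sketch_digest(b)).count("1")
    return 128 - differing_bits


if __name__ == "__main__":
    original = "The quick brown fox jumps over the lazy dog near the river bank."
    copied = "The quick brown fox leaps over the lazy dog near the river bank."
    print(similarity(original, original))
    print(similarity(original, copied))
```

A lightly edited copy changes only a handful of trigrams, so most digest bits survive and the score stays high, which is what would make the homework check plausible.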


Perl! CPAN! Now!

bart on 2002-06-16T12:04:53

Apparently, somebody recently uploaded Digest::Nilsimsa to CPAN. Who's "VIPUL" AKA "chad" anyway? Oh, the archive includes a picture of him. Now we know. ;-)

Google also turned up this mailing list message, so the idea isn't *that* brand new. Still, I see no sign of Matt Sergeant's module on CPAN... But SpamAssassin is.