Finding Duplicates
faster-dupemerge (direct link to perl script)
Ask Slashdot: How do I de-dupe a system with 4.2 million files