I'm writing a program to find duplicates of files.
I have two folders, in which I have to find duplicates. In the worst case scenario i would have to compare all the files with each other. I was thinking to generate the checksum of each file, compare the checksums and then if the checksums are equal, perform a byte-by-byte check to be ensure the files are exactly the same.
The question is what checksum generator will be fast enough to waste time on it instead of just checking byte-by-byte?