On Thu, Nov 20, 2003 at 10:03:51AM -0800, Eric Soroos wrote:
> >Well, as you have stated there is a non zero possibility to get the
> >same fingerprint for two different messageblocks.
>
> Non-zero, but on the order of 1 in 2^127 chance, or virtually never in
> the age of the universe, at least for md5. You can obviously get
> duplicates, since if you hash every 128 bit string into 128 bits,
> you're assured of getting a collision on the next string that you hash,
> if you haven't done so already. But as I've said, that's a major time
> investment.

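For a sense of scale, here is a rough sketch of the usual birthday-bound approximation for accidental collisions. It assumes the hash output is uniformly distributed, which is an idealization, and it says nothing about deliberately constructed collisions. The function name and the sample count of one billion blocks are made up for illustration.

```python
import math


def collision_probability(n_items: int, hash_bits: int = 128) -> float:
    """Approximate chance that at least two of n_items share a hash.

    Uses the birthday approximation p ~= 1 - exp(-n(n-1) / (2N)),
    where N = 2**hash_bits. Assumes uniformly distributed hashes.
    """
    space = 2 ** hash_bits
    exponent = -n_items * (n_items - 1) / (2 * space)
    # expm1 keeps precision when the probability is extremely small.
    return -math.expm1(exponent)


if __name__ == "__main__":
    # Hypothetical workload: one billion stored message blocks.
    print(collision_probability(10**9))
```

Under those assumptions the result for a billion blocks is far below anything that would show up in practice, which matches the "virtually never" estimate quoted above.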
What about adding a unique index on (fingerprint, blocksize)? A collision between two texts of exactly the same length would be even more unlikely, right? xn
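A minimal sketch of what I mean, using Python's built-in sqlite3 module as a stand-in for the real database. The table name, column names and the store_block helper are all hypothetical, not anything from the actual schema. The unique index rejects a second row with the same (fingerprint, blocksize) pair, and the helper then compares the stored body to tell a true duplicate from a collision.

```python
import hashlib
import sqlite3

# In-memory SQLite database as a stand-in; the schema below is hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE message_blocks ("
    " id INTEGER PRIMARY KEY,"
    " fingerprint TEXT NOT NULL,"
    " blocksize INTEGER NOT NULL,"
    " body BLOB NOT NULL)"
)
conn.execute(
    "CREATE UNIQUE INDEX message_blocks_fp_size"
    " ON message_blocks (fingerprint, blocksize)"
)


def store_block(db: sqlite3.Connection, body: bytes) -> str:
    """Insert a block; return 'stored', 'duplicate' or 'collision'."""
    fingerprint = hashlib.md5(body).hexdigest()
    try:
        with db:  # commits on success, rolls back on error
            db.execute(
                "INSERT INTO message_blocks (fingerprint, blocksize, body)"
                " VALUES (?, ?, ?)",
                (fingerprint, len(body), body),
            )
        return "stored"
    except sqlite3.IntegrityError:
        # The unique index fired: same fingerprint and same length.
        row = db.execute(
            "SELECT body FROM message_blocks"
            " WHERE fingerprint = ? AND blocksize = ?",
            (fingerprint, len(body)),
        ).fetchone()
        return "duplicate" if bytes(row[0]) == body else "collision"


if __name__ == "__main__":
    print(store_block(conn, b"hello"))
    print(store_block(conn, b"hello"))
    print(store_block(conn, b"world"))
```

The index alone cannot tell a real duplicate from a collision, so the byte comparison on conflict is the part that actually settles it. The blocksize column only narrows the set of rows that can conflict.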
