Found on http://search.mnogo.ru/doc/msearch-howstore.html#sql-stor-crc

According to our tests, only 250 pairs of words share the same CRC
in a list of about 1,600,000 unique words. Most of these pairs (>90%) have
at least one misspelled word.
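
As a rough sanity check on that figure, here is a small sketch (in Python,
purely for illustration) of the usual birthday-style estimate. It assumes
that CRC32 values are spread uniformly over the 2^32 possible checksums,
which is my assumption and not something the quoted tests state. The
function name is made up for this example.

```python
CRC32_SPACE = 2 ** 32  # number of distinct 32-bit checksum values


def expected_colliding_pairs(n_words: int, space: int = CRC32_SPACE) -> float:
    """Expected number of word pairs that share a checksum.

    Assumption (not from the quoted tests): checksums are uniform and
    independent, so each of the n*(n-1)/2 pairs collides with
    probability 1/space.
    """
    pairs = n_words * (n_words - 1) / 2
    return pairs / space


if __name__ == "__main__":
    # Word count taken from the quoted test: about 1,600,000 unique words.
    print(round(expected_colliding_pairs(1_600_000)))
```

Under that assumption the estimate comes out at a few hundred pairs, which
is the same order of magnitude as the 250 pairs reported.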



----- Original Message -----
From: "Armand Turpel" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Sunday, February 22, 2004 12:21 PM
Subject: [PHP] crc32


Hi,

I'm currently working on a search engine for a website. I created a MySQL
table to search against, which contains the words of the website's
articles:

CREATE TABLE docs_words (
  crc32_word int(11) NOT NULL default '0',
  id_doc int(11) NOT NULL default '0');

As you can see, it is not the word itself that is indexed but the crc32
checksum of the word.
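
A minimal sketch of how such a key could be computed, written in Python for
illustration. One thing to watch: a CRC32 is an unsigned 32-bit value (0 to
2^32-1), while a signed MySQL INT column only holds -2147483648 to
2147483647, so the sketch folds the checksum into the signed range. The
function names, the lowercasing step and the UTF-8 encoding are all
assumptions for this example, not part of the schema above.

```python
import zlib


def to_signed32(value: int) -> int:
    """Map an unsigned 32-bit value onto the signed 32-bit range."""
    value &= 0xFFFFFFFF
    return value - 0x100000000 if value >= 0x80000000 else value


def word_key(word: str) -> int:
    """Checksum key for a word, sized to fit a signed 32-bit column.

    Assumption: words are lowercased and UTF-8 encoded before hashing,
    so "Hello" and "hello" end up with the same key.
    """
    return to_signed32(zlib.crc32(word.lower().encode("utf-8")))


if __name__ == "__main__":
    for w in ("search", "engine", "a"):
        print(w, word_key(w))
```

Declaring the column as INT UNSIGNED and storing the raw checksum would be
another option; the folding above is only needed when the column stays
signed.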

So my question is: how high is the probability that two different words have
the same crc32 checksum?
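
To put rough numbers on the question, here is a sketch in Python (for
illustration only) of the standard birthday approximation. It assumes the
checksums behave like uniformly random 32-bit values, which is an assumption
and may not hold exactly for real words. Under it, two given words collide
with probability 1/2^32, and the chance of at least one collision among n
distinct words is about 1 - exp(-n(n-1)/2^33). The function name is
hypothetical.

```python
import math

CRC32_SPACE = 2 ** 32  # number of distinct 32-bit checksum values


def prob_any_collision(n_words: int, space: int = CRC32_SPACE) -> float:
    """Approximate chance that at least two of n distinct words collide.

    Assumption: checksums are uniform and independent (birthday model).
    Uses expm1 so that very small probabilities keep their precision.
    """
    exponent = n_words * (n_words - 1) / (2 * space)
    return -math.expm1(-exponent)


if __name__ == "__main__":
    print("single pair:", 1 / CRC32_SPACE)
    for n in (1_000, 10_000, 100_000):
        print(n, "words:", prob_any_collision(n))
```

So with a vocabulary of tens of thousands of words, at least one collision
somewhere in the table becomes likely under this model, even though any
particular pair of words is very unlikely to collide.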

Thanks for any response.

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php
