Hi

For the spell check component, I set extendedResults to get the frequencies 
and then select the suggestion with the highest frequency. I understand that 
the spell check algorithm is based on edit distance. For example:

Query to Solr: Marien

Spell check suggestions returned: Marine (Freq: 120), Market (Freq: 900) and 
others. My dictionary here is built from the indexed words.

So I chose Market (higher frequency), which is wrong because my intent was 
Marine. Both have an edit distance of 2.
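To make the tie concrete, here is a minimal Python sketch of the selection 
logic as I understand it. It is not Solr's actual code: the function names 
levenshtein and pick_suggestion are made up for illustration, and I am 
assuming a plain Levenshtein distance (insert, delete, substitute, each 
costing 1) with frequency used only to break ties.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance: insert, delete, substitute each cost 1."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty prefix of b
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def pick_suggestion(query, candidates):
    """Hypothetical selector: smallest distance first, then highest frequency."""
    q = query.lower()
    best = min(candidates, key=lambda c: (levenshtein(q, c[0].lower()), -c[1]))
    return best[0]


# Suggestions and frequencies taken from the example above.
candidates = [("Marine", 120), ("Market", 900)]
distances = {word: levenshtein("marien", word.lower()) for word, _ in candidates}
chosen = pick_suggestion("Marien", candidates)
```

Here distances comes out as 2 for both words, so frequency alone decides and 
chosen is Market, which is exactly the behaviour I want to avoid.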

Now, how can I improve this algorithm so that it selects Marine instead of 
Market (based on something more than edit distance and frequency)?

Do I need to incorporate a "soundex" algorithm too?

I am looking for something simple that I can implement quickly.

I even tried Peter Norvig's spell corrector algorithm (which is great), 
but I ran into the same problem.
</PRE>
<PRE>
