Hi For spell checking component I set extendedResults to get the frequencies and then select the word with the best frequency. I understand the spell check algorithm based on Edit Distance. For an example:
Query to Solr: Marien Spell Check Text Returned: Marine (Freq: 120), Market (Freq: 900) and others. My dictionary here is based on indexed words. So I chose Market (more frequency) however which is wrong as my intent was marine. Both have Edit Distance of 2. Now how can I improve this Algorithm to select marine instead of market (based on something more than edit distance and frequency stuff)? Do I have to incorporate some "soundex" algorithms too? I am looking for simple stuff which I can quickly implement. I even tried using Peter Norvig's spell corrector Algorithm (which is great) but again I ran in same problems. </PRE> <BR> ******************************************************************************************<BR>This message may contain confidential or proprietary information intended only for the use of the<BR>addressee(s) named above or may contain information that is legally privileged. If you are<BR>not the intended addressee, or the person responsible for delivering it to the intended addressee,<BR>you are hereby notified that reading, disseminating, distributing or copying this message is strictly<BR>prohibited. If you have received this message by mistake, please immediately notify us by<BR>replying to the message and delete the original message and any copies immediately thereafter.<BR> <BR> Thank you.~<BR> ******************************************************************************************<BR> FAFLD<BR> <PRE>