On Thu, 14 Jan 2010 11:09:41 -0800 (PST) Otis Gospodnetic <otis_gospodne...@yahoo.com> wrote: [...] > If you are using Solr in some clever, interesting, or unusual way > and are willing to share this information, please get in touch. > 5 to max 10 pages (soft limits) per study is what we are hoping > for. Feel free to respond on the list or reply to me directly. [...]
We have been getting to grips with Solr over the last couple of months, and while I am not sure how interesting this is to people outside of India, one of the things that we have just finished a beta version of is phonetic filters and spell-checking components for Solr, dealing with Indian languages. The aim is to have these work both for Unicode content/search terms, and for Indian languages transliterated into English. The latter is useful as many people, especially current computer users in India, find it more comfortable to type in transliterated English. These components use the standard Solr facilities, as well as established open-source spell-checking libraries like aspell, and the design goal includes fuzzy matches, such as between "Amitav" and "Amitabh", as there is often a fair amount of variance in English transliteration. We see great potential for this as there is already a large amount of content in Indian language, and the government of India is putting in huge amounts of effort into generating more content. Please do let me know if this sounds interesting as a case study. Regards, Gora