I guess synonyms would give me a similar result as using regexes, like Jack wrote about.
I've thought about that, but I don't think it would be good enough. Substituting "k" for "ch" is easy enough, but the problem is that I have to think of every possible substitution in advance. I'd like "Fil*" to find Phillip, I'd like "Hen*" to find "Hansen", and so on. The possibilities are quite endless, and I can't think of them all. I can't limit myself to Norwegian names either, a lot of people living in Norway have names from other countries. I'd like "Moha*" to find "Mouhammed", etc.. Or am I too pessimistic? I haven't read enough about Ngrams yet, so I'm not sure if I've understood it properly. It divides the word into several pieces and tries to find one or more matches? Would that really help in my "Chr*" example? I guess you mean the combination of synonyms and ngrams? Is it possible to combine ngrams with a fuzzy query? So that every piece of a word is matched in a fuzzy way? Could that help me? I'll certainly look into ngrams more, thanks for the suggestion. Regards, Hågen On Oct 8, 2012, at 7:23 PM, Otis Gospodnetic wrote: > Hi, > > Consider looking into synonyms and ngrams. > > Otis > -- > Performance Monitoring - http://sematext.com/spm > On Oct 8, 2012 11:21 AM, "Hågen Pihlstrøm Hasle" <haagenha...@gmail.com> > wrote: > >> Hi! >> >> I'm quite new to Solr, I was recently asked to help out on a project where >> the previous "Solr-person" quit quite suddenly. I've noticed that some of >> our searches don't return the expected result, and I'm hoping you guys can >> help me out. >> >> We've indexed a lot of names, and would like to search for a person in our >> system using these names. We previously used Oracle Text for this, and we >> experience that Solr is much faster. So far so good! :) But when we try >> to use wildcards things start to to wrong. >> >> We're using Solr 3.4, and I see that some of our problems are solved in >> 3.6. Ref SOLR-2438: >> https://issues.apache.org/jira/browse/SOLR-2438 >> >> But we would also like to be able to combine wildcards with fuzzy >> searches, and wildcards with a phonetic filter. I don't see anything about >> phonetic filters in SOLR-2438 or SOLR-2921. ( >> https://issues.apache.org/jira/browse/SOLR-2921) >> Is it possible to make the phonetic filters MultiTermAware? >> >> Regarding fuzzy queries, in Oracle Text I can search for "chr%" ("chr*" in >> Solr..) and find both christian and kristian. As far as I understand, this >> is not possible in Solr, WildcardQuery and FuzzyQuery cannot be combined. >> Is this correct, or have I misunderstood anything? Are there any >> workarounds or filter-combinations I can use to achieve the same result? >> I've seen people suggest using a boolean query to combine the two, but I >> don't really see how that would solve my "chr*"-problem. >> >> As I mentioned earlier I'm quite new to this, so I apologize if what I'm >> asking about only shows my ignorance.. >> >> >> Regards, Hågen