Re: Wildcards and fuzzy/phonetic query

2012-12-11 Thread Haagen Hasle
Thank you! I actually tried to look through Jira, but I didn't focus on the minor issues. For me, this is quite critical.. :-) Any chance of merging this into the 4.0.1 release? Regards, Haagen Den 11. des. 2012 kl. 12:45 skrev Ahmet Arslan: >> Lowercasing actually seems to work with Wildc

Re: Wildcards and fuzzy/phonetic query

2012-12-11 Thread Ahmet Arslan
> Lowercasing actually seems to work with Wildcard queries, > but not with fuzzy queries.  Are there any reasons why > I should experience such a difference? Hi Haagen, Yonik added this recently. https://issues.apache.org/jira/browse/SOLR-4076

Re: Wildcards and fuzzy/phonetic query

2012-12-10 Thread Haagen Hasle
Lowercasing actually seems to work with Wildcard queries, but not with fuzzy queries. Are there any reasons why I should experience such a difference? Regards, Haagen Den 10. des. 2012 kl. 13:24 skrev Haagen Hasle: > > It's been two months since I asked about wildcards and phonetic filters

Re: Wildcards and fuzzy/phonetic query

2012-12-10 Thread Haagen Hasle
It's been two months since I asked about wildcards and phonetic filters, and finally the task of upgrading Solr to version 4.0 was prioritized in our project. So the last couple of days I've been working on it. Another team member upgraded Solr from 3.4 to 4.0, and I've been making changes to

Re: Wildcards and fuzzy/phonetic query

2012-10-09 Thread Erick Erickson
It won't crash Solr if you include it, but it probably won't do what you expect either due to how wildcards are expanded. And it gets worse. DoubleMetaphone tries to reduce what it analyzes, well, phonetically with "close" letters (or multiple choices). Some phonetic filters change to fixed 4 lett

Re: Wildcards and fuzzy/phonetic query

2012-10-09 Thread Jan Høydahl
Hi, Also be sure to check out the new BeiderMorse phonetic: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.BeiderMorseFilterFactory which handles middle eastern and eastern european names very well. Phonetic algorithms use tons of rules for how to transform an input depending

Re: Wildcards and fuzzy/phonetic query

2012-10-09 Thread Haagen Hasle
I used the admin/analysis page (great tip, I had never used it before - thank you!) and it seems to me that the DoubleMetaphone filter converts "Hågen" to both "JN" and "KN". Will that crash the Solr analysis if I try to include this filter in the multiterm-analysis? Do you know where I can f

Re: Wildcards and fuzzy/phonetic query

2012-10-08 Thread Erick Erickson
To answer your first question, yes, you've got it right. If you define a multiterm section in your fieldType, whatever you put in that section gets applied whether the underlying class is MultiTermAware or not. Which means you can shoot yourself in the foot really bad ... Well, you have 6 or so po

Re: Wildcards and fuzzy/phonetic query

2012-10-08 Thread Hågen Pihlstrøm Hasle
I understand that I'm quickly reaching the boundaries of my Solr-competence when I'm supposed to read about "Expert Level" concepts.. :) I had already read it once, but now I read it again. Twice. And I'm not sure if I understand it correctly.. So let me ask a follow-up question: If I define

Re: Wildcards and fuzzy/phonetic query

2012-10-08 Thread Hågen Pihlstrøm Hasle
I guess synonyms would give me a similar result as using regexes, like Jack wrote about. I've thought about that, but I don't think it would be good enough. Substituting "k" for "ch" is easy enough, but the problem is that I have to think of every possible substitution in advance. I'd like

Re: Wildcards and fuzzy/phonetic query

2012-10-08 Thread Otis Gospodnetic
Hi, Consider looking into synonyms and ngrams. Otis -- Performance Monitoring - http://sematext.com/spm On Oct 8, 2012 11:21 AM, "Hågen Pihlstrøm Hasle" wrote: > Hi! > > I'm quite new to Solr, I was recently asked to help out on a project where > the previous "Solr-person" quit quite suddenly.

Re: Wildcards and fuzzy/phonetic query

2012-10-08 Thread Erick Erickson
whether phonetic filters can be multiterm aware: I'd be leery of this, as I basically don't quite know how that would behave. You'd have to insure that the algorithms changed the first parts of the words uniformly, regardless of what followed. I'm pretty sure that _some_ phonetic algorithms do no

Re: Wildcards and fuzzy/phonetic query

2012-10-08 Thread Jack Krupansky
slower than desirable. -- Jack Krupansky -Original Message- From: Hågen Pihlstrøm Hasle Sent: Monday, October 08, 2012 11:21 AM To: solr-user@lucene.apache.org Subject: Wildcards and fuzzy/phonetic query Hi! I'm quite new to Solr, I was recently asked to help out on a project where

Wildcards and fuzzy/phonetic query

2012-10-08 Thread Hågen Pihlstrøm Hasle
Hi! I'm quite new to Solr, I was recently asked to help out on a project where the previous "Solr-person" quit quite suddenly. I've noticed that some of our searches don't return the expected result, and I'm hoping you guys can help me out. We've indexed a lot of names, and would like to sear