Re: Multi-words synonyms matching

2012-06-05 Thread Bernd Fehling
Do you have test cases? What are you sending to your SynonymFilterFactory? What are you expecting it should return? What is it returning when setting to Version.LUCENE_33? What is it returning when setting to Version.LUCENE_36? Am 05.06.2012 10:56, schrieb O. Klein: > The reason multi word s

Re: Multi-words synonyms matching

2012-06-05 Thread O. Klein
The reason multi word synonyms work better if you use LUCENE_33 is because then Solr uses the SlowSynonymFilter instead of SynonymFilterFactory (FSTSynonymFilterFactory). But I don't know if the difference between them is a bug or not. Maybe someone has more insight? Bernd Fehling-2 wrote > >

Re: Multi-words synonyms matching

2012-06-01 Thread O. Klein
Looking for some more background information I stumbled upon https://issues.apache.org/jira/browse/LUCENE-3668. If you read the last post it confirms my issue. So maybe this is a bug? Bernd Fehling-2 wrote > > Are you sure with LUCENE_33 (Use of BitVector)? > > > Am 31.05.2012 17:20, schrieb

Re: Multi-words synonyms matching

2012-05-31 Thread Bernd Fehling
Are you sure with LUCENE_33 (Use of BitVector)? Am 31.05.2012 17:20, schrieb O. Klein: > I have been struggling with this as well and found that using LUCENE_33 gives > the best results. > > But as it will be deprecated this is no everlasting solution. May somebody > knows one? >

Re: Multi-words synonyms matching

2012-05-31 Thread O. Klein
I have been struggling with this as well and found that using LUCENE_33 gives the best results. But as it will be deprecated this is no everlasting solution. May somebody knows one? -- View this message in context: http://lucene.472066.n3.nabble.com/Multi-words-synonyms-matching-tp3898950p398704

Re: Multi-words synonyms matching

2012-05-29 Thread Lance Norskog
I recently have had the same use case. I wound up doing this: in both index and query time, the synonyms file is 'expand=false'. All multi-word synonyms map to one single-word synonym (per group). This way, only the main word is indexed or queried. If the synonym file changes, you have to re-index

Re: Multi-words synonyms matching

2012-05-29 Thread elisabeth benoit
Hello Bernd, Thanks a lot for your answer. I'll work on this. Best regards, Elisabeth 2012/5/29 Bernd Fehling > Hello Elisabeth, > > my synonyms.txt is like your 2nd example: > > naturwald, φυσικό\ δάσος, естествена\ гора, prírodný\ les, naravni\ gozd, > foresta\ naturale, natuurbos, natural\

Re: Multi-words synonyms matching

2012-05-29 Thread Bernd Fehling
Hello Elisabeth, my synonyms.txt is like your 2nd example: naturwald, φυσικό\ δάσος, естествена\ гора, prírodný\ les, naravni\ gozd, foresta\ naturale, natuurbos, natural\ forest, bosque\ natural, természetes\ erdő, natūralus\ miškas, prirodna\ šuma, dabiskais\ mežs, floresta\ natural, natursko

Re: Multi-words synonyms matching

2012-05-22 Thread elisabeth benoit
Hello Bernd, Thanks for your advice. I have one question: how did you manage to map one word to a multiwords synonym??? I've tried (in synonyms.txt) mairie, hotel de ville mairie, hotel\ de\ ville mairie => mairie, hotel de ville mairie => mairie, hotel\ de\ ville but nothing prevents mairi

Re: Multi-words synonyms matching

2012-05-15 Thread Bernd Fehling
Without reading the whole thread let me say that you should not trust the solr admin analysis. It takes the whole multiword search and runs it all together at once through each analyzer step (factory). But this is not how the real system works. First pitfall, the query parser is also splitting at w

Re: Multi-words synonyms matching

2012-05-14 Thread elisabeth benoit
Just for the record, I'd like to conclude this thread First, you were right, there was no behaviour difference between fq and q parameters. I realized that: 1) my synonym (hotel de ville) has a stopword in it (de) and since I used tokenizerFactory="solr.KeywordTokenizerFactory" in my synonyms de

Re: Multi-words synonyms matching

2012-04-25 Thread Erick Erickson
A little farther down the debug info output you'll find something like this (I specified fq=name:features) name:features so it may well give you some clue. But unless I'm reading things wrong, your q is going against a field that has much more information than the CATEGORY_ANALYZED field, is i

Re: Multi-words synonyms matching

2012-04-25 Thread elisabeth benoit
I'm not at the office until next Wednesday, and I don't have my Solr under hand, but isn't debugQuery=on giving informations only about q parameter matching and nothing about fq parameter? Or do you mean "parsed_filter_querie"s gives information about fq? CATEGORY_ANALYZED is being populated by a

Re: Multi-words synonyms matching

2012-04-24 Thread Erick Erickson
Elisabeth: What shows up in the debug section of the response when you add &debugQuery=on? There should be some bit of that section like: "parsed_filter_queries" My other question is "are you absolutely sure that your CATEGORY_ANALYZED field has the correct content?". How does it get populated?

Re: Multi-words synonyms matching

2012-04-24 Thread elisabeth benoit
yes, thanks, but this is NOT my question. I was wondering why I have multiple matches with q="hotel de ville" and no match with fq=CATEGORY_ANALYZED:"hotel de ville", since in both case I'm searching in the same solr fieldType. Why is q parameter behaving differently in that case? Why do the quot

Re: Multi-words synonyms matching

2012-04-24 Thread Jeevanandam
usage of q and fq q => is typically the main query for the search request fq => is Filter Query; generally used to restrict the super set of documents without influencing score (more info. http://wiki.apache.org/solr/CommonQueryParameters#q) For example: q="hotel de ville" ===>

Re: Multi-words synonyms matching

2012-04-24 Thread elisabeth benoit
Hello, I'd like to resume this post. The only way I found to do not split synonyms in words in synonyms.txt it to use the line in schema.xml where tokenizerFactory="solr.KeywordTokenizerFactory" instructs SynonymFilterFactory not to break synonyms into words on white spaces when parsing syn

Re: Multi-words synonyms matching

2012-04-11 Thread elisabeth benoit
oh, that's right. thanks a lot, Elisabeth 2012/4/11 Jeevanandam Madanagopal > Elisabeth - > > As you described, below mapping might suit for your need. > mairie => hotel de ville, mairie > > mairie gets expanded to "hotel de ville" and "mairie" at index time. So > "mairie" and "hotel de ville"

Re: Multi-words synonyms matching

2012-04-11 Thread Jeevanandam Madanagopal
Elisabeth - As you described, below mapping might suit for your need. mairie => hotel de ville, mairie mairie gets expanded to "hotel de ville" and "mairie" at index time. So "mairie" and "hotel de ville" searchable on document. However, still white space tokenizer splits at query time will be

Re: Multi-words synonyms matching

2012-04-11 Thread elisabeth benoit
<' mapping instead? Something < mairie Have you tried the "=>' mapping instead? Something > like > hotel de ville => mairie > might work for y

Re: Multi-words synonyms matching

2012-04-10 Thread Markus Jelsma
To map `mairie` to `hotel de ville` as single token you must escape your white space. mairie, hotel\ de\ ville This results in a problem if your tokenizer splits on white space at query time. On Tuesday 10 April 2012 16:39:21 Erick Erickson wrote: > Have you tried the "=>' mapping instead? So

Re: Multi-words synonyms matching

2012-04-10 Thread Erick Erickson
Have you tried the "=>' mapping instead? Something like hotel de ville => mairie might work for you. Best Erick On Tue, Apr 10, 2012 at 1:41 AM, elisabeth benoit wrote: > Hello, > > I've read several post on this issue, but can't find a real solution to my > multi-words synonyms matching problem