Re: Having trouble with German compound words in Solr 4.7

2014-04-24 Thread Siegfried Goeschl
Hi Alistair, it seems that there are many ways to skin the cat so I describe the approach I used with SOLR 3.6 :-) * Using a patched DictionaryCompoundWordTokenFilterFactory in the "index" phase - so the german compound noun "Leinenhose" (linen trousers) would be indexed in addition to "Lein

Re: Having trouble with German compound words in Solr 4.7

2014-04-22 Thread Alistair
I've managed to solve this (in a quite hacky sort of way) by using filter queries and the edismax queryparser. I added in my solrconfig.xml the following parameters: edismax 75% Then when searching for multiple keywords (for example: schwarzkleid wenz, where wenz is a german brand name)

Re: Having trouble with German compound words in Solr 4.7

2014-04-21 Thread Alistair
Hi Siegfried, the debug shows that the separated keywords get OR'd together so a match to either keyword appears in the results. So if I am searching for: *keywords:schwarzkleid* this will get transformed to *keywords:schwarz keywords:kleid *which is equivalent to *keywords:schwarz OR keywords:kl

Re: Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Siegfried Goeschl
Hi Alistair, quick email before getting my plane - I worked with similar requirements in the past and tuning SOLR can be tricky * are you hitting the same SOLR query handler (application versus manual checking)? * turn on debugging for your application SOLR queries so you see what query is act

Re: Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Alistair
Hey Jack, thanks for the reply. I added autoGeneratePhraseQueries="true" to the fieldType and now it's giving me even more results! I'm not sure if the debug of my query will be helpful but I'll paste it just in case someone might have an idea. This produces 113524 results, whereas if I manually e

Re: Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Jack Krupansky
Make sure your field type has the autoGeneratePhraseQueries="true" attribute (default is false). q.op only applies to explicit terms, not to terms which decompose into multiple terms. Confusing? Yes! -- Jack Krupansky -Original Message- From: Alistair Sent: Friday, April 18, 2014 6:1