Re: Having trouble with German compound words in Solr 4.7

2014-04-24 Thread Siegfried Goeschl
Hi Alistair, it seems that there are many ways to skin the cat so I describe the approach I used with SOLR 3.6 :-) * Using a patched DictionaryCompoundWordTokenFilterFactory in the "index" phase - so the german compound noun "Leinenhose" (linen trousers) would be indexed in addition to "Lein

Re: Having trouble with German compound words in Solr 4.7

2014-04-22 Thread Alistair
I've managed to solve this (in a quite hacky sort of way) by using filter queries and the edismax queryparser. I added in my solrconfig.xml the following parameters: edismax 75% Then when searching for multiple keywords (for example: schwarzkleid wenz, where wenz is a german brand name)

Re: Having trouble with German compound words in Solr 4.7

2014-04-21 Thread Alistair
Hi Siegfried, the debug shows that the separated keywords get OR'd together so a match to either keyword appears in the results. So if I am searching for: *keywords:schwarzkleid* this will get transformed to *keywords:schwarz keywords:kleid *which is equivalent to *keywords:schwarz OR keywords:kl

Re: Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Siegfried Goeschl
Hi Alistair, quick email before getting my plane - I worked with similar requirements in the past and tuning SOLR can be tricky * are you hitting the same SOLR query handler (application versus manual checking)? * turn on debugging for your application SOLR queries so you see what query is act

Re: Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Alistair
Hey Jack, thanks for the reply. I added autoGeneratePhraseQueries="true" to the fieldType and now it's giving me even more results! I'm not sure if the debug of my query will be helpful but I'll paste it just in case someone might have an idea. This produces 113524 results, whereas if I manually e

Re: Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Jack Krupansky
18, 2014 6:11 AM To: solr-user@lucene.apache.org Subject: Having trouble with German compound words in Solr 4.7 Hello all, I'm a fairly new Solr user and I need my search function to handle compound words in German. I've searched through the archives and found that Solr already has a Fil

Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Alistair
Hello all, I'm a fairly new Solr user and I need my search function to handle compound words in German. I've searched through the archives and found that Solr already has a Filter Factory made for such words called DictionaryCompoundWordTokenFilterFactory. I've already built a list of words that I