Re: Wildcard-Search Solr 3.5.0

2012-06-03 Thread Erick Erickson
And I closed the JIRA, see the comments. But the short form is that it's not worth the effort because of the edge cases. Jack writes up some of them; the short form is "what does stemming do with terms like organiz* ". Sure, it would produce one token (which is the main restriction on a MultiTermAw

Re: Wildcard-Search Solr 3.5.0

2012-06-03 Thread Erick Erickson
Chiming in late here, just back from vacation. But off the top of my head, I don't see any reason SnowballPorterFilterFactory shouldn't be MultiTermAware. I've created https://issues.apache.org/jira/browse/SOLR-3503 as a placeholder. Erick On Fri, May 25, 2012 at 1:31 PM, wrote: >> I don't kno

RE: Wildcard-Search Solr 3.5.0

2012-05-25 Thread spring
> I don't know the specific rules in these specific stemmers, > but generally a > "less aggressive" stemming (e.g., "plural-only") of > "paintings" would be > "painting", while a "more aggressive" stemming would be > "paint". For some > "aggressive" stemmers the stemmed word is not even a wor

Re: Wildcard-Search Solr 3.5.0

2012-05-25 Thread Jack Krupansky
ggressive" stemmers the stemmed word is not even a word. It would be nice to have doc with some example words for each stemmer. -- Jack Krupansky -Original Message- From: spr...@gmx.eu Sent: Friday, May 25, 2012 5:59 AM To: solr-user@lucene.apache.org Subject: RE: Wildcard-Sear

RE: Wildcard-Search Solr 3.5.0

2012-05-25 Thread spring
012 03:25 > To: solr-user@lucene.apache.org > Subject: Re: Wildcard-Search Solr 3.5.0 > > I tried it and it does appear to be the > SnowballPorterFilterFactory that > normally does the accent folding but can't here because it is > not multi-term > aware. I did notic

Re: Wildcard-Search Solr 3.5.0

2012-05-24 Thread Jack Krupansky
k version, and try to use GermanNormalizationFilterFactory, and that may be good enough for your situation. -- Jack Krupansky -Original Message- From: spr...@gmx.eu Sent: Wednesday, May 23, 2012 10:16 AM To: solr-user@lucene.apache.org Subject: RE: Wildcard-Search Solr 3.5.0 I'd

RE: Wildcard-Search Solr 3.5.0

2012-05-23 Thread spring
> I'd guess that this is because SnowballPorterFilterFactory > does not implement MultiTermAwareComponent. Not sure, though. Yes, I think this hinders the automagically multiterm awarness to do it's job. Could an own analyzer chain with help? Like described (very, very short, too short...) here:

RE: Wildcard-Search Solr 3.5.0

2012-05-23 Thread Michael Ryan
I'd guess that this is because SnowballPorterFilterFactory does not implement MultiTermAwareComponent. Not sure, though. -Michael

RE: Wildcard-Search Solr 3.5.0

2012-05-23 Thread spring
> Maybe a filter like ISOLatin1AccentFilter that doesn't get > applied when > using wildcards? How do the terms actually appear in the index? Bär get indexed as bar. I use not ISOLatin1AccentFilter . My field def is this:

Re: Wildcard-Search Solr 3.5.0

2012-05-23 Thread Jens Grivolla
Maybe a filter like ISOLatin1AccentFilter that doesn't get applied when using wildcards? How do the terms actually appear in the index? Jens On 05/23/2012 01:19 PM, spr...@gmx.eu wrote: No one an idea? Thx. The text may contain "FooBar". When I do a wildcard search like this: "Foo*" -

RE: Wildcard-Search Solr 3.5.0

2012-05-23 Thread spring
> -Original Message- > From: Dmitry Kan [mailto:dmitry@gmail.com] > Sent: Mittwoch, 23. Mai 2012 14:02 > To: solr-user@lucene.apache.org > Subject: Re: Wildcard-Search Solr 3.5.0 > > do umlauts arrive properly on the server side, no encoding > issues? Y

Re: Wildcard-Search Solr 3.5.0

2012-05-23 Thread Dmitry Kan
the umlauts but I have no idea what... > > > -Original Message- > > From: Dmitry Kan [mailto:dmitry@gmail.com] > > Sent: Mittwoch, 23. Mai 2012 13:36 > > To: solr-user@lucene.apache.org > > Subject: Re: Wildcard-Search Solr 3.5.0 > > > > wha

RE: Wildcard-Search Solr 3.5.0

2012-05-23 Thread spring
No. No hits for bä*. It's something with the umlauts but I have no idea what... > -Original Message- > From: Dmitry Kan [mailto:dmitry@gmail.com] > Sent: Mittwoch, 23. Mai 2012 13:36 > To: solr-user@lucene.apache.org > Subject: Re: Wildcard-Search Solr 3.5.0 >

Re: Wildcard-Search Solr 3.5.0

2012-05-23 Thread Dmitry Kan
what about bä*->hits? -- Dmitry On Wed, May 23, 2012 at 2:19 PM, wrote: > No one an idea? > > Thx. > > > > > The text may contain "FooBar". > > > > > > When I do a wildcard search like this: "Foo*" - no hits. > > > When I do a wildcard search like this: "foo*" - doc is > > > found. > > > >

RE: Wildcard-Search Solr 3.5.0

2012-05-23 Thread spring
No one an idea? Thx. > > The text may contain "FooBar". > > > > When I do a wildcard search like this: "Foo*" - no hits. > > When I do a wildcard search like this: "foo*" - doc is > > found. > > Please see http://wiki.apache.org/solr/MultitermQueryAnalysis Well, it works in 3.6. With one

RE: Wildcard-Search Solr 3.5.0

2012-05-22 Thread spring
> > The text may contain "FooBar". > > > > When I do a wildcard search like this: "Foo*" - no hits. > > When I do a wildcard search like this: "foo*" - doc is > > found. > > Please see http://wiki.apache.org/solr/MultitermQueryAnalysis Well, it works in 3.6. With one exception: If I use german

RE: Wildcard-Search Solr 3.5.0

2012-05-20 Thread Ahmet Arslan
> so your advice is to upgrade to 3.6? Or, as a workaround, you can lowercase wildcard queries on the client side.

RE: Wildcard-Search Solr 3.5.0

2012-05-20 Thread spring
Hi Ahmet, > Please see http://wiki.apache.org/solr/MultitermQueryAnalysis so your advice is to upgrade to 3.6? Thank you

Re: Wildcard-Search Solr 3.5.0

2012-05-20 Thread Ahmet Arslan
> The text may contain "FooBar". > > When I do a wildcard search like this: "Foo*" - no hits. > When I do a wildcard search like this: "foo*" - doc is > found. Please see http://wiki.apache.org/solr/MultitermQueryAnalysis

Wildcard-Search Solr 3.5.0

2012-05-20 Thread spring
Hi, I have a tokenized text field with german content: The text may contain "FooBar". When I do a wildcard search like this: "Foo*" - no hits. When I