And I closed the JIRA, see the comments. But the short form is that
it's not worth the effort because of the edge cases. Jack writes
up some of them; the short form is "what does stemming
do with terms like organiz* ". Sure, it would produce one token (which is
the main restriction on a MultiTermAw
Chiming in late here, just back from vacation. But off the top of my
head, I don't see any reason SnowballPorterFilterFactory shouldn't
be MultiTermAware.
I've created https://issues.apache.org/jira/browse/SOLR-3503 as
a placeholder.
Erick
On Fri, May 25, 2012 at 1:31 PM, wrote:
>> I don't kno
> I don't know the specific rules in these specific stemmers,
> but generally a
> "less aggressive" stemming (e.g., "plural-only") of
> "paintings" would be
> "painting", while a "more aggressive" stemming would be
> "paint". For some
> "aggressive" stemmers the stemmed word is not even a wor
ggressive" stemmers the stemmed word is not even a word.
It would be nice to have doc with some example words for each stemmer.
-- Jack Krupansky
-Original Message-
From: spr...@gmx.eu
Sent: Friday, May 25, 2012 5:59 AM
To: solr-user@lucene.apache.org
Subject: RE: Wildcard-Sear
012 03:25
> To: solr-user@lucene.apache.org
> Subject: Re: Wildcard-Search Solr 3.5.0
>
> I tried it and it does appear to be the
> SnowballPorterFilterFactory that
> normally does the accent folding but can't here because it is
> not multi-term
> aware. I did notic
k
version, and try to use GermanNormalizationFilterFactory, and that may be
good enough for your situation.
-- Jack Krupansky
-Original Message-
From: spr...@gmx.eu
Sent: Wednesday, May 23, 2012 10:16 AM
To: solr-user@lucene.apache.org
Subject: RE: Wildcard-Search Solr 3.5.0
I'd
> I'd guess that this is because SnowballPorterFilterFactory
> does not implement MultiTermAwareComponent. Not sure, though.
Yes, I think this hinders the automagically multiterm awarness to do it's
job.
Could an own analyzer chain with help? Like
described (very, very short, too short...) here:
I'd guess that this is because SnowballPorterFilterFactory does not implement
MultiTermAwareComponent. Not sure, though.
-Michael
> Maybe a filter like ISOLatin1AccentFilter that doesn't get
> applied when
> using wildcards? How do the terms actually appear in the index?
Bär get indexed as bar.
I use not ISOLatin1AccentFilter . My field def is this:
Maybe a filter like ISOLatin1AccentFilter that doesn't get applied when
using wildcards? How do the terms actually appear in the index?
Jens
On 05/23/2012 01:19 PM, spr...@gmx.eu wrote:
No one an idea?
Thx.
The text may contain "FooBar".
When I do a wildcard search like this: "Foo*" -
> -Original Message-
> From: Dmitry Kan [mailto:dmitry@gmail.com]
> Sent: Mittwoch, 23. Mai 2012 14:02
> To: solr-user@lucene.apache.org
> Subject: Re: Wildcard-Search Solr 3.5.0
>
> do umlauts arrive properly on the server side, no encoding
> issues?
Y
the umlauts but I have no idea what...
>
> > -Original Message-
> > From: Dmitry Kan [mailto:dmitry@gmail.com]
> > Sent: Mittwoch, 23. Mai 2012 13:36
> > To: solr-user@lucene.apache.org
> > Subject: Re: Wildcard-Search Solr 3.5.0
> >
> > wha
No. No hits for bä*.
It's something with the umlauts but I have no idea what...
> -Original Message-
> From: Dmitry Kan [mailto:dmitry@gmail.com]
> Sent: Mittwoch, 23. Mai 2012 13:36
> To: solr-user@lucene.apache.org
> Subject: Re: Wildcard-Search Solr 3.5.0
>
what about bä*->hits?
-- Dmitry
On Wed, May 23, 2012 at 2:19 PM, wrote:
> No one an idea?
>
> Thx.
>
>
> > > The text may contain "FooBar".
> > >
> > > When I do a wildcard search like this: "Foo*" - no hits.
> > > When I do a wildcard search like this: "foo*" - doc is
> > > found.
> >
> >
No one an idea?
Thx.
> > The text may contain "FooBar".
> >
> > When I do a wildcard search like this: "Foo*" - no hits.
> > When I do a wildcard search like this: "foo*" - doc is
> > found.
>
> Please see http://wiki.apache.org/solr/MultitermQueryAnalysis
Well, it works in 3.6. With one
> > The text may contain "FooBar".
> >
> > When I do a wildcard search like this: "Foo*" - no hits.
> > When I do a wildcard search like this: "foo*" - doc is
> > found.
>
> Please see http://wiki.apache.org/solr/MultitermQueryAnalysis
Well, it works in 3.6. With one exception: If I use german
> so your advice is to upgrade to 3.6?
Or, as a workaround, you can lowercase wildcard queries on the client side.
Hi Ahmet,
> Please see http://wiki.apache.org/solr/MultitermQueryAnalysis
so your advice is to upgrade to 3.6?
Thank you
> The text may contain "FooBar".
>
> When I do a wildcard search like this: "Foo*" - no hits.
> When I do a wildcard search like this: "foo*" - doc is
> found.
Please see http://wiki.apache.org/solr/MultitermQueryAnalysis
Hi,
I have a tokenized text field with german content:
The text may contain "FooBar".
When I do a wildcard search like this: "Foo*" - no hits.
When I
20 matches
Mail list logo