Re: Need tokenization that finds part of stringvalue

2012-03-06 Thread PeterKerk
edismax did the trick! Thanks! -- View this message in context: http://lucene.472066.n3.nabble.com/Need-tokenization-that-finds-part-of-stringvalue-tp3785366p3805045.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Need tokenization that finds part of stringvalue

2012-03-06 Thread Ahmet Arslan
> @iorixxx: Sorry it took so long, had > some difficulties upgrading to 3.5.0 > > It still doesnt work. Here's what I have now: > > I copied text_general_rev from > http://svn.apache.org/repos/asf/lucene/dev/trunk/solr/example/solr/conf/schema.xml > to my schema.xml: >     class="solr.TextField"

Re: Need tokenization that finds part of stringvalue

2012-03-06 Thread PeterKerk
@iorixxx: Sorry it took so long, had some difficulties upgrading to 3.5.0 It still doesnt work. Here's what I have now: I copied text_general_rev from http://svn.apache.org/repos/asf/lucene/dev/trunk/solr/example/solr/conf/schema.xml to my schema.xml:

Re: Need tokenization that finds part of stringvalue

2012-03-04 Thread Ahmet Arslan
> @iorixxx > I tried making my title_search of type text_rev and tried > adding the > ReversedWildcardFilterFactory to my existing "text" type, > but in both cases > no luck. I was able to perform *query* types of searches with solr 3.5 distro. Here is what I did: Download apache-solr-3.5.0 Edit

Re: Need tokenization that finds part of stringvalue

2012-03-04 Thread PeterKerk
@iorixxx I tried making my title_search of type text_rev and tried adding the ReversedWildcardFilterFactory to my existing "text" type, but in both cases no luck. @Erick Erickson "On frequent method of doing leading and trailing wildcards is to use ngrams (as distinct from edgengrams). That in com

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread Ahmet Arslan
> @iorixxx: Where can I find that > example schema.xml? Please find text_general_rev at http://svn.apache.org/repos/asf/lucene/dev/trunk/solr/example/solr/conf/schema.xml > And when I find it, can I just make the title field which > currently is of > "text" type then of "text_rev" type? Yes,

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread Erick Erickson
On frequent method of doing leading and trailing wildcards is to use ngrams (as distinct from edgengrams). That in combination with phrase queries might work well in this case. You also might be surprised at how little space bigrams take, give it a test and see .. Best Erick On Thu, Mar 1, 2012

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread PeterKerk
@iorixxx: Where can I find that example schema.xml? I downloaded the latest version here: ftp://apache.mirror.easycolocate.nl//lucene/solr/3.5.0 And checked \example\example-DIH\solr\db\conf\schema.xml But no text_rev type is defined in there. And when I find it, can I just make the title field w

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread Ahmet Arslan
--- On Thu, 3/1/12, PeterKerk wrote: > From: PeterKerk > Subject: Re: Need tokenization that finds part of stringvalue > To: solr-user@lucene.apache.org > Date: Thursday, March 1, 2012, 6:59 PM > @iorixxx: yes, that is what I need. > But also when its IN the text, not &g

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread PeterKerk
@iorixxx: yes, that is what I need. But also when its IN the text, not necessarily at the beginning. So using the * character like: q=smart* the product is found, but when I do this: q=*mart* it isnt...why is that? -- View this message in context: http://lucene.472066.n3.nabble.com/Need-toke

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread Ahmet Arslan
> if title holds "smartphone" I want it to be found when > someone types > "martph" or "smar" or "smart". Peter, so you want to beginsWith startsWith type of search? You can use use wildcard search (with start operator) for this. e.g. &q=smar* Alternatively, if your index size is not huge, you

RE: Need tokenization that finds part of stringvalue

2012-03-01 Thread Dyer, James
[mailto:wun...@wunderwood.org] Sent: Thursday, March 01, 2012 9:59 AM To: solr-user@lucene.apache.org Subject: Re: Need tokenization that finds part of stringvalue I once used a spell checker to break up compound words. It was slow, but worked pretty well. wunder On Mar 1, 2012, at 5:53 AM,

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread Walter Underwood
I once used a spell checker to break up compound words. It was slow, but worked pretty well. wunder On Mar 1, 2012, at 5:53 AM, Erick Erickson wrote: > Right, there's nothing in Solr that I know of that'll help here. How would > a tokenizer understand that "smartphone" should be "smart" "phone"

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread PeterKerk
I think I didnt explain myself clearly: I need to be able to find substrings. So, its not that I'd expect Solr to find synonyms, but rather if a piece of text contains the searched text, for example: if title holds "smartphone" I want it to be found when someone types "martph" or "smar" or "smart"

Re: Need tokenization that finds part of stringvalue

2012-03-01 Thread Erick Erickson
Right, there's nothing in Solr that I know of that'll help here. How would a tokenizer understand that "smartphone" should be "smart" "phone"? There's no general solution for this issue. You can do domain-specific solutions with synonyms for instance, or some other word list that contains terms yo