Re: How to index long words with StandardTokenizerFactory?

2010-10-24 Thread Sergey Bartunov
It's a kind of research. There is no particular practical use case as far as I know. Do you know how to set all these max token lengths? On 24 October 2010 19:19, Yonik Seeley wrote: > On Sun, Oct 24, 2010 at 10:47 AM, Sergey Bartunov wrote: >> I did it just as you recommended

Re: How to index long words with StandardTokenizerFactory?

2010-10-24 Thread Sergey Bartunov
tMaxTokenLength(Integer.MAX_VALUE); >     return tokenizer; >   } > } > > positionIncrementGap="100"> >       >         >       >     > > --- On Sat, 10/23/10, Sergey Bartunov wrote: > >> From: Sergey Bartunov >> Subject: Re: How to index long words wit

Re: How to index long words with StandardTokenizerFactory?

2010-10-23 Thread Sergey Bartunov
t; >> 5) after that I recompile >> solr/example/webapps/solr.war On 23 October 2010 18:53, Ahmet Arslan wrote: > I think you should replace your new lucene-core-2.9.3-dev.jar in > \apache-solr-1.4.1\lib and then create a new solr.war under > \apache-solr-1.4.1\dist. And

Re: How to index long words with StandardTokenizerFactory?

2010-10-23 Thread Sergey Bartunov
3 October 2010 18:55, Yonik Seeley wrote: > On Fri, Oct 22, 2010 at 12:07 PM, Sergey Bartunov wrote: >> I'm trying to force solr to index words which length is more than 255 > > If the field is not a text field, the Solr's default analyzer is used, > which currently l

Re: How to index long words with StandardTokenizerFactory?

2010-10-23 Thread Sergey Bartunov
Yes. I did. Won't help. On 23 October 2010 17:45, Ahmet Arslan wrote: > Did you delete the folder Jetty_0_0_0_0_8983_solr.war_** under > apache-solr-1.4.1\example\work? > > --- On Sat, 10/23/10, Sergey Bartunov wrote: > >> From: Sergey Bartunov >> Subject:

Re: How to index long words with StandardTokenizerFactory?

2010-10-23 Thread Sergey Bartunov
look like?  Does StandardTokenizer think > it's just one token? > > What doesn't work?  What happens? > > Steve > >> -Original Message- >> From: Sergey Bartunov [mailto:sbos@gmail.com] >> Sent: Friday, October 22, 2010 3:18 PM >> To: sol

Re: How to index long words with StandardTokenizerFactory?

2010-10-22 Thread Sergey Bartunov
why you didn't get the results you wanted with your Lucene hack > - is it possible you have other Lucene jars in your Solr classpath? > > Steve > >> -Original Message- >> From: Sergey Bartunov [mailto:sbos@gmail.com] >> Sent: Friday, October 22, 2010 12:

How to index long words with StandardTokenizerFactory?

2010-10-22 Thread Sergey Bartunov
I'm trying to force solr to index words which length is more than 255 symbols (this constant is DEFAULT_MAX_TOKEN_LENGTH in lucene StandardAnalyzer.java) using StandardTokenizerFactory as 'filter' tag in schema configuration XML. Specifying the maxTokenLength attribute won't work. I'd tried to mak

Re: Search within a subset of documents

2010-10-11 Thread Sergey Bartunov
October 2010 21:25, Gora Mohanty wrote: > On Mon, Oct 11, 2010 at 8:20 PM, Sergey Bartunov wrote: >> Whether it will be enough effective if the subset is really large? > [...] > > If the subset of IDs is large, and disjoint (so that you cannot use ranges), > the query

Re: Search within a subset of documents

2010-10-11 Thread Sergey Bartunov
Whether it will be enough effective if the subset is really large? On 11 October 2010 18:39, Gora Mohanty wrote: > On Mon, Oct 11, 2010 at 7:00 PM, Sergey Bartunov wrote: >> Is it possible to use Solr for searching within a subset of documents >> represented by enumeration

Search within a subset of documents

2010-10-11 Thread Sergey Bartunov
Is it possible to use Solr for searching within a subset of documents represented by enumeration of document IDs?