how to implemente a query like " like '%pattern%' "

2011-09-26 Thread libnova
Hi all. how can we do a query similar to 'like' ? if I have this phrase like a single token in the index: "This phrase has various words" (using KeywordTokenizerFactory) and i like a exact match of: "phrase has various" or "various words" form instance... How can i do this?? Thanks a lot.

RE: how to implemente a query like " like '%pattern%' "

2011-09-27 Thread libnova
7;t switch to a > WhitespaceTokenizer for example? then you could use a simple phrase query > for your search case. if you need everything as a Token, you could use a > copyfield and duplicate the field and have them both. > > Are those acceptable options for you? > > Tomá

response time for pdf indexing

2011-06-22 Thread libnova
Hi ! We are using Zend Search based on Lucene. Our indexing pdf consultations take longer than 2 seconds. We want to change to solr to try to solve this problem. i. Can anyone tell me the response time for querys on pdf documents on solr? ii. Can anyone tell me some strategies to reduce

ideas for indexing large amount of pdf docs

2011-08-13 Thread Rode Gonzalez (libnova)
Hi all, I want to ask about the best way to implement a solution for indexing a large amount of pdf documents between 10-60 MB each one. 100 to 1000 users connected simultaneously. I actually have 1 core of solr 3.3.0 and it works fine for a few number of pdf docs but I'm afraid about the mome

Re: ideas for indexing large amount of pdf docs

2011-08-13 Thread Rode Gonzalez (libnova)
on-the-fly indexing/searching or what? I'm mostly curious what your projected max ingestion rate is... Best Erick On Sat, Aug 13, 2011 at 4:49 AM, Rode Gonzalez (libnova) wrote: > Hi all, > > I want to ask about the best way to implement a solution for indexing a > large amo

Re: ideas for indexing large amount of pdf docs

2011-08-13 Thread Rode Gonzalez (libnova)
es as you need to handle your maximum load. So scaling wide is a very quick process. Don't go to sharding unless and until your machine can't give adequate response times at all... Mark's paper outlines this very well. Best Erick On Sat, Aug 13, 2011 at 2:13 PM, Rode G