Re: Solr with many indexes

2011-08-02 Thread Vikram Kumar
We have a multi-tenant Solr deployment with a core for each user. Due to the limitations we are facing with number of cores, lazy-loading (and associated warm-up times), we are researching about consolidating several users into one core with queries limited by user-id field. My question is about

Re: What is the best scalable scheme to support multiple users?

2009-02-26 Thread Vikram Kumar
Hi Wunder, Can you please elaborate? Vikram On Thu, Feb 26, 2009 at 10:13 AM, Walter Underwood wrote: > 1a. Multiple Solr instances partitioned by user_id%N, with index > files segmented by user_id field. > > That can scale rather gracefully, though it does need reindexing > to add a server. > >

Re: Use of scanned documents for text extraction and indexing

2009-02-26 Thread Vikram Kumar
Tesseract is pure OCR. Ocropus builds on Tesseract. Vikram On Thu, Feb 26, 2009 at 12:11 PM, Shashi Kant wrote: > Another project worth investigating is Tesseract. > > http://code.google.com/p/tesseract-ocr/ > > > > > - Original Message > From: Hannes Carl Meyer > To: solr-user@lucene.

Re: Use of scanned documents for text extraction and indexing

2009-02-27 Thread Vikram Kumar
shi Kant wrote: > Can anyone back that up? > > IMHO Tesseract is the state-of-the-art in OCR, but not sure that "Ocropus > builds on Tesseract". > Can you confirm that Vikram has a point? > > Shashi > > > > > - Original Message > From