We have a multi-tenant Solr deployment with a core for each user.
Due to the limitations we are facing with number of cores,
lazy-loading (and associated warm-up times), we are researching about
consolidating several users into one core with queries limited by
user-id field.
My question is about
Hi Wunder,
Can you please elaborate?
Vikram
On Thu, Feb 26, 2009 at 10:13 AM, Walter Underwood
wrote:
> 1a. Multiple Solr instances partitioned by user_id%N, with index
> files segmented by user_id field.
>
> That can scale rather gracefully, though it does need reindexing
> to add a server.
>
>
Tesseract is pure OCR. Ocropus builds on Tesseract.
Vikram
On Thu, Feb 26, 2009 at 12:11 PM, Shashi Kant wrote:
> Another project worth investigating is Tesseract.
>
> http://code.google.com/p/tesseract-ocr/
>
>
>
>
> - Original Message
> From: Hannes Carl Meyer
> To: solr-user@lucene.
shi Kant wrote:
> Can anyone back that up?
>
> IMHO Tesseract is the state-of-the-art in OCR, but not sure that "Ocropus
> builds on Tesseract".
> Can you confirm that Vikram has a point?
>
> Shashi
>
>
>
>
> - Original Message
> From