Re: SOLR Sizing

2016-10-14 Thread Shawn Heisey
On 10/14/2016 12:18 AM, Vasu Y wrote: > Thank you all for the insight and help. Our SOLR instance has multiple > collections. > Do you know if the spreadsheet at LucidWorks ( > https://lucidworks.com/blog/2011/09/14/estimating-memory-and-storage-for-lucenesolr/) > is meant to be used to calculate s

Re: SOLR Sizing

2016-10-13 Thread Vasu Y
Thank you all for the insight and help. Our SOLR instance has multiple collections. Do you know if the spreadsheet at LucidWorks ( https://lucidworks.com/blog/2011/09/14/estimating-memory-and-storage-for-lucenesolr/) is meant to be used to calculate sizing per collection or is it meant to be used f

Re: SOLR Sizing

2016-10-06 Thread Walter Underwood
The square-root rule comes from a short paper draft (unpublished) that I can’t find right now. But this paper gets the same result: http://nflrc.hawaii.edu/rfl/April2005/chujo/chujo.html Perfect OCR would follow this rule, but even great

Re: SOLR Sizing

2016-10-06 Thread Erick Erickson
OCR _without errors_ wouldn't break it. That comment assumed that the OCR was dirty I thought. Honest, I once was trying to index an OCR'd image of a "family tree" that was a stylized tree where the most remote ancestor was labeled in vertical text on the trunk, and descendants at various angles a

Re: SOLR Sizing

2016-10-06 Thread Rick Leir
I am curious to know where the square-root assumption is from, and why OCR (without errors) would break it. TIA cheers - - Rick On 2016-10-04 10:51 AM, Walter Underwood wrote: No, we don’t have OCR’ed text. But if you do, it breaks the assumption that vocabulary size is the square root of the

Re: SOLR Sizing

2016-10-04 Thread Walter Underwood
>>> Thanks, >>> Susheel >>> >>> On Mon, Oct 3, 2016 at 3:19 PM, Allison, Timothy B. >>> wrote: >>> >>>> This doesn't answer your question, but Erick Erickson's blog on this topic >>>> is invaluable: >>&

Re: SOLR Sizing

2016-10-04 Thread Rick Leir
https://lucidworks.com/blog/2012/07/23/sizing-hardware-in- the-abstract-why-we-dont-have-a-definitive-answer/ -Original Message- From: Vasu Y [mailto:vya...@gmail.com] Sent: Monday, October 3, 2016 2:09 PM To: solr-user@lucene.apache.org Subject: SOLR Sizing Hi, I am trying to estimate dis

Re: SOLR Sizing

2016-10-03 Thread Walter Underwood
doc >>>> and then multiply with # docs to get a rough index size. >>>> >>>> { >>>> "id":"product12345" >>>> "name":"productA", >>>> "category":"xyz", >>>> .

Re: SOLR Sizing

2016-10-03 Thread Walter Underwood
gt;>> } >>> >>> Thanks, >>> Susheel >>> >>> On Mon, Oct 3, 2016 at 3:19 PM, Allison, Timothy B. >>> wrote: >>> >>>> This doesn't answer your question, but Erick Erickson's blog on this topic >>

Re: SOLR Sizing

2016-10-03 Thread Erick Erickson
>> wrote: >> >>> This doesn't answer your question, but Erick Erickson's blog on this topic >>> is invaluable: >>> >>> https://lucidworks.com/blog/2012/07/23/sizing-hardware-in- >>> the-abstract-why-we-dont-have-a-definitive

Re: SOLR Sizing

2016-10-03 Thread Walter Underwood
answer your question, but Erick Erickson's blog on this topic >> is invaluable: >> >> https://lucidworks.com/blog/2012/07/23/sizing-hardware-in- >> the-abstract-why-we-dont-have-a-definitive-answer/ >> >> -Original Message- >> From: Vas

Re: SOLR Sizing

2016-10-03 Thread Susheel Kumar
, but Erick Erickson's blog on this topic > is invaluable: > > https://lucidworks.com/blog/2012/07/23/sizing-hardware-in- > the-abstract-why-we-dont-have-a-definitive-answer/ > > -Original Message- > From: Vasu Y [mailto:vya...@gmail.com] > Sent: Monday, October 3

RE: SOLR Sizing

2016-10-03 Thread Allison, Timothy B.
er 3, 2016 2:09 PM To: solr-user@lucene.apache.org Subject: SOLR Sizing Hi, I am trying to estimate disk space requirements for the documents indexed to SOLR. I went through the LucidWorks blog ( https://lucidworks.com/blog/2011/09/14/estimating-memory-and-storage-for-lucenesolr/) and using t

SOLR Sizing

2016-10-03 Thread Vasu Y
Hi, I am trying to estimate disk space requirements for the documents indexed to SOLR. I went through the LucidWorks blog ( https://lucidworks.com/blog/2011/09/14/estimating-memory-and-storage-for-lucenesolr/) and using this as the template. I have a question regarding estimating "Avg. Document Si

Re: solr sizing

2013-07-29 Thread Shawn Heisey
On 7/29/2013 2:18 PM, Torsten Albrecht wrote: we have - 70 mio documents to 100 mio documents and we want - 800 requests per second How many servers Amazon EC2/real hardware we Need for this? Solr 4.x with solr cloud or better shards with loadbalancer? Is anyone here who can give me some i

solr sizing

2013-07-29 Thread Torsten Albrecht
Hi all, we have - 70 mio documents to 100 mio documents and we want - 800 requests per second How many servers Amazon EC2/real hardware we Need for this? Solr 4.x with solr cloud or better shards with loadbalancer? Is anyone here who can give me some information, or who operates a similar