Re: Large fields storage

2014-12-04 Thread Shawn Heisey
On 12/1/2014 3:10 PM, Avishai Ish-Shalom wrote:
> I have very large documents (as big as 1GB) which I'm indexing and planning
> to store in Solr in order to use highlighting snippets. I am concerned
> about possible performance issues with such large fields - does storing the
> fields require addit

Re: Large fields storage

2014-12-04 Thread Michael Sokolov
met Arslan Sent: Monday, December 1, 2014 6:02 PM To: solr-user@lucene.apache.org Subject: Re: Large fields storage Hi Avi, I assume your documents are rich documents like PDF or Word, am I correct? When you extract textual content from them, their size will shrink. Ahmet On Tuesday, December 2

Re: Large fields storage

2014-12-04 Thread Avishai Ish-Shalom
r-user@lucene.apache.org
> Subject: Re: Large fields storage
>
> Hi Avi,
>
> I assume your documents are rich documents like PDF or Word, am I correct?
> When you extract textual content from them, their size will shrink.
>
> Ahmet
>
> On Tuesday, December 2, 20

Re: Large fields storage

2014-12-01 Thread Jack Krupansky
In particular, if they are image-intensive, all the images go away. And the formatting as well. -- Jack Krupansky -Original Message- From: Ahmet Arslan Sent: Monday, December 1, 2014 6:02 PM To: solr-user@lucene.apache.org Subject: Re: Large fields storage Hi Avi, I assume your

Re: Large fields storage

2014-12-01 Thread Erick Erickson
I really have to question the utility of this. The doc will match a _lot_ of queries, but I guess they'll be scored quite low due to length normalization. And even if the user does decide to click on the document, are they going to then download a document bigger than 1GB? All in all, your concern

Re: Large fields storage

2014-12-01 Thread Ahmet Arslan
Hi Avi, I assume your documents are rich documents like PDF or Word, am I correct? When you extract textual content from them, their size will shrink. Ahmet On Tuesday, December 2, 2014 12:11 AM, Avishai Ish-Shalom wrote: Hi all, I have very large documents (as big as 1GB) which I'm indexing a

Large fields storage

2014-12-01 Thread Avishai Ish-Shalom
Hi all, I have very large documents (as big as 1GB) which I'm indexing and planning to store in Solr in order to use highlighting snippets. I am concerned about possible performance issues with such large fields - does storing the fields require additional RAM over what is required to index/fetch/