Lewis,

A large maxFieldLength will not necessarily result in OOM - it depends on the
-Xmx you are using, the number of documents being processed concurrently, and
so on. So the first thing I'd look at is the machine's RAM, then the -Xmx I can
afford, and then I'd set maxFieldLength based on that.
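
For reference, here is a rough sketch of where this setting lives. maxFieldLength
caps the number of terms indexed per field, and it is set in the index sections
of solrconfig.xml. The value 100000 below is purely an illustrative assumption,
not a recommendation; pick yours after settling on the heap size.

```xml
<!-- solrconfig.xml (fragment) -->
<indexDefaults>
  <!-- Maximum number of terms indexed per field; terms beyond this are dropped.
       100000 is an assumed example value. Size it against your -Xmx. -->
  <maxFieldLength>100000</maxFieldLength>
</indexDefaults>
```

If your solrconfig.xml also sets maxFieldLength under <mainIndex>, check that
one as well. The heap itself is set on the JVM command line, e.g. something
like "java -Xmx1024m -jar start.jar" if you run the bundled Jetty example (the
1024m figure is again just an assumption).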

Otis
----
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
Lucene ecosystem search :: http://search-lucene.com/



----- Original Message ----
> From: "McGibbney, Lewis John" <lewis.mcgibb...@gcu.ac.uk>
> To: "solr-user@lucene.apache.org" <solr-user@lucene.apache.org>
> Sent: Wed, February 2, 2011 10:20:58 AM
> Subject: value for maxFieldLength
> 
> Hello list,
> 
> I am aware that setting the value of maxFieldLength in solrconfig.xml too
> high may/will result in out-of-mem errors. I wish to provide content
> extraction on a number of pdf documents which are large, by large I mean
> 8-11MB (occasionally more), and I am also not sure how many terms reside in
> each field when it is indexed. My question is therefore what is a sensible
> number to set this value to in order to include the majority/all terms
> within documents of this size.
> 
> Thank you
> 
> Lewis
> 
> 
