Lewis, A large maxFieldLength may not necessarily result in OOM - it depends on -Xmx you are using, the number of concurrent documents being processed, and such. So the first thing I'd look would be my machine's RAM, then -Xmx I can afford, then based on that set maxFieldLengthmay.
Otis ---- Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ ----- Original Message ---- > From: "McGibbney, Lewis John" <lewis.mcgibb...@gcu.ac.uk> > To: "solr-user@lucene.apache.org" <solr-user@lucene.apache.org> > Sent: Wed, February 2, 2011 10:20:58 AM > Subject: value for maxFieldLength > > Hello list, > > I am aware that setting the value of maxFieldLength in solrconfig.xml too > high >may/will result in out-of-mem errors. I wish to provide content extraction on >a >number of pdf documents which are large, by large I mean 8-11MB (occasionally >more), and I am also not sure how many terms reside in each field when it is >indexed. My question is therefore what is a sensible number to set this value >to in order to include the majority/all terms within documents of this size. > > Thank you > > Lewis > > > Glasgow Caledonian University is a registered Scottish charity, number >SC021474 > > Winner: Times Higher Education's Widening Participation Initiative of the > Year >2009 and Herald Society's Education Initiative of the Year 2009. >http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html >l > > Winner: Times Higher Education's Outstanding Support for Early Career >Researchers of the Year 2010, GCU as a lead with Universities Scotland >partners. >http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,15691,en.html >l >