Hi Ganesh,

you can increase the heap size but parsing a 4 GB PDF document will very likely consume A LOT OF memory - I think you need to check if that large PDF can be parsed at all :-)

Cheers,

Siegfried Goeschl

On 14.01.15 18:04, Michael Della Bitta wrote:
Yep, you'll have to increase the heap size for your Tomcat container.

http://stackoverflow.com/questions/6897476/tomcat-7-how-to-set-initial-heap-size-correctly

Michael Della Bitta

Senior Software Engineer

o: +1 646 532 3062

appinions inc.

“The Science of Influence Marketing”

18 East 41st Street

New York, NY 10017

t: @appinions <https://twitter.com/Appinions> | g+:
plus.google.com/appinions
<https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts>
w: appinions.com <http://www.appinions.com/>

On Wed, Jan 14, 2015 at 12:00 PM, <ganesh.ya...@sungard.com> wrote:

Hello,

Can someone pass on the hints to get around following error? Is there any
Heap Size parameter I can set in Tomcat or in Solr webApp that gets
deployed in Solr?

I am running Solr webapp inside Tomcat on my local machine which has RAM
of 12 GB. I have PDF document which is 4 GB max in size that needs to be
loaded into Solr




Exception in thread "http-apr-8983-exec-6" java.lang.    : Java heap space
         at java.util.AbstractCollection.toArray(Unknown Source)
         at java.util.ArrayList.<init>(Unknown Source)
         at
org.apache.pdfbox.cos.COSDocument.getObjects(COSDocument.java:518)
         at org.apache.pdfbox.cos.COSDocument.close(COSDocument.java:575)
         at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:254)
         at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1238)
         at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1203)
         at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:111)
         at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
         at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
         at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
         at
org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:219)
         at
org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:74)
         at
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135)
         at
org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handleRequest(RequestHandlers.java:246)
         at org.apache.solr.core.SolrCore.execute(SolrCore.java:1967)
         at
org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:777)
         at
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:418)
         at
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:207)
         at
org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:241)
         at
org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:208)
         at
org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:220)
         at
org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:122)
         at
org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:170)
         at
org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:103)
         at
org.apache.catalina.valves.AccessLogValve.invoke(AccessLogValve.java:950)
         at
org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:116)
         at
org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:421)
         at
org.apache.coyote.http11.AbstractHttp11Processor.process(AbstractHttp11Processor.java:1070)
         at
org.apache.coyote.AbstractProtocol$AbstractConnectionHandler.process(AbstractProtocol.java:611)
         at
org.apache.tomcat.util.net.AprEndpoint$SocketProcessor.doRun(AprEndpoint.java:2462)
         at
org.apache.tomcat.util.net.AprEndpoint$SocketProcessor.run(AprEndpoint.java:2451)


Thanks
Ganesh




Reply via email to