Hmmm... what's going on here with email names and addresses???

My email client says "From: chris.a.mattm...@jpl.nasa.gov" for the name, but shows an email address of "csnsha...@gmail.com". Is this message from Chris A. Mattmann or not?!?

And in the actual email header I see this:
From: =?utf-8?b?Y2hyaXMuYS5tYXR0bWFubkBqcGwubmFzYS5nb3Y=?= <csnsha...@gmail.com>

Very strange.
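
For what it's worth, the display-name part of that header is an RFC 2047 "encoded-word" (UTF-8 charset, base64 payload), so it can be decoded to see exactly what the sender put in the name field. Below is a minimal sketch using Python's standard email.header module. The helper name decode_from_header is only for illustration, and the address in angle brackets is left truncated exactly as the archive shows it. Note that decoding only reveals the text of the display name; it proves nothing about who actually sent the message.

```python
from email.header import decode_header


def decode_from_header(raw_value):
    """Decode any RFC 2047 encoded-words in a header value into plain text.

    Hypothetical helper for illustration only.
    """
    pieces = []
    for chunk, charset in decode_header(raw_value):
        # decode_header may return bytes (with or without a charset) or str
        if isinstance(chunk, bytes):
            chunk = chunk.decode(charset or "ascii", errors="replace")
        pieces.append(chunk)
    return "".join(pieces)


# Header value copied from the message above (address truncated by the archive)
raw_from = "=?utf-8?b?Y2hyaXMuYS5tYXR0bWFubkBqcGwubmFzYS5nb3Y=?= <csnsha...@gmail.com>"

decoded = decode_from_header(raw_from)
display_name, _, rest = decoded.partition("<")

print("display name:", display_name.strip())
print("address part:", "<" + rest)
```

If the display name decodes to something that looks like an email address while the part in angle brackets is a different address, then the sender (or the gateway that posted the message) simply used an address string as the name field.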

-- Jack Krupansky

-----Original Message-----
From: chris.a.mattm...@jpl.nasa.gov
Sent: Friday, May 25, 2012 7:08 AM
To: solr-user@lucene.apache.org
Subject: Re: Solr Performance

Jack Krupansky <jack <at> basetechnology.com> writes:


I vaguely recall some thread blocking issue with trying to parse too many
PDF files at one time in the same JVM.

Occasionally Tika (actually PDFBox) has been known to hang for some PDF
docs.

Do you have enough memory in the JVM? When the CPU is busy, is there much
memory available in the JVM? Maybe garbage collection is taking too much of
the CPU.



Hi Jack,

Thanks for your quick response. Yes, I think I have enough JVM memory. Here
are the memory settings:

-Xms11g -Xmx11g -XX:MaxPermSize=2g

Is this a common issue with PDF extraction and indexing? Why am I not able
to index more than 1k documents per hour?

Thanks,
Surendra.
