Re: Errors on master after upgrading to 4.10.3

2016-02-17 Thread Joseph Hagerty
e in the Solr code, if you're concerned about it, > you'll probably have better luck asking the PDFBox developers directly, via > Jira or their mailing list. > > > On Tue, Feb 16, 2016 at 12:08 PM, Joseph Hagerty wrote: > > > Does literally nobody else see this error

Re: Errors on master after upgrading to 4.10.3

2016-02-16 Thread Joseph Hagerty
Does literally nobody else see this error in their logs? I see this error hundreds of times per day, in occasional bursts. Should I file this as a bug? On Mon, Feb 15, 2016 at 4:56 PM, Joseph Hagerty wrote: > After migrating from 3.5 to 4.10.3, I'm seeing the following error with

Errors on master after upgrading to 4.10.3

2016-02-15 Thread Joseph Hagerty
After migrating from 3.5 to 4.10.3, I'm seeing the following error with alarming regularity in the master's error log: 2/15/2016, 4:32:22 PM ERROR PDSimpleFont Can't determine the width of the space character using 250 as default I can't seem to glean much information about this one from the web.

Re: JVM heap constraints and garbage collection

2014-01-31 Thread Joseph Hagerty
hawn Heisey wrote: > On 1/30/2014 3:20 PM, Joseph Hagerty wrote: > >> I'm using Solr 3.5 over Tomcat 6. My index has reached 30G. >> > > > > > - The box is an m1.large on AWS EC2. 2 virtual CPUs, 4 ECU, 7.5 GiB RAM >> > > One detail that you did

JVM heap constraints and garbage collection

2014-01-30 Thread Joseph Hagerty
Greetings esteemed Solr-ites, I'm using Solr 3.5 over Tomcat 6. My index has reached 30G. Since my average load during peak hours is becoming quite high, and since I'm finally starting to notice a little bit of performance degradation and intermittent errors (e.g. "Solr returned response 0" on pe

Re: ExtractRH: How to strip metadata

2012-05-02 Thread Joseph Hagerty
> "metax", but all the doc metadata appears in "meta". > > Although, I did get 400 errors with Solr complaining that "meta" was not a > multivalued field. This is with Solr 3.6. What release of Solr are you > using? > > I was not aware of th

Re: ExtractRH: How to strip metadata

2012-05-02 Thread Joseph Hagerty
eld for a wildcard pattern that copies to > "meta", which would copy all of the Tika-generated fields to "meta." > > -- Jack Krupansky > > -----Original Message- From: Joseph Hagerty > Sent: Wednesday, May 02, 2012 9:56 AM > To: solr-user@lucene.a

ExtractRH: How to strip metadata

2012-05-02 Thread Joseph Hagerty
Greetings Solr folk, How can I instruct the extract request handler to ignore metadata/headers etc. when it constructs the "content" of the document I send to it? For example, I created an MS Word document containing just the word "SEARCHWORD" and nothing else. However, when I ship this doc to my