Re: ExtractRH: How to strip metadata

2012-05-02 Thread Joseph Hagerty
is undocumented feature. I haven't checked the code > yet. > > > -- Jack Krupansky > > -Original Message- From: Joseph Hagerty > Sent: Wednesday, May 02, 2012 11:10 AM > To: solr-user@lucene.apache.org > Subject: Re: ExtractRH: How to strip metadata > >

Re: ExtractRH: How to strip metadata

2012-05-02 Thread Jack Krupansky
he code yet. -- Jack Krupansky -Original Message- From: Joseph Hagerty Sent: Wednesday, May 02, 2012 11:10 AM To: solr-user@lucene.apache.org Subject: Re: ExtractRH: How to strip metadata I do not. I commented out all of the copyFields provided in the default schema.xml that ships wi

Re: ExtractRH: How to strip metadata

2012-05-02 Thread Joseph Hagerty
eld for a wildcard pattern that copies to > "meta", which would copy all of the Tika-generated fields to "meta." > > -- Jack Krupansky > > -Original Message- From: Joseph Hagerty > Sent: Wednesday, May 02, 2012 9:56 AM > To: solr-user@lucene.a

Re: ExtractRH: How to strip metadata

2012-05-02 Thread Jack Krupansky
e.apache.org Subject: ExtractRH: How to strip metadata Greetings Solr folk, How can I instruct the extract request handler to ignore metadata/headers etc. when it constructs the "content" of the document I send to it? For example, I created an MS Word document containing just the word

ExtractRH: How to strip metadata

2012-05-02 Thread Joseph Hagerty
Greetings Solr folk, How can I instruct the extract request handler to ignore metadata/headers etc. when it constructs the "content" of the document I send to it? For example, I created an MS Word document containing just the word "SEARCHWORD" and nothing else. However, when I ship this doc to my