Re: Text Only Extraction Using Solr and Tika

2011-05-05 Thread Emyr James
n help you for the meta data fields. References- http://wiki.apache.org/solr/SchemaXml#Dynamic_fields http://wiki.apache.org/solr/ExtractingRequestHandler#Input_Parameters http://wiki.apache.org/solr/TikaExtractOnlyExampleOutput Regards, Anuj On Thu, May 5, 2011 at 7:28 PM, Emyr James wrote:

Re: Text Only Extraction Using Solr and Tika

2011-05-05 Thread Emyr James
Luker wrote: Hi Emyr, You could try using the "extractOnly=true" parameter [1]. Of course, you'll need to repost the extracted text manually. --jay [1] http://wiki.apache.org/solr/ExtractingRequestHandler#Extract_Only On Thu, May 5, 2011 at 9:36 AM, Emyr James wrote: Hi All,

Text Only Extraction Using Solr and Tika

2011-05-05 Thread Emyr James
Hi All, I have solr and tika installed and am happily extracting and indexing various files. Unfortunately on some word documents it blows up since it tries to auto-generate a 'title' field but my title field in the schema is single valued. Here is my config for the extract handler... class