Re: Controlling Tika's metadata

2011-06-17 Thread Jan Høydahl
This is the same issue I brought up in this thread: http://search-lucene.com/m/s8sOH1YG1TP As a workaround I wrote an UpdateProcessor to copy/move fields around (SOLR-2599). I think we need a separate fmap for TIKA generated fields (say tmap), so the problem could be fixed by: tmap.title=tika

Re: Controlling Tika's metadata

2011-06-17 Thread alexander sulz
I have the same problem with discarding the metadata title. I thought the parameter "captureAttr" (can be provided at the solrconfig.xml and via get/post as a parameter) is responsible for that? I set it to false in in the xml and as a parameter, still, I get "not multivalued field" errors due

Re: Controlling Tika's metadata

2011-02-02 Thread Grant Ingersoll
On Jan 28, 2011, at 5:38 PM, Andreas Kemkes wrote: > Just getting my feet wet with the text extraction using both schema and > solrconfig settings from the example directory in the 1.4 distribution, so I > might miss something obvious. > > Trying to provide my own title (and discarding the one

Controlling Tika's metadata

2011-01-28 Thread Andreas Kemkes
Just getting my feet wet with the text extraction using both schema and solrconfig settings from the example directory in the 1.4 distribution, so I might miss something obvious. Trying to provide my own title (and discarding the one received through Tika's metadata) wasn't straightforward. I h