On May 28, 2009, at 11:29 AM, Eric Pugh wrote:
Hi all,
I want to use the Tika attribute stream_name as my unique key, which
I can do if I specify <uniqueKey>stream_name</uniqueKey/> and run
curl:
curl http://localhost:8983/solr/karaoke/update/extract?
ext.def.fl=text\&ext.capture=stream_name
\&ext.map.stream_name=stream_name -F "fi...@angeleyes.kar"
Why do you need to have the ext.capture and why do you need to map
stream_name to stream_name? If the name in tika metadata is a field
name, you don't need to map.
Also, I assume I'm missing something here because why can't you just
pass in id=<name of the stream> since presumably, in your examples
anyway, you have this info, right? If not, I don't know where else
you are getting it from, b/c it is a Solr thing, not a Tika thing. In
fact, that reminds me, I should document those values that the ERH
adds to the Metadata.
However, this means that I can't use the ext.metadata.prefix to
capture the other metadata fields via:
curl http://localhost:8983/solr/karaoke/update/extract?
ext.def.fl=text\&ext.metadata.prefix=metadata_
\&ext.capture=stream_name\&ext.map.stream_name=stream_name -F "fi...@angeleyes.kar
"
If I do, it seems like stream_name is lost becasue it is now
metadata_stream_name, but I can't use that name in my ext.capture
and ext.map:
curl http://localhost:8983/solr/karaoke/update/extract?
ext.def.fl=text\&ext.metadata.prefix=metadata_
\&ext.capture=metadata_stream_name
\&ext.map.metadata_stream_name=stream_name -F "fi...@angeleyes.kar"
Any ideas? Currently seems like an either/or, but I'd like both!
Eric
-----------------------------------------------------
Eric Pugh | Principal | OpenSource Connections, LLC | 434.466.1467 |
http://www.opensourceconnections.com
Free/Busy: http://tinyurl.com/eric-cal
--------------------------
Grant Ingersoll
http://www.lucidimagination.com/
Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)
using Solr/Lucene:
http://www.lucidimagination.com/search