On 06/27/2011 11:23 AM, lee carroll wrote:
Hi Tod,
A list of keywords would be fine in a non multi valued field:

keywords : "xxx yyy sss aaa qqqq"

multi value field would allow you to repeat the field when indexing

keywords: "xxx"
keywords: "yyy"
keywords: "sss"
etc


Thanks Lee. the problem is I'm manually pushing a document (via stream.url) and its metadata from a database with the Solr /update/extract REST service, HTTP GET, using Perl.

I'm streaming over the document content (presumably via tika) and its gathering the document's metadata which includes the keywords metadata field. Since I'm also passing that field from the DB to the REST call as a list (as you suggested) there is a collision because the keywords field is single valued.

I can change this behavior using a copy field. What I wanted to know is if there was a specific reason the default schema defined a field like keywords single valued so I could make sure I wasn't missing something before I changed things.

While I'm at it, I'd REALLY like to know how to use DIH to index the metadata from the database while simultaneously streaming over the document content and indexing it. I've never quite figured it out yet but I have to believe it is a possibility.


- Tod

Reply via email to