: > : If I can find the bandwidth, I'd like to make something which allows
: > : file uploads via the XMLUpdateHandler as well... Do you have any ideas
: > 
: > the XmlUpdateRequestHandler already supports file uploads ... all request

: But it doesn't do what Jacob is asking for... he wants (if I'm not mistaken)

Hmm ... i thought this was an offshoot question ... the main point of this 
thread seems to have already been solved by the new 
ext.literal.${fieldname}=${fieldvalue} param support Grant just added 
to ExtractingRequestHandler right?

what am i missunderstanding about the usecase that isn't solved by that?  
the "tika doc" from the ContentStream is the primary "guts" of the doc, 
with additional literal "metadata" fields being added, correct?

(I can imagine a more complicated usecase where someone might want a 
single document built from multiple ContentStreams parsed by Tika, with 
differnet pieces of each TikaDoc contributing in differnet ways ... ie: my 
name is Hoss, my address is X, my phone number is Y, this 
first ContentStream should be indexed as my bio field (doesn't matter if 
it's PDF, HTML, MS-Word, etc.), index and store the ID3 Title & length 
from any MP3 ContentStreams in the multivalued "lecture_title" and 
"lecture_length" fields, and any ContentStreams left over should be 
indexed in the misc "other_text" field.  .... but that's not what we're 
talking about here, correct?)


-Hoss

Reply via email to