If you are using the ExtractingRequestHandler, you can also try using
the stream.file or stream.url.

e.g. curl 
"http://localhost:8080/solr/core0/update/extract?stream.file=C:/777045.zip&literal.id=777045&literal.title=Test&commit=true";

More detailed explaination @
http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Content-Extraction-Tika

The literal prefix attributes with normal fields and the content
extracted from the document is stored in the text field by default

Regards,
Jayendra

On Thu, Mar 3, 2011 at 7:16 AM, Gary Taylor <g...@inovem.com> wrote:
> As an example, I run this in the same directory as the msword1.doc file:
>
> curl
> "http://localhost:8983/solr/core0/update/extract?literal.docid=74&literal.type=5";
> -F "file=@msword1.doc"
>
> The "type" literal is just part of my schema.
>
> Gary.
>
>
> On 03/03/2011 11:45, Ken Foskey wrote:
>>
>> On Thu, 2011-03-03 at 12:36 +0100, Markus Jelsma wrote:
>>>
>>> Here's a complete example
>>>
>>> http://wiki.apache.org/solr/UpdateXmlMessages#Passing_commit_parameters_as_part_of_the_URL
>>
>> I should have been clearer.   A rich text document,  XML I can make work
>> and a script is in the example docs folder
>>
>> http://wiki.apache.org/solr/ExtractingRequestHandler
>>
>> I also read the solr 1.4 book and tried samples in there,   could not
>> make them work.
>>
>> Ta
>>
>>
>
>

Reply via email to