Look again at ExtractingRequestHandler.
I havn't looked at what post.jar does internally, but it probably
doesn't work with ExtractingRequestHandler unless you can send other
params as well. I would use curl as the examples in the doc for
ExtractingRequestHandler does. Or figure out if post.jar will work for
you and use it correctly. What Handler is 'update..' mapped to? If its
not mapped to ExtractingRequestHandler than you have no hope of this
working in any case. Looks to me like its trying to process the file as
SolrXml - which means you are not submitting it to ExtractingRequestHandler.
--
- Mark
http://www.lucidimagination.com
Kevin Miller wrote:
I am trying to post a Word document using the Solr post.jar file. When
I attempt this, using a command line interface, I get a fatal error.
I have looked at the following resources:
Solr.com: Tutorial, Docs, FAQ, & ExtractingRequestHandler.
As near as I can tell, I have all the files in the proper place.
Following is a portion of the error displayed in the cmd window:
C:\Solr\Apache~1\example\exampledocs>java -jar post.jar *.doc
SimplePostTool: version 1.2
SimplePostTool: WARNING: Make sure your XML documents are encoded in
UTF-8, other encodings are not currently supported
SimplePostTool: POSTing files to http://localhost:8983/solr/update..
SimplePostTool: POSTing file BadNews.doc
SimplePostTool: FATAL: Solr returned an error:
Unexpected_character__code_65533__0xfffd_in_prolog_expected___at_rowcol_
unknownsoruce_11_javaioIOException_Unexpected_charater__code65533__0xfff
d_in_prolog_expected___at_rowcol_unknownsource_11___at_orgapachesolrhand
lerXMLLoaderloadXMLLoaderjava73___at_orgapahcesolrhandlerContentStreamHa
ndlerBasehandlerRequrestBodyContentStreamHandlerBasejava54___...
There is more and if needed I will be happy to post all of it.
Here is the information that posted into the log file:
127.0.0.1 - - [30/07/2009:15:20:09 +0000] "POST /solr/update HTTP/1.1"
500 4011
Kevin Miller
Web Services