Hello all, 
my Bitnami/*Solr-5.0.0* instalation is not able to index any type of
file(found in the provided examples folders or anywhere else) except HTML. 

Tested on the files in "exampledocs" folder
(books.csv,books.json,...,utf8-example.xml, vidcard.xml) I get:
for *.csv* files I get the reponse "Unexpected character 'i' " (depending on
what is the 1st character in file),
for *.xml* files I get the response "ERROR: unknown field 'id' "
for *.pdf* files I get the response "Invalid UTF-8 middle byte 0xe5"
and so forth.
Even *.TXT* files are not handled:
I get the reponse "Unexpected character 'T' " (depending on what is the 1st
character in file--This is a test of TXT extraction in Solr, it is only a
test. Do not panic.")


The only type that works is *HTML* :

C:\Bitnami\solr-5.0.0-0\apache-solr\solr\exampledocs>java -Dc=tika -jar
post.jar  *.html

SimplePostTool version 5.0.0
Posting files to [base] url http://localhost:8983/solr/tika/update using
content-type application/xml...
POSTing file sample.html to [base]
1 files indexed.
COMMITting Solr index changes to http://localhost:8983/solr/tika/update...
Time spent: 0:00:00.313

I use Windows 8.1, java version "1.8.0_40".

Any ideas of how to fix this? Many thanks.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/possible-SimplePostTool-problem-Windows-Bitnami-distribution-tp4199980.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to