Hi everyone,
Now i have had a problem with tika and solr. I successed in index data from
various file formats (pdf, doc...) with a file absolute path. but now I have
a link from internet (ex: http://myweb/filename.pdf). I want to index from
this link, But it's not ok. I don't why? This is my file dataconfig.xml:
*<dataConfig>
<dataSource type="BinFileDataSource" name="bin"/>
<document>
<entity name="tika-test" processor="TikaEntityProcessor" url="
http://myweb/filename.pdf" format="text" dataSource="bin" >
<field column="Author" name="author" meta="true"/>
<field column="title" name="title" meta="true"/>
<field column="text" name="text"/>
</entity>
</document>
</dataConfig>*
when i change url=" http://myweb/filename.pdf" by a file absolute path, it
work very good.
Any one know this?
Thanks for your help.
--
View this message in context:
http://lucene.472066.n3.nabble.com/indexing-data-from-rich-documents-Tika-with-solr3-1-tp3322555p3322555.html
Sent from the Solr - User mailing list archive at Nabble.com.