post.jar and curl do the same thing. Look at post.sh, which uses curl.

On Mon, May 7, 2012 at 12:57 PM, Tolga <to...@ozses.net> wrote:
> On 05/07/2012 10:35 PM, Jack Krupansky wrote:
>>
>> Try SolrCell (ExtractingRequestHandler).
>>
>> See:
>> http://wiki.apache.org/solr/ExtractingRequestHandler
>>
>> -- Jack Krupansky
>>
>> -----Original Message-----
>> From: Tolga
>> Sent: Monday, May 07, 2012 3:24 PM
>> To: solr-user@lucene.apache.org
>> Subject: PDF indexing
>>
>> Hi,
>>
>> From what I have read, I think I have to use Tika (?) to index PDF, xls,
>> doc, etc files. How do I start? Do I use mvn clean install in the source
>> directory to get all the jar files to begin? Centos doesn't provide mvn, how
>> do I build Tika after getting it from http://maven.apache.org ?
>>
>> Sorry for the noob questions, I'm just beginning.
>
> Jack,
>
> Thank you very much, I've managed to index a pdf file after a few tries.
> With this curl syntax, would it be possible to index an xml file as well or
> do we need to use java -jar post.jar file.xml? Or let me put it this way,
> how is post.jar different than curl?
>
> Regards,
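
To make the "same thing" point concrete: both tools just send an HTTP POST with the XML as the request body. Here is a rough Python sketch of that idea, not the actual post.jar or post.sh code. The update URL is an assumption (the default of the Solr example server), and the names build_update_request and post_xml_file are made up for illustration.

```python
import urllib.request

# Assumption: default update URL of the Solr example server; adjust to your setup.
SOLR_UPDATE_URL = "http://localhost:8983/solr/update"


def build_update_request(xml_body: bytes, url: str = SOLR_UPDATE_URL) -> urllib.request.Request:
    """Build (but do not send) an HTTP POST carrying an XML update message."""
    return urllib.request.Request(
        url,
        data=xml_body,
        headers={"Content-Type": "application/xml"},
        method="POST",
    )


def post_xml_file(path: str, url: str = SOLR_UPDATE_URL) -> None:
    """POST one XML file, then a <commit/> so the documents become searchable."""
    with open(path, "rb") as f:
        body = f.read()
    # Send the file contents first, then the commit message.
    for payload in (body, b"<commit/>"):
        with urllib.request.urlopen(build_update_request(payload, url)) as resp:
            print(resp.status, resp.read().decode("utf-8", "replace"))
```

With a Solr instance running at that URL, post_xml_file("file.xml") should have roughly the same effect as java -jar post.jar file.xml. Any HTTP client will do, which is why the choice between curl and post.jar is mostly a matter of convenience.
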
--
Lance Norskog
goks...@gmail.com