Re: Getting indexed content of files using ExtractingRequestHandler

2013-07-14 Thread Erick Erickson
I'm completely ignorant of all things PHP, including the state of any Solr client code, so I'm afraid I can't help with that... Best Erick On Sun, Jul 14, 2013 at 11:03 AM, xan wrote: > Thanks for the link. Also, having gone quite far with my work using the PHP > Solr client, isn't there anythin

Re: Getting indexed content of files using ExtractingRequestHandler

2013-07-14 Thread xan
Thanks for the link. Also, having gone quite far with my work using the PHP Solr client, isn't there anything that could be done using the PHP Solr client only? -- View this message in context: http://lucene.472066.n3.nabble.com/Getting-indexed-content-of-files-using-ExtractingRequestHandler-tp

Re: Getting indexed content of files using ExtractingRequestHandler

2013-07-14 Thread Erick Erickson
Right, sorry... http://searchhub.org/dev/2012/02/14/indexing-with-solrj/ On Sun, Jul 14, 2013 at 8:31 AM, xan wrote: > Sorry, but did you forget to send me the example's link? > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Getting-indexed-content-of-files-using-

Re: Getting indexed content of files using ExtractingRequestHandler

2013-07-14 Thread xan
Sorry, but did you forget to send me the example's link? -- View this message in context: http://lucene.472066.n3.nabble.com/Getting-indexed-content-of-files-using-ExtractingRequestHandler-tp4077856p4077877.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Getting indexed content of files using ExtractingRequestHandler

2013-07-14 Thread Erick Erickson
Well, cURL is generally not what people use for production. What I'd consider is using SolrJ (which you can access Tika from) and then store the raw pdf (or whatever) document as a binary data type in Solr. Here's an example (with DB indexing mixed in, but you should be able to pull that part out)