Frederik,

We also use a separate service, which extracts the content with Tika and then indexes it through SolrJ. The main reason we went for this approach is the flexibility to manipulate and transform the data beyond what Tika does.
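To illustrate the shape of such a pipeline, here is a minimal sketch. It is not our actual service: it uses Python over plain HTTP instead of SolrJ, and it assumes a Tika Server listening on localhost:9998, a Solr core named "docs", and schema fields called "content" and "content_length". All of those names are hypothetical. The transform step is where any custom manipulation would go.

```python
import json
import urllib.request

# Assumed endpoints: adjust host, port and core name to your own setup.
TIKA_URL = "http://localhost:9998/tika"
SOLR_UPDATE_URL = "http://localhost:8983/solr/docs/update?commit=true"


def extract_text(raw: bytes) -> str:
    """Send raw file bytes to Tika Server and return the extracted plain text."""
    req = urllib.request.Request(
        TIKA_URL, data=raw, method="PUT", headers={"Accept": "text/plain"}
    )
    with urllib.request.urlopen(req) as resp:
        return resp.read().decode("utf-8")


def transform(doc_id: str, text: str) -> dict:
    """Custom step applied on top of Tika's output.

    Here it only collapses whitespace and records the text length;
    the field names are hypothetical and must match your schema.
    """
    cleaned = " ".join(text.split())
    return {"id": doc_id, "content": cleaned, "content_length": len(cleaned)}


def build_update_request(docs: list) -> urllib.request.Request:
    """Build the JSON update request for Solr without sending it."""
    body = json.dumps(docs).encode("utf-8")
    return urllib.request.Request(
        SOLR_UPDATE_URL,
        data=body,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


def index_file(doc_id: str, raw: bytes) -> int:
    """Extract, transform and index one file; returns Solr's HTTP status."""
    doc = transform(doc_id, extract_text(raw))
    with urllib.request.urlopen(build_update_request([doc])) as resp:
        return resp.status
```

Because extraction runs outside Solr, the Tika version and the transform logic can change without touching the Solr installation.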
As I understand it, if no other transformation is needed, ExtractingRequestHandler should be fine in production too.

Regards,
Sreenivas

On 8 February 2018 at 17:17, Frederik Van Hoyweghen <frederik.vanhoyweg...@chapoo.com> wrote:

> Hey everyone,
>
> What are your experiences on making (in production) use of Solr's
> ExtractingRequestHandler?
>
> I've been reading some mixed remarks so I was wondering what your actual
> experiences with it are.
>
> Personally, I feel like setting up a separate service which is solely
> responsible for parsing file contents (to be indexed by Solr later on in
> the process) using Tika is a safer approach, so we can use whatever Tika
> version we want along with other things we might want to add.
>
> Looking forward to your response!
>
> Kind regards,
> Frederik
>