Could you expand on that please? I’m currently building a webApp that submits 
documents to Solr/Tika via the update/extract handler and it’s working fine. 
What do you mean when you say “You do not want to have your Solr instance 
processing via Tika”? If that’s a bad design choice please elaborate. 
Thanks,
Geoff


> On Mar 19, 2019, at 5:15 PM, Phil Scadden <p.scad...@gns.cri.nz> wrote:
> 
> As per Erick advice, I would strongly recommend that you do anything tika in 
> a  separate solrj programme. You do not want to have your solr instance 
> processing via tika.
> 
> -----Original Message-----
> From: Tannen, Lev (USAEO) [Contractor] <lev.tan...@usdoj.gov.INVALID>
> Sent: Wednesday, 20 March 2019 08:17
> To: solr-user@lucene.apache.org
> Subject: RE: Upgrading tika
> 
> Sorry Erick,
> Please disregard my previous message. Somehow I downloaded the version 
> without those two files. I am going to download the latest version solr 8.0.0 
> and try it.
> Best
> Lev Tannen
> 
> -----Original Message-----
> From: Erick Erickson <erickerick...@gmail.com>
> Sent: Tuesday, March 19, 2019 2:48 PM
> To: solr-user <solr-user@lucene.apache.org>
> Subject: Re: Upgrading tika
> 
> Yes, Solr is distributed with Tika. Look in:
> ./solr/contrib/extraction/lib
> 
> Tika is upgraded when new versions come out, so the underlying files are 
> whatever are current at the time.
> 
> The integration is a fairly loose coupling, if you're using some external 
> program (say a SolrJ program) to parse the files, there's no requirement to 
> use the jars distributed with Solr, use whatever suits your fancy. An 
> external program just constructs a SolrDocument to send to Solr. What you use 
> to create that document is irrelevant. See:
> https://lucidworks.com/2012/02/14/indexing-with-solrj/ for some background.
> 
> If you're using the ExtractingRequestHandler, where you just send the 
> semi-structured docs to Solr (PDFs, Word or whatever), then needing to know 
> anything about individual Tika-related jar files is kind of strange.
> 
> If your predecessors wrote some custom code that runs as part of Solr, I 
> don't know what to say...
> 
> Best,
> Erick
> 
> On Tue, Mar 19, 2019 at 10:47 AM Tannen, Lev (USAEO) [Contractor] 
> <lev.tan...@usdoj.gov.invalid> wrote:
>> 
>> Thank you Shawn.
>> I assumed that tika has been integrated with solr. I the project written 
>> before me they used two tika files taken from solr distribution. I am trying 
>> to do the same with solr 7.7.1. However this version contains a different 
>> set of tika related files. So I am confused. Does  solr does not have 
>> integrated tika anymore, or I just cannot recognize them?
>> 
>> -----Original Message-----
>> From: Shawn Heisey <apa...@elyograg.org>
>> Sent: Tuesday, March 19, 2019 11:11 AM
>> To: solr-user@lucene.apache.org
>> Subject: Re: Upgrading tika
>> 
>> On 3/19/2019 9:03 AM, levtannen wrote:
>>> Could anybody suggest me what files do I need to use the latest
>>> version of Tika and where to find them?
>> 
>> This mailing list is solr-user.  Tika is an entirely separate project from 
>> Solr within the Apache Foundation.  To get help with Tika, you'll need to 
>> ask that project.
>> 
>> https://tika.apache.org/mail-lists.html
>> 
>> Thanks,
>> Shawn
> Notice: This email and any attachments are confidential and may not be used, 
> published or redistributed without the prior written consent of the Institute 
> of Geological and Nuclear Sciences Limited (GNS Science). If received in 
> error please destroy and immediately notify GNS Science. Do not copy or 
> disclose the contents.

Reply via email to