Re: Upgrading tika

2019-03-20 Thread Geoffrey Willis
ctor] >> Sent: Wednesday, 20 March 2019 08:17 >> To: solr-user@lucene.apache.org >> Subject: RE: Upgrading tika >> >> Sorry Erick, >> Please disregard my previous message. Somehow I downloaded the version >> without those two files. I am going to download th

RE: Upgrading tika

2019-03-20 Thread Phil Scadden
ey Willis Sent: Thursday, 21 March 2019 06:52 To: solr-user@lucene.apache.org Subject: Re: Upgrading tika Could you expand on that please? I’m currently building a webApp that submits documents to Solr/Tika via the update/extract handler and it’s working fine. What do you mean when you say “You

Re: Upgrading tika

2019-03-20 Thread Geoffrey Willis
e- > From: Tannen, Lev (USAEO) [Contractor] > Sent: Wednesday, 20 March 2019 08:17 > To: solr-user@lucene.apache.org > Subject: RE: Upgrading tika > > Sorry Erick, > Please disregard my previous message. Somehow I downloaded the version > without those two files. I

RE: Upgrading tika

2019-03-20 Thread Tannen, Lev (USAEO) [Contractor]
Thank you Shawn and Erick, I truly did not want to dive into Tika and Cxf worlds, but it looks I have no choice. -Original Message- From: Shawn Heisey Sent: Wednesday, March 20, 2019 11:09 AM To: solr-user@lucene.apache.org Subject: Re: Upgrading tika On 3/20/2019 8:24 AM, Tannen

Re: Upgrading tika

2019-03-20 Thread Shawn Heisey
On 3/20/2019 8:24 AM, Tannen, Lev (USAEO) [Contractor] wrote: I still need your advice. The program I have to fix uses class AutoDetectParser along with Solrj for parsing PDF files before sending the result to the solr server. To do this it linked two tika jar files taken from the solr distribu

Re: Upgrading tika

2019-03-20 Thread Erick Erickson
eed. > > Thank you, > Lev Tannen > > -Original Message- > From: Erick Erickson > Sent: Tuesday, March 19, 2019 2:48 PM > To: solr-user > Subject: Re: Upgrading tika > > Yes, Solr is distributed with Tika. Look in: > ./solr/contrib/extraction/lib > &

RE: Upgrading tika

2019-03-20 Thread Tannen, Lev (USAEO) [Contractor]
e projects especially because I was not able to find a binary distribution. So could you please advise what is the best way to proceed. Thank you, Lev Tannen -Original Message- From: Erick Erickson Sent: Tuesday, March 19, 2019 2:48 PM To: solr-user Subject: Re: Upgrading tika Ye

RE: Upgrading tika

2019-03-19 Thread Phil Scadden
@lucene.apache.org Subject: RE: Upgrading tika Sorry Erick, Please disregard my previous message. Somehow I downloaded the version without those two files. I am going to download the latest version solr 8.0.0 and try it. Best Lev Tannen -Original Message- From: Erick Erickson Sent: Tuesday

RE: Upgrading tika

2019-03-19 Thread Tannen, Lev (USAEO) [Contractor]
Subject: Re: Upgrading tika Yes, Solr is distributed with Tika. Look in: ./solr/contrib/extraction/lib Tika is upgraded when new versions come out, so the underlying files are whatever are current at the time. The integration is a fairly loose coupling, if you're using some external pr

RE: Upgrading tika

2019-03-19 Thread Tannen, Lev (USAEO) [Contractor]
-Original Message- From: Erick Erickson Sent: Tuesday, March 19, 2019 2:48 PM To: solr-user Subject: Re: Upgrading tika Yes, Solr is distributed with Tika. Look in: ./solr/contrib/extraction/lib Tika is upgraded when new versions come out, so the underlying files are whatever are current at the

Re: Upgrading tika

2019-03-19 Thread Erick Erickson
gt; Sent: Tuesday, March 19, 2019 11:11 AM > To: solr-user@lucene.apache.org > Subject: Re: Upgrading tika > > On 3/19/2019 9:03 AM, levtannen wrote: > > Could anybody suggest me what files do I need to use the latest > > version of Tika and where to find them? > >

RE: Upgrading tika

2019-03-19 Thread Tannen, Lev (USAEO) [Contractor]
solr does not have integrated tika anymore, or I just cannot recognize them? -Original Message- From: Shawn Heisey Sent: Tuesday, March 19, 2019 11:11 AM To: solr-user@lucene.apache.org Subject: Re: Upgrading tika On 3/19/2019 9:03 AM, levtannen wrote: > Could anybody suggest me w

Re: Upgrading tika

2019-03-19 Thread Shawn Heisey
On 3/19/2019 9:03 AM, levtannen wrote: Could anybody suggest me what files do I need to use the latest version of Tika and where to find them? This mailing list is solr-user. Tika is an entirely separate project from Solr within the Apache Foundation. To get help with Tika, you'll need to a

RE: Upgrading tika

2019-03-19 Thread Tannen, Lev (USAEO) [Contractor]
Thank you Jeremy, I am not using Maven, but I took both jars from the same distribution of cxf so they supposed to be compatible. -Original Message- From: Branham, Jeremy (Experis) Sent: Tuesday, March 19, 2019 11:10 AM To: solr-user@lucene.apache.org Subject: Re: Upgrading tika I’m

Re: Upgrading tika

2019-03-19 Thread Branham, Jeremy (Experis)
I’m not positive – But I think you should match the CXF jar versions. "cxf-core-3.2.8.jar" org.apache.cxf cxf-rt-frontend-jaxrs 3.2.8 Jeremy Branham jb...@allstate.com On 3/19/19, 10:03 AM, "levtannen" wrote: "cxf-core-3.2.8.jar" and "cxf-rt-fromtend-jaxrs-2.6.3.jar"

Upgrading tika

2019-03-19 Thread levtannen
Hello community, I am using Tika to extract the text content from pdf files before indexing them. I used version 1.7 and it worked OK except it produced a lot warnings like "Font not found". Now I am trying to move to the newer version 1.19.1 and I have problems finding all necessary dependencies.

RE: Upgrading Tika "in place"

2013-02-05 Thread Markus Jelsma
Hi, You also need pdfbox-1.7.1 and possibly also fontbox and jempbox 1.7.1. Cheers, Markus -Original message- > From:Tod > Sent: Tue 05-Feb-2013 13:59 > To: solr-user@lucene.apache.org > Subject: Upgrading Tika "in place" > > I'm

Upgrading Tika "in place"

2013-02-05 Thread Tod
I'm running an older version of Solr - 3.4.0.2011.09.09.09.06.17. It seems the version of Tika that came with it has trouble with some PDF files and newer Office documents. I've checked the latest Tika release and it solves these problems. I'd like to just drop in the necessary Tika jars wit

Re: Upgrading Tika in Solr

2010-02-18 Thread Christian Vogler
Just a word of caution: I've been bitten by this bug, which affects Tika 0.6: https://issues.apache.org/jira/browse/PDFBOX-541 It causes the parser to go into an infinite loop, which isn't exactly great for server stability. Tika 0.4 is not affected in the same way - as far as I remember, the p

Re: Upgrading Tika in Solr

2010-02-17 Thread Liam O'Boyle
I just copied in the newer .jars and got rid of the old ones and everything seemed to work smoothly enough. Liam On Tue, 2010-02-16 at 13:11 -0500, Grant Ingersoll wrote: > I've got a task open to upgrade to 0.6. Will try to get to it this week. > Upgrading is usually pretty trivial. > > > O

Re: Upgrading Tika in Solr

2010-02-16 Thread Grant Ingersoll
I've got a task open to upgrade to 0.6. Will try to get to it this week. Upgrading is usually pretty trivial. On Feb 14, 2010, at 12:37 AM, Liam O'Boyle wrote: > Afternoon, > > I've got a large collections of documents which I'm attempting to add to > a Solr index using Tika via the Extracti

Upgrading Tika in Solr

2010-02-13 Thread Liam O'Boyle
Afternoon, I've got a large collections of documents which I'm attempting to add to a Solr index using Tika via the ExtractingRequestHandler, but there are a large number that it has problems with (PDFs, PPTX and XLS documents mainly). I've tried them with the most recent stand alone version of