Hi, all,
I tried to upgrade Tika 0.8 to Tika 0.10 on Solr 3.3.0, following similar
steps, but it failed.
1. Replace the following jars in /contrib/extraction/
fontbox-1.6.0, jempbox-1.6.0, pdfbox-1.6.0, tika-core-0.10,
tika-parsers-0.10;
2. Copy all the jars in /contrib/langid/* from solr3.5.0
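The jar swap in step 1 can be planned before anything is deleted. Below is a minimal sketch, assuming the jars follow the usual `artifact-version.jar` naming. The helper names `split_jar_name` and `plan_jar_swap` are hypothetical, and the old version numbers in the usage note are made up for illustration; they do not come from this thread.

```python
import re

# Matches "<artifact>-<version>.jar", e.g. "tika-core-0.10.jar".
# Assumption: the version is the part after the first "-" that is followed by a digit.
JAR_RE = re.compile(r"^(?P<artifact>.+?)-(?P<version>\d[\w.\-]*)\.jar$")


def split_jar_name(filename):
    """Return (artifact, version) for a jar file name, or None if it does not match."""
    m = JAR_RE.match(filename)
    if m is None:
        return None
    return m.group("artifact"), m.group("version")


def plan_jar_swap(existing, replacements):
    """Return (to_remove, to_add): old jars superseded by a replacement, and jars to copy in."""
    new_by_artifact = {}
    for name in replacements:
        parsed = split_jar_name(name)
        if parsed:
            new_by_artifact[parsed[0]] = name

    to_remove = []
    for name in existing:
        parsed = split_jar_name(name)
        # Remove only jars of the same artifact whose file name (version) differs.
        if parsed and parsed[0] in new_by_artifact and name != new_by_artifact[parsed[0]]:
            to_remove.append(name)

    to_add = [n for n in new_by_artifact.values() if n not in existing]
    return sorted(to_remove), sorted(to_add)
```

For example, with a hypothetical directory containing `tika-core-0.8.jar`, `tika-parsers-0.8.jar`, `pdfbox-1.2.1.jar` and `poi-3.7.jar`, passing the five replacement jars listed above would mark the first three for removal and leave `poi-3.7.jar` alone. Leaving an old version of the same artifact on the classpath next to the new one is a common cause of failed upgrades like this.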
I have upgraded my Solr distribution to 3.2, along with the jars my
application refers to (in particular, the Solr jar in the application that
calls Solr was 1.4.1, which caused a javabin exception). I also updated
pdfbox/jempbox/fontbox to the latest versions and Tika to 0.9, which made
thi
Glad it worked out!
Cheers,
Chris
On Jun 22, 2011, at 5:14 AM, Surendra wrote:
> Hi Chris ,Andreas
>
> I have upgraded to solr 3.2 ... everything seems fine now. I will have to
> integrate this to my application and observe if any further issues...again
> thanks for your patience and time...
>
Hi Chris, Andreas
I have upgraded to Solr 3.2 and everything seems fine now. I still have to
integrate this into my application and watch for any further issues. Again,
thanks for your patience and time.
--Surendra
irely certain, if a recompile of Solr was necessary or not.
Andreas
From: Surendra
To: solr-user@lucene.apache.org
Sent: Tue, June 21, 2011 5:18:31 AM
Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1
Hi Andreas
I tried solr 3.1 as well as 3.2... i was not able to ove
Hi Surendra,
Thanks. Besides replacing the tika-*-0.9.jar files, you also need to replace
the dependency jar files for the other libs as well since they have been
upgraded. It's also possible that b/c of API changes, Solr 1.4.1 won't work
with Tika 0.9 without modifying the ExtractingRequestHan
Hi Andreas
I tried Solr 3.1 as well as 3.2, but I was not able to overcome these issues
with the newer versions either. I need the query attr_content:* to return
results (this works with 1.4.1), which is not happening. It indexes well
in 3.1, but in 3.2 I have the following issue.
Invali
Hi Chris
I did a proper checkout of Tika 0.9, built the jars as specified in
http://tika.apache.org/0.9/gettingstarted.html and replaced the existing
Tika 0.4 jars with the 0.9 jars. I don't see any difference. The documents are
getting indexed but the fmap.content (attr_content) is still not
ive). Ultimately, I decided that the then
newly released Solr 3.1 was the less rocky route. Not sure if that is an
option for you.
Andreas
From: "Mattmann, Chris A (388J)"
To: "solr-user@lucene.apache.org"
Sent: Mon, June 20, 2011 7:18:
Hi Surendra,
On Jun 20, 2011, at 4:59 AM, Surendra wrote:
> Hey Chris
>
> I have added tika-core 0.9 and tika-parsers 0.9 to Solr1.4.1 (extraction/lib)
> after building them using the source provided by TIKA. Now I have an issue
> with this. I am working with extracting PDF content using Solr
Mattmann, Chris A (388J jpl.nasa.gov> writes:
>
> Hi Jo,
>
> You may consider checking out Tika trunk, where we recently have a Tika JAX-RS
> web service [1] committed as part of the tika-server module. You could probably
> wire DIH into it and accomplish the same thing.
>
> Cheers,
> Chris
>
a
> 0.8 has been fixed. (TIKA-548)
>
> Andreas
>
>
>
>
> From: Darx Oman
> To: solr-user@lucene.apache.org
> Sent: Fri, February 25, 2011 10:33:39 AM
> Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1
>
> hi
> if you
e.org
Sent: Fri, February 25, 2011 10:33:39 AM
Subject: Re: upgrading to Tika 0.9 on Solr 1.4.1
hi
if you want to index PDF files then use Tika 0.6
because 0.7 and 0.8 do not correctly detect the pdfParse
hi
if you want to index PDF files then use Tika 0.6
because 0.7 and 0.8 do not correctly detect the pdfParse
You guys are great. I will stick to the release version for now, and if I
have problems parsing I will give the branch jars a try. The reason I am
looking to upgrade Tika is that it keeps improving on things like
languages, MIME type support, and so on.
thanks again
JO
Hi Jo,
You may consider checking out Tika trunk, where we recently committed a Tika
JAX-RS web service [1] as part of the tika-server module. You could probably
wire DIH into it and accomplish the same thing.
Cheers,
Chris
[1] https://issues.apache.org/jira/browse/TIKA-593
On Feb 24, 201
You don't want to use 0.8 if you're parsing PDF.
> Your best bet is perhaps upgrading to latest 1.4 branch, i.e. 1.4.2-dev
> (http://svn.apache.org/repos/asf/lucene/solr/branches/branch-1.4/) It
> includes Tika 0.8-SNAPSHOT and is a compatible drop-in (war/jar)
> replacement with lots of other bug
Your best bet is perhaps upgrading to latest 1.4 branch, i.e. 1.4.2-dev
(http://svn.apache.org/repos/asf/lucene/solr/branches/branch-1.4/)
It includes Tika 0.8-SNAPSHOT and is a compatible drop-in (war/jar) replacement
with lots of other bug fixes you'd also like (check changes.txt).
svn co http
I have tried the steps indicated here:
http://wiki.apache.org/solr/ExtractingRequestHandler
and when I try to parse a document nothing happens, not even an error. I have
copied the jar files everywhere, and still nothing. Can anyone give me the steps
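When extraction appears to do nothing and reports no error, it can help to separate the two requests involved: posting a file to the extract handler, then querying for the extracted content. Below is a minimal sketch that only builds the two URLs. It assumes the handler is mapped at `/update/extract` as on the wiki page above and that Solr runs at `http://localhost:8983/solr`. The function names and the document id `doc1` are hypothetical. The `attr_` prefix and the `attr_content` field follow the usage mentioned elsewhere in this thread and must exist in your schema.

```python
from urllib.parse import urlencode

# Assumption: default example host, port and path; adjust to your installation.
SOLR_BASE = "http://localhost:8983/solr"


def extract_url(doc_id, base=SOLR_BASE):
    """URL for posting one file to the ExtractingRequestHandler."""
    params = [
        ("literal.id", doc_id),            # unique key for the document
        ("uprefix", "attr_"),              # prefix for fields not in the schema
        ("fmap.content", "attr_content"),  # map Tika's "content" to attr_content
        ("commit", "true"),                # make the document searchable at once
    ]
    return base + "/update/extract?" + urlencode(params)


def check_query_url(base=SOLR_BASE):
    """URL for a query that returns hits only if attr_content was populated."""
    return base + "/select?" + urlencode([("q", "attr_content:*"), ("rows", "1")])


print(extract_url("doc1"))
print(check_query_url())
```

The file itself would be sent to the first URL as a multipart form upload, for instance with `curl -F`. If the upload succeeds but the second query returns nothing, the document was indexed without extracted text, which usually points at the Tika jars or the field mapping and not at the request.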