Re: How is Tika used with Solr

2016-02-12 Thread xavi jmlucjav
t;> > >> I've been wanting to add a similar watchdog to tika-server ... any > >> interest in that? > >> > >> > >> -Original Message- > >> From: xavi jmlucjav [mailto:jmluc...@gmail.com] > >> Sent: Thursday, February 11, 2016

RE: How is Tika used with Solr

2016-02-11 Thread Allison, Timothy B.
ge- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Thursday, February 11, 2016 7:46 PM To: solr-user Subject: Re: How is Tika used with Solr Well, I'd imagine you could spawn threads and monitor/kill them as necessary, although that doesn't deal with OOM errors FWIW,

Re: How is Tika used with Solr

2016-02-11 Thread Erick Erickson
o, y, if your use case allows , then we now have >> that in Tika. >> >> I've been wanting to add a similar watchdog to tika-server ... any >> interest in that? >> >> >> -Original Message- >> From: xavi jmlucjav [mailto:jmluc...@gmail.com]

Re: How is Tika used with Solr

2016-02-11 Thread xavi jmlucjav
, y, if your use case allows , then we now have > that in Tika. > > I've been wanting to add a similar watchdog to tika-server ... any > interest in that? > > > -Original Message- > From: xavi jmlucjav [mailto:jmluc...@gmail.com] > Sent: Thursday, February

Re: How is Tika used with Solr

2016-02-11 Thread Steven White
hat in Tika. > > I've been wanting to add a similar watchdog to tika-server ... any > interest in that? > > > -Original Message- > From: xavi jmlucjav [mailto:jmluc...@gmail.com] > Sent: Thursday, February 11, 2016 2:16 PM > To: solr-user > Subject: Re: How

RE: How is Tika used with Solr

2016-02-11 Thread Allison, Timothy B.
e that in Tika. I've been wanting to add a similar watchdog to tika-server ... any interest in that? -Original Message- From: xavi jmlucjav [mailto:jmluc...@gmail.com] Sent: Thursday, February 11, 2016 2:16 PM To: solr-user Subject: Re: How is Tika used with Solr I have found tha

Re: How is Tika used with Solr

2016-02-11 Thread xavi jmlucjav
sary for the vast majority of use cases. > > > > > > [0] > http://openpreservation.org/blog/2014/03/21/tika-ride-characterising-web-content-nanite/ > > > > > > > > -Original Message- > > From: Erick Erickson [mailto:erickerick...@gmail

Re: How is Tika used with Solr

2016-02-10 Thread Erick Erickson
cases. > > > [0] > http://openpreservation.org/blog/2014/03/21/tika-ride-characterising-web-content-nanite/ > > > > -Original Message- > From: Erick Erickson [mailto:erickerick...@gmail.com] > Sent: Tuesday, February 09, 2016 10:05 PM > To: solr-user >

RE: How is Tika used with Solr

2016-02-10 Thread Allison, Timothy B.
fixed this over in our tika-server. -Original Message- From: Charlie Hull [mailto:char...@flax.co.uk] Sent: Wednesday, February 10, 2016 3:55 AM To: solr-user@lucene.apache.org Subject: Re: How is Tika used with Solr On 09/02/2016 22:49, Alexandre Rafalovitch wrote: > Solr uses Tika

RE: How is Tika used with Solr

2016-02-10 Thread Allison, Timothy B.
blog/2014/03/21/tika-ride-characterising-web-content-nanite/ -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Tuesday, February 09, 2016 10:05 PM To: solr-user Subject: Re: How is Tika used with Solr My impulse would be to _not_ run Tika in its own JVM, j

Re: How is Tika used with Solr

2016-02-10 Thread Charlie Hull
On 09/02/2016 22:49, Alexandre Rafalovitch wrote: Solr uses Tika directly. And not in the most efficient way. It is there mostly for convenience rather than performance. So, for performance, Solr recommendation is also to run Tika separately and only send Solr the processed documents. Absolute

Re: How is Tika used with Solr

2016-02-09 Thread Erick Erickson
riginal Message- > From: Steven White [mailto:swhite4...@gmail.com] > Sent: Tuesday, February 09, 2016 6:33 PM > To: solr-user@lucene.apache.org > Subject: Re: How is Tika used with Solr > > Thank you Erick and Alex. > > My main question is with a long running process using Tika

RE: How is Tika used with Solr

2016-02-09 Thread Allison, Timothy B.
ginal Message- From: Steven White [mailto:swhite4...@gmail.com] Sent: Tuesday, February 09, 2016 6:33 PM To: solr-user@lucene.apache.org Subject: Re: How is Tika used with Solr Thank you Erick and Alex. My main question is with a long running process using Tika in the same JVM as my applic

Re: How is Tika used with Solr

2016-02-09 Thread Steven White
Thank you Erick and Alex. My main question is with a long running process using Tika in the same JVM as my application. I'm running my file-system-crawler in its own JVM (not Solr's). On Tika mailing list, it is suggested to run Tika's code in it's own JVM and invoke it from my file-system-crawl

Re: How is Tika used with Solr

2016-02-09 Thread Erick Erickson
Here's a writeup that should help https://lucidworks.com/blog/2012/02/14/indexing-with-solrj/ On Tue, Feb 9, 2016 at 2:49 PM, Alexandre Rafalovitch wrote: > Solr uses Tika directly. And not in the most efficient way. It is > there mostly for convenience rather than performance. > > So, for p

Re: How is Tika used with Solr

2016-02-09 Thread Alexandre Rafalovitch
Solr uses Tika directly. And not in the most efficient way. It is there mostly for convenience rather than performance. So, for performance, Solr recommendation is also to run Tika separately and only send Solr the processed documents. Regards, Alex. Newsletter and resources for Solr beg