RE: Tesseract language

2018-10-28 Thread Martin Frank Hansen (MHQ)
OCR' and now it works for Danish. Thanks again for helping. Best regards Martin -Original Message- From: Tim Allison Sent: 27. oktober 2018 14:37 To: solr-user@lucene.apache.org; u...@tika.apache.org Subject: Re: Tesseract language Martin, Let’s move this over to user@tika. Rohan,

Re: Tesseract language

2018-10-27 Thread Tim Allison
x27;t make it work with Tika alone. > > > > Best regards > > Martin > > > > > > -Original Message- > > From: Rohan Kasat > > Sent: 26. oktober 2018 21:45 > > To: solr-user@lucene.apache.org > > Subject: Re: Tesseract language > >

Re: Tesseract language

2018-10-27 Thread Rohan Kasat
tess4j if I can't make it work with Tika alone. > > Best regards > Martin > > > -Original Message- > From: Rohan Kasat > Sent: 26. oktober 2018 21:45 > To: solr-user@lucene.apache.org > Subject: Re: Tesseract language > > Hi Martin, > > Are you

RE: Tesseract language

2018-10-27 Thread Martin Frank Hansen (MHQ)
ult settings (which I > could before). Am I missing something or just mixing some things up? > > > > -Original Message- > From: Tim Allison > Sent: 26. oktober 2018 19:58 > To: solr-user@lucene.apache.org > Subject: Re: Tesseract language > > Tika reli

Re: Tesseract language

2018-10-26 Thread Rohan Kasat
I could > before). Am I missing something or just mixing some things up? > > > > -Original Message- > From: Tim Allison > Sent: 26. oktober 2018 19:58 > To: solr-user@lucene.apache.org > Subject: Re: Tesseract language > > Tika relies on you to install tes

RE: Tesseract language

2018-10-26 Thread Martin Frank Hansen (MHQ)
ge- From: Tim Allison Sent: 26. oktober 2018 19:58 To: solr-user@lucene.apache.org Subject: Re: Tesseract language Tika relies on you to install tesseract and all the language libraries you'll need. If you can successfully call `tesseract testing/eurotext.png testing/eurotext-dan -l dan

Re: Tesseract language

2018-10-26 Thread Tim Allison
> > > Lautrupparken 40-42, DK-2750 Ballerup > E-mail m...@kmd.dk Web www.kmd.dk > Mobil +4525571418 > > -Oprindelig meddelelse- > Fra: Erick Erickson > Sendt: 21. oktober 2018 22:49 > Til: solr-user > Emne: Re: Tesseract language > > Here's a skel

RE: Tesseract language

2018-10-26 Thread Martin Frank Hansen (MHQ)
en, Senior Data Analytiker Data, IM & Analytics Lautrupparken 40-42, DK-2750 Ballerup E-mail m...@kmd.dk Web www.kmd.dk Mobil +4525571418 -Oprindelig meddelelse- Fra: Erick Erickson Sendt: 21. oktober 2018 22:49 Til: solr-user Emne: Re: Tesseract language Here's a skeletal p

Re: Tesseract language

2018-10-21 Thread Gus Heck
load data to a Solr instance? > > > > Best regards > > > > Martin Frank Hansen > > > > -Oprindelig meddelelse- > > Fra: Alexandre Rafalovitch > > Sendt: 21. oktober 2018 16:26 > > Til: solr-user > > Emne: Re: Tesseract language > &

Re: Tesseract language

2018-10-21 Thread Erick Erickson
production usage, what is the > > recommended method(s) to upload data to a Solr instance? > > > > Best regards > > > > Martin Frank Hansen > > > > -Oprindelig meddelelse- > > Fra: Alexandre Rafalovitch > > Sendt: 21. oktober 2018 16:26

Re: Tesseract language

2018-10-21 Thread Alexandre Rafalovitch
ded method(s) to upload data to a Solr instance? > > Best regards > > Martin Frank Hansen > > -Oprindelig meddelelse- > Fra: Alexandre Rafalovitch > Sendt: 21. oktober 2018 16:26 > Til: solr-user > Emne: Re: Tesseract language > > There is a couple

Re: Tesseract language

2018-10-21 Thread Alexandre Rafalovitch
There is a couple of things mixed in here: 1) Extract handler is not recommended for production usage. It is great for a quick test, just like you did it, but going to production, running it externally is better. Tika - especially with large files can use up a lot of memory and trip up the Solr ins