Re: linux intelligent ocr solution settings dialog problem

2022-09-23 Thread George N. White III
packages. I use OCR for data rescue when the only copy of the data is a PDF made from a line-printer report, and to get text in languages I don't know from screen captures, so I am vaguely familiar with Lios. I have had a number of colleagues with severe vision issues, so have also dealt with

Re: linux intelligent ocr solution settings dialog problem

2022-09-23 Thread Mgr. Janusz Chmiel
Oh, I am very sorry. I got confused. It is a comment that I had already created on 17 Feb, and I had stopped watching it. So I will rebuild Lios. ___ users mailing list -- users@lists.fedoraproject.org To unsubscribe send an email to users-le...@lists

Re: linux intelligent ocr solution settings dialog problem

2022-09-23 Thread Mgr. Janusz Chmiel
Dear Mr. White, I thought that this branch was not official, so I only watched the GitHub tree link that I sent previously to this mailing list. Sure, I will remove the non-functioning Lios and try to compile Lios from the new GitHub tree link you advised.

Re: linux intelligent ocr solution settings dialog problem

2022-09-23 Thread George N. White III
On Fri, Sep 23, 2022 at 10:04 AM Mgr. Janusz Chmiel wrote: > Unfortunately, on Fedora 35 and 36, some dependent Python component was changed. As a result, whenever the user wants to access the settings dialog, which is necessary, the app hangs. No errors are printed to the terminal related to Pytho

linux intelligent ocr solution settings dialog problem

2022-09-23 Thread Mgr. Janusz Chmiel
Because I cannot see at all, I have "fallen in love with" an excellent app named Lios. It is a complex app written in Python, and it uses the GTK 3 toolkit to create its GUI. Unfortunately, on Fedora 35 and 36, some dependent Python component was changed. As a result, whenever the user wants to access the se

Re: A reliable ocr program for Fedora

2015-12-18 Thread jd1008
On 12/15/2015 02:37 PM, Gordon Messmer wrote: On 12/15/2015 12:45 PM, jd1008 wrote: Does anyone know of a free OCR program for linux, that WORX My wife used a tesseract-ocr frontend (gimagereader, on Windows) successfully. There is a list of others: https://code.google.com/p/tesseract

Re: A reliable ocr program for Fedora

2015-12-16 Thread jd1008
. These images are NOT encrypted as they are public documents like from the DMV, ... etc. But are they good quality images? OCR needs a reasonable resolution, *and* clean character definition. When I was using tesseract a few years ago (as mentioned earlier in this thread) I was getting PDFs made of

Re: A reliable ocr program for Fedora

2015-12-15 Thread Doug
any of the pdf images I have. These images are NOT encrypted as they are public documents like from the DMV, ... etc. But are they good quality images? OCR needs a reasonable resolution, *and* clean character definition. When I was using tesseract a few years ago

Re: A reliable ocr program for Fedora

2015-12-15 Thread Fred Smith
cuments like from the DMV, ... etc. > > But are they good quality images? OCR needs a reasonable resolution, > *and* clean character definition. When I was using tesseract a few years ago (as mentioned earlier in this thread) I was getting PDFs made of scanned legal documents (from Groklaw

Re: A reliable ocr program for Fedora

2015-12-15 Thread Tim
Allegedly, on or about 15 December 2015, jd1008 sent: > Downloaded and tried tesseract and cuneiform, and both fail to > work on any of the pdf images I have. These images are NOT encrypted > as they are public documents like from the DMV, ... etc. But are they good quality images? OC

Re: A reliable ocr program for Fedora

2015-12-15 Thread Fred Smith
On Tue, Dec 15, 2015 at 09:44:01PM -0500, Fred Smith wrote: > On Tue, Dec 15, 2015 at 01:45:20PM -0700, jd1008 wrote: > > Downloaded and tried tesseract and cuneiform, and both fail to > > work on any of the pdf images I have. These images are NOT encrypted > > as they are public documents like fro

Re: A reliable ocr program for Fedora

2015-12-15 Thread Fred Smith
On Tue, Dec 15, 2015 at 01:45:20PM -0700, jd1008 wrote: > Downloaded and tried tesseract and cuneiform, and both fail to > work on any of the pdf images I have. These images are NOT encrypted > as they are public documents like from the DMV, ... etc. Last time I used tesseract (4 or 5 years, perha

Re: A reliable ocr program for Fedora

2015-12-15 Thread jd1008
On 12/15/2015 02:00 PM, Tom Horsley wrote: If you have pdf files with actual characters, the pdftotext tool works well for extracting the text (though not necessarily the layout). As far as doing OCR from actual image files, I always found tesseract to work better than most (but it was still

Re: A reliable ocr program for Fedora

2015-12-15 Thread Gordon Messmer
On 12/15/2015 12:45 PM, jd1008 wrote: Does anyone know of a free OCR program for linux, that WORX My wife used a tesseract-ocr frontend (gimagereader, on Windows) successfully. There is a list of others: https://code.google.com/p/tesseract-ocr/wiki/3rdParty -- users mailing list users

Re: A reliable ocr program for Fedora

2015-12-15 Thread dwoody5654
On 12/15/2015 03:00 PM, Tom Horsley wrote: If you have pdf files with actual characters, the pdftotext tool works well for extracting the text (though not necessarily the layout). there is an option: -layout It does a good job with preserving the layout. David As far as doing OCR from actual
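
The -layout option mentioned above can be wrapped in a small helper. This is only a sketch: the function names pdftotext_command and extract_text, and the choice to write the output next to the input as a .txt file, are assumptions for illustration and do not come from the thread. It assumes the pdftotext tool is installed and on PATH.

```python
import subprocess
from pathlib import Path


def pdftotext_command(pdf_path, keep_layout=True):
    """Build the pdftotext argument list for a PDF that has embedded text.

    Hypothetical helper: the output name (same stem, .txt suffix) is an
    assumption made for this example.
    """
    pdf = Path(pdf_path)
    cmd = ["pdftotext"]
    if keep_layout:
        cmd.append("-layout")  # the option mentioned in the thread
    # pdftotext takes the input PDF followed by the output text file.
    cmd += [str(pdf), str(pdf.with_suffix(".txt"))]
    return cmd


def extract_text(pdf_path, keep_layout=True):
    """Run pdftotext; raises CalledProcessError if the tool reports failure."""
    subprocess.run(pdftotext_command(pdf_path, keep_layout), check=True)
```

Calling extract_text("report.pdf") would run pdftotext with -layout and write report.txt. This only helps when the PDF contains actual characters; a scanned page yields little or no text.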

Re: A reliable ocr program for Fedora

2015-12-15 Thread Tom Horsley
If you have pdf files with actual characters, the pdftotext tool works well for extracting the text (though not necessarily the layout). As far as doing OCR from actual image files, I always found tesseract to work better than most (but it was still pretty feeble).

A reliable ocr program for Fedora

2015-12-15 Thread jd1008
Downloaded and tried tesseract and cuneiform, and both fail to work on any of the pdf images I have. These images are NOT encrypted as they are public documents like from the DMV, ... etc. Does anyone know of a free OCR program for linux, that WORX :) ?
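
One possible reason for the failure described above is that tesseract expects image input rather than a PDF, so a scanned PDF usually has to be rasterized first. The sketch below assumes that explanation, which the thread does not confirm. It also assumes poppler's pdftoppm and the tesseract command-line tool are installed; the 300 dpi default, the "page" file prefix, and all function names are illustrative choices, not taken from the thread.

```python
import subprocess
from pathlib import Path


def rasterize_command(pdf_path, prefix, dpi=300):
    # pdftoppm (poppler-utils) writes one PNG per page, named from the prefix.
    return ["pdftoppm", "-r", str(dpi), "-png", str(pdf_path), str(prefix)]


def ocr_command(image_path, lang="eng"):
    image = Path(image_path)
    # tesseract appends ".txt" to the output base name itself.
    return ["tesseract", str(image), str(image.with_suffix("")), "-l", lang]


def ocr_scanned_pdf(pdf_path, workdir):
    """Rasterize each page, OCR it, and return the resulting text files."""
    workdir = Path(workdir)
    workdir.mkdir(parents=True, exist_ok=True)
    prefix = workdir / "page"
    subprocess.run(rasterize_command(pdf_path, prefix), check=True)
    for image in sorted(workdir.glob("page*.png")):
        subprocess.run(ocr_command(image), check=True)
    return sorted(workdir.glob("page*.txt"))
```

As Tim's reply in this thread notes, the result still depends on reasonable resolution and clean character definition, so raising dpi is the first thing to try on a poor scan.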

tesseract OCR and page layout

2015-09-08 Thread Gary Stainburn
as such. However, when I use tesseract to extract text from PDF files that don't have embedded text I can't seem to get the same effect. Am I missing something with tesseract, or is there an alternative OCR that can give me what I want?

Re: OCR

2014-01-11 Thread Garry T. Williams
On 1-9-14 22:56:39 Robert Moskowitz wrote: > http://www.physics.ohio-state.edu/~bcd/humor/instruction.set.html Zero and add packed (ZAP) *is* an instruction on the IBM System 370, 390, etc. http://www.simotime.com/asmins01.htm#ZAP -- Garry T. Williams

Re: OCR

2014-01-10 Thread Richard Vickery
On Fri, Jan 10, 2014 at 5:56 AM, Robert Moskowitz wrote: > > On 01/10/2014 05:14 AM, poma wrote: > >> On 10.01.2014 04:56, Robert Moskowitz wrote: >> >>> For f20, is there an OCR program for extracting the text out of a pdf >>> scan? >>> >

Re: OCR

2014-01-10 Thread g
On 01/10/2014 07:55 AM, Robert Moskowitz wrote: On 01/10/2014 12:50 AM, g wrote: <<>> do you have the pdf file or are you talking about files that were run thru a scanner? I scanned my old printed copy to pdf. ok. then "pdf2txt" or "pdftotext" will/should

Re: OCR

2014-01-10 Thread poma
On 10.01.2014 14:56, Robert Moskowitz wrote: > I can just copy the text from the page into gedit and go from there. I > don't have lynx installed. Never mind that :) but take a look at this site, http://code.google.com/p/tesseract-ocr/wiki/3rdParty GUIs and Other Projects using Te

Re: OCR

2014-01-10 Thread Robert Moskowitz
On 01/10/2014 12:50 AM, g wrote: hello robert. On 01/09/2014 09:56 PM, Robert Moskowitz wrote: For f20, is there an OCR program for extracting the text out of a pdf scan? do you have the pdf file or are you talking about files that were run thru a scanner? I scanned my old printed copy to

Re: OCR

2014-01-10 Thread Robert Moskowitz
On 01/10/2014 05:14 AM, poma wrote: On 10.01.2014 04:56, Robert Moskowitz wrote: For f20, is there an OCR program for extracting the text out of a pdf scan? I have an old document of 'Assembly Instructions'. Some can be found at: http://www.physics.ohio-state.edu/

Re: OCR

2014-01-10 Thread poma
On 10.01.2014 04:56, Robert Moskowitz wrote: > For f20, is there an OCR program for extracting the text out of a pdf scan? > > I have an old document of 'Assembly Instructions'. Some can be found > at: http://www.physics.ohio-state.edu/~bcd/humor/instruction.set.html,

Re: OCR

2014-01-09 Thread g
hello robert. On 01/09/2014 09:56 PM, Robert Moskowitz wrote: For f20, is there an OCR program for extracting the text out of a pdf scan? do you have the pdf file or are you talking about files that were run thru a scanner? I have an old document of 'Assembly Instructions'. S

OCR

2014-01-09 Thread Robert Moskowitz
For f20, is there an OCR program for extracting the text out of a pdf scan? I have an old document of 'Assembly Instructions'. Some can be found at: http://www.physics.ohio-state.edu/~bcd/humor/instruction.set.html, but I have a few more. And a lot less. But I want the ones

tesseract-ocr ??

2011-04-26 Thread james tate
F14 The package tesseract-ocr , where do I find it ? I have the package tesseract-3.00-1.fc14.i686 installed. -- users mailing list users@lists.fedoraproject.org To unsubscribe or change subscription options: https://admin.fedoraproject.org/mailman/listinfo/users Guidelines: http

Re: OCR program for plots recognition

2010-09-08 Thread Hiisi
2010/9/7 Kwan Lowe : > On Tue, Sep 7, 2010 at 8:44 AM, Hiisi wrote: >> <--SNIP--> > > I don't see an RPM, but the installation is pretty simple: > > export PATH=/path/to/your/java/bin:$PATH > > sh PlotDigitizer_2.4.1_Linux_installer.bin > > It will open an installer. I install in /home/kwan/bin/

Re: OCR program for plots recognition

2010-09-07 Thread Kwan Lowe
On Tue, Sep 7, 2010 at 8:44 AM, Hiisi wrote: > > Nice try, Marco! Thank you. But I need some tool to produce data in a > text file from graph image. > And something that I can just yum' install? I don't see an RPM, but the installation is pretty simple: export PATH=/path/to/your/java/bin:$PATH
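
The PATH step above can also be scripted. This is a minimal sketch, assuming the installer only needs a java binary to be found first on PATH; the helper names env_with_java and run_installer are hypothetical, and the installer file name is the one quoted elsewhere in this thread.

```python
import os
import subprocess


def env_with_java(java_bin_dir, base_env=None):
    """Return a copy of the environment with java_bin_dir first on PATH."""
    env = dict(os.environ if base_env is None else base_env)
    existing = [p for p in env.get("PATH", "").split(os.pathsep) if p]
    env["PATH"] = os.pathsep.join([java_bin_dir] + existing)
    return env


def run_installer(installer, java_bin_dir):
    # Rough equivalent of: export PATH=<java_bin_dir>:$PATH ; sh <installer>
    subprocess.run(["sh", installer], env=env_with_java(java_bin_dir), check=True)
```

For example, run_installer("PlotDigitizer_2.4.1_Linux_installer.bin", "/path/to/your/java/bin") would launch the installer without changing PATH for the rest of the shell session.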

Re: OCR program for plots recognition

2010-09-07 Thread Marco Guazzone
On Tue, Sep 7, 2010 at 2:44 PM, Hiisi wrote: > 2010/9/7 Marco Guazzone : >> On Tue, Sep 7, 2010 at 2:08 PM, Hiisi wrote: >>> 2010/9/7 Kwan Lowe : This might help: http://plotdigitizer.sourceforge.net/ >>> >>> Thank you, Kwan. I'll try it. >>> Any other suggestions? Something f

Re: OCR program for plots recognition

2010-09-07 Thread Hiisi
2010/9/7 Marco Guazzone : > On Tue, Sep 7, 2010 at 2:08 PM, Hiisi wrote: >> 2010/9/7 Kwan Lowe : >>> >>> This might help: >>> >>> http://plotdigitizer.sourceforge.net/ >> >> Thank you, Kwan. I'll try it. >> Any other suggestions? Something from standard fedora repositories? > > potrace : http://po

Re: OCR program for plots recognition

2010-09-07 Thread Marco Guazzone
On Tue, Sep 7, 2010 at 2:08 PM, Hiisi wrote: > 2010/9/7 Kwan Lowe : >> >> This might help: >> >> http://plotdigitizer.sourceforge.net/ > > Thank you, Kwan. I'll try it. > Any other suggestions? Something from standard fedora repositories? potrace : http://potrace.sourceforge.net Never used but l

Re: OCR program for plots recognition

2010-09-07 Thread Hiisi
2010/9/7 Kwan Lowe : > > This might help: > > http://plotdigitizer.sourceforge.net/ Thank you, Kwan. I'll try it. Any other suggestions? Something from standard fedora repositories? -- Hiisi. Registered Linux User #487982. Be counted at: http://counter.li.org/ -- Spandex is a privilege, not a rig

Re: OCR program for plots recognition

2010-09-07 Thread Kwan Lowe
On Tue, Sep 7, 2010 at 7:37 AM, Hiisi wrote: > Could anybody suggest a program in the Fedora repos for plot > recognition? I have a bunch of graph images scanned from different > papers and I want to put them into my Ph.D. thesis. I have to replot > them in GNUPlot for uniformity. > TIA This migh

OCR program for plots recognition

2010-09-07 Thread Hiisi
Could anybody suggest a program in the Fedora repos for plot recognition? I have a bunch of graph images scanned from different papers and I want to put them into my Ph.D. thesis. I have to replot them in GNUPlot for uniformity. TIA -- Hiisi. Registered Linux User #487982. Be counted at: http://co

Re: A question on OCR for bad old document?

2010-06-13 Thread Jim
loaked wrote: >>>> >>>> >>>>> I have a scanned pdf of a very old document which was typewritten >>>>> about half a century ago. The scanned copy is noisy and the letters >>>>> are far from clear. The text can be made ou

Re: A question on OCR for bad old document?

2010-06-13 Thread Joachim Backes
tten >>>> about half a century ago. The scanned copy is noisy and the letters >>>> are far from clear. The text can be made out (mostly) by eye, but it >>>> is 19 pages long and I would like to OCR it to get a digitised text to >>>> save the eye strain

Re: A question on OCR for bad old document?

2010-06-13 Thread Joel Rees
is noisy and the letters >>> are far from clear. The text can be made out (mostly) by eye, but it >>> is 19 pages long and I would like to OCR it to get a digitised text to >>> save the eye strain and lots of typing. >>> >> You can't make a silk purse

Re: A question on OCR for bad old document?

2010-06-13 Thread Joel Rees
TV programmes (only joking!) > >> If you are having difficulty reading the scan yourself, then you're >> probably out of luck getting the computer to OCR it for you. >> >> Your best bet is to retype it.  It's only 19 pages so it shouldn't take &

Re: A question on OCR for bad old document?

2010-06-06 Thread Jim
The text can be made out (mostly) by eye, but it >> is 19 pages long and I would like to OCR it to get a digitised text to >> save the eye strain and lots of typing. >> > You can't make a silk purse out of a sow's ear. > > If you are having difficulty reading the

Re: A question on OCR for bad old document?

2010-06-06 Thread Paul Smith
On Sun, Jun 6, 2010 at 10:26 PM, mike cloaked wrote: >> The best OCR tool that I have found up to now is a commercial one: >> Acrobat Professional. > > Is that available for Fedora? I guess you can run it from inside Fedora, through a virtual machine running MS Windows

Re: A question on OCR for bad old document?

2010-06-06 Thread mike cloaked
yourself, then you're > probably out of luck getting the computer to OCR it for you. > > Your best bet is to retype it. It's only 19 pages so it shouldn't take I was hoping you would not say that! -- mike c

Re: A question on OCR for bad old document?

2010-06-06 Thread mike cloaked
On Sun, Jun 6, 2010 at 10:12 PM, Paul Smith wrote: > > Have you tried Tesseract? I suppose that Tesseract can work from > inside gscan2pdf. Yes, I tried tesseract and it does not seem to fare much better than the other options - (it is a tough document to OCR though) > > (http://

Re: A question on OCR for bad old document?

2010-06-06 Thread Frank Cox
and I would like to OCR it to get a digitised text to > save the eye strain and lots of typing. You can't make a silk purse out of a sow's ear. If you are having difficulty reading the scan yourself, then you're probably out of luck getting the computer to OCR it for you. Your b

Re: A question on OCR for bad old document?

2010-06-06 Thread Paul Smith
I would like to OCR it to get a digitised text to > save the eye strain and lots of typing. > > I have tried various routes to doing this, including converting the > pdf to jpg, tif and other formats after fiddling with it in GIMP to > turn it (not very well) from grey scale to

A question on OCR for bad old document?

2010-06-06 Thread mike cloaked
I have a scanned pdf of a very old document which was typewritten about half a century ago. The scanned copy is noisy and the letters are far from clear. The text can be made out (mostly) by eye, but it is 19 pages long and I would like to OCR it to get a digitised text to save the eye strain and