https://bugs.kde.org/show_bug.cgi?id=117629

José Oliver-Didier <jose_oli...@hotmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jose_oli...@hotmail.com

--- Comment #8 from José Oliver-Didier <jose_oli...@hotmail.com> ---
Noticed that this bug got included in the summer of code event, and I belive
this would be an extremely helpful tool. Services such as Google Photos and
OneDrive Photos leverage such OCR functionalities. Of usefulness is that
OneDrive displays the recognized text in the info pane. Here are some questions
meant to aid in the development of such functionality for Digikam:
-       How would the text be stored using existing metadata fields or would
new fields be created under the digikam XMP schema namespace?
-       Would the location of the words in the image also be stored so that
they cab highlighted like face regions? 
-       Would the user be able to correct any OCR errors? 
-       Would it be capable of recognizing a type of document and adding a
keyword tag (Example: receipt, screenshot, business card, invoice, bank check,
blueprint, ect)?
-       Beyond, printed material (letters, receipts), would it be able to read
text in photos in which a sign appears (Example: A street sign)?
-       Aware this may be beyond the scope of OCR, but it would be interesting
if barcodes/qr codes could be read and such information could be also stored
within the file. The Metadata Working Group Spec provides for storing barcode
regions with type=BarCode (ref page 54, MWG Working group spec 2010)

-- 
You are receiving this mail because:
You are watching all bug changes.

Reply via email to