https://bugs.kde.org/show_bug.cgi?id=117629
José Oliver-Didier <jose_oli...@hotmail.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |jose_oli...@hotmail.com --- Comment #8 from José Oliver-Didier <jose_oli...@hotmail.com> --- Noticed that this bug got included in the summer of code event, and I belive this would be an extremely helpful tool. Services such as Google Photos and OneDrive Photos leverage such OCR functionalities. Of usefulness is that OneDrive displays the recognized text in the info pane. Here are some questions meant to aid in the development of such functionality for Digikam: - How would the text be stored using existing metadata fields or would new fields be created under the digikam XMP schema namespace? - Would the location of the words in the image also be stored so that they cab highlighted like face regions? - Would the user be able to correct any OCR errors? - Would it be capable of recognizing a type of document and adding a keyword tag (Example: receipt, screenshot, business card, invoice, bank check, blueprint, ect)? - Beyond, printed material (letters, receipts), would it be able to read text in photos in which a sign appears (Example: A street sign)? - Aware this may be beyond the scope of OCR, but it would be interesting if barcodes/qr codes could be read and such information could be also stored within the file. The Metadata Working Group Spec provides for storing barcode regions with type=BarCode (ref page 54, MWG Working group spec 2010) -- You are receiving this mail because: You are watching all bug changes.