Hi,

There's nothing... That TODO is a comment made in 2014 in PDFBOX-2524. I came across that spot again later and added another comment. I have now had a look at it; two things would have to be done: 1) PDFont should allow retrieving that CMap, and 2) CMap should store an inverted mapping, filled e.g. in CMap.addCharMapping().

So I tried this and ran the test code from the SO issue, and saved the file. And yes, "Protocol" appeared on the second page.
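
To illustrate the second point, here is a minimal sketch of an inverted mapping that is filled while the forward mapping is built. It is not the actual PDFBox change: the class InvertibleCMapSketch and the method getCodesFromUnicode() are hypothetical names chosen for this example, and only the addCharMapping(codes, unicode) idea is taken from the description above.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a ToUnicode CMap; this is NOT the PDFBox CMap class.
class InvertibleCMapSketch {
    // Forward direction: character code -> Unicode string
    private final Map<Integer, String> codeToUnicode = new HashMap<>();
    // Inverted direction: Unicode string -> character code bytes
    private final Map<String, byte[]> unicodeToCode = new HashMap<>();

    // Stores both directions at once, so no separate inversion pass is needed.
    void addCharMapping(byte[] codes, String unicode) {
        codeToUnicode.put(toInt(codes), unicode);
        // Keep the first code seen for a string so the reverse lookup stays stable.
        unicodeToCode.putIfAbsent(unicode, codes.clone());
    }

    String toUnicode(byte[] codes) {
        return codeToUnicode.get(toInt(codes));
    }

    // Hypothetical reverse lookup; returns null if the string has no code.
    byte[] getCodesFromUnicode(String unicode) {
        byte[] codes = unicodeToCode.get(unicode);
        return codes == null ? null : codes.clone();
    }

    // Assumes codes of at most 4 bytes; codes of different lengths
    // with the same numeric value are not distinguished in this sketch.
    private static int toInt(byte[] codes) {
        int value = 0;
        for (byte b : codes) {
            value = (value << 8) | (b & 0xFF);
        }
        return value;
    }
}
```

With something like this, an encode method could ask the font's ToUnicode CMap for the code of a character instead of failing with "no glyph". When several codes map to the same string, the sketch keeps the first one; whether that is the right choice for a real font is an open question.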

Are you able to build from source? Do you want to test this with 2.0 or the trunk?

Tilman

On 12.02.2021 at 10:46, Tamas Kocsis wrote:
Hi Everyone!

We faced the issue described in this SO question:

https://stackoverflow.com/questions/61934819/pdfbox-no-glyph-for-u0050-in-extracted-font


The TODO in question is in PDCIDFontType2's encode method:

*//TODO: invert the ToUnicode CMap?*


I just wanted to ask whether you have any info on the implementation of
this one?
Is it on the roadmap, planned in the near/far future or still open...?

Best Regards: Tamas Kocsis


