Re: Error rendering png from PDF - Error at Type1Parser parseBinary for Type 1 Font

Tilman Hausherr Mon, 20 Sep 2021 10:38:15 -0700

Yes your suggestion makes sense. (The images can't be seen here but Isaw them in moderation). Please create an issue in JIRA. Just use thetext here.


Tilman


Am 20.09.2021 um 18:10 schrieb Fernando Sadu:

Component: PDFRenderer/Type1Parser
Affects Version/s: 2.0.23
Environment: Java 8

*Description:*
When I try to convert a pdf page using "pdfRenderer.renderImageWithDPI" to a png image using pdfboxversion 2.023 I get the following error. The pdf is a customerspecificone, I can't share the original file here.
Error [PDType1Font] Can't read the embedded Type1 fontAAAAAB+NimbusMonoPS-Regular_00java.io.IOException: Found Token[kind=NAME, text=readonly] butexpected def
at org.apache.fontbox.type1.Type1Parser.read(Type1Parser.java:867)
at org.apache.fontbox.type1.Type1Parser.parseBinary(Type1Parser.java:610)
at org.apache.fontbox.type1.Type1Parser.parse(Type1Parser.java:64)
atorg.apache.fontbox.type1.Type1Font.createWithSegments(Type1Font.java:85)
at org.apache.pdfbox.pdmodel.font.PDType1Font.<init>(PDType1Font.java:263)
atorg.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:76)
at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:146)
Using the PDFDebugger to have access to the FontFile of this pagethrowing the error I "Save the Stream as PFB" and using the t1utils<https://formulae.brew.sh/formula/t1utils> tools, using t1disasm I canobtain the readable equivalent of the binary private dictionary, whichlooks like this:
image.png
And following the trace in PDFBox I can see it blowing up at "Type1Parser.class" in the "parseBinary" method at line 602 they havethe check for ""RD".equals(key)", where key is "readonly" and the lastcheck from that list "read(Token.*/NAME/*, "def");" has to be def, andbecause the key is "readonly" it throws the error, even if it makes itpass "RD" it will have the same results when parsing the contents for"ND" and "NP".
I have seen most of the Type 1 Font files have "executeonly" insteadof "readonly" in the /Private dict section, even this same file if Iconvert it first to PS and back to PDF, extracting the font again Ican see that the instructions for RD, ND, and NP are rearranged to be"executeonly" using Mac Preview or GS ps2pdf., and the PNG isgenerated without issue using the PDFRenderer, I don't see a way howto do this step programmatically at the moment.
image.png
From the Type 1 Font Spec, they don't provide a must follow receipt onwhat instructions can be appended after RD, ND, or NP, theystate: "The RD, NP, and ND functions must be implemented by PostScriptlanguage
procedures" , If we take a look to the PostScript Language ReferenceManual:
*readonly*: When an object is read-only, its value cannot be modifiedby PostScript operators (an invalidaccess error will result), but itcan still be read by operators or executed by the PostScript interpreter.
*executeonly*: When an object is execute-only, its value cannot beread or modified explicitly by PostScript operators (an invalidaccesserror will result), but it can still be executed by the PostScriptinterpreter—for example, by invoking it with exec
Both instructions "*readonly" *and "*executeonly" *allows theinstructions to be executed.
Questions:
1. Would it be possible to add an optional "readMaybe(Token.*/NAME/*,"readonly");" to the "parseBinary" RD, "ND", and "NP" keywordssimilar to how it was done at PDFBOX-2202<https://issues.apache.org/jira/browse/PDFBOX-2202> ? checking PDFBoxlatest version 3.0 RC "Type1Parser.class" method still the same withno optional "readonly".
2. If this is not a valid constructed FontFile, and I'm not the onecreating the original pdf, neither deciding which font to embed, couldyou please suggest an alternative to deal with this case?
Thank you.

Re: Error rendering png from PDF - Error at Type1Parser parseBinary for Type 1 Font

Reply via email to