Hi,

I'm using pdf_fullrewrite as a robust and reliable PDF sanitizer for
defective/corrupt PDFs. It can handle quite a few PDFs where pdftk and
qpdf crash/abort.

Over the last few days I've been hacking on pdf_fullrewrite to report
the exact differences between the old and the rewritten PDF and the
results have been interesting. Quite a lot of differences happen for
corrupt input (the data is sanitized/corrected as expected), but it also
seems that Flate-encoded non-stream data is decoded upon rewrite whereas
Flate-encoded stream data stays Flate-encoded. Would patches to
implement an additional parameter decoding everything upon rewrite
(--decompress) be accepted?

Regards,
Carl-Daniel
-- 
http://www.hailfinger.org/
_______________________________________________
poppler mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/poppler

Reply via email to