Hi,

I'm using pdf_fullrewrite as a robust and reliable PDF sanitizer for defective/corrupt PDFs. It can handle quite a few PDFs where pdftk and qpdf crash/abort.

Over the last few days I've been hacking on pdf_fullrewrite to report the exact differences between the old and the rewritten PDF, and the results have been interesting. Many of the differences show up for corrupt input (the data is sanitized/corrected as expected), but it also seems that Flate-encoded non-stream data is decoded upon rewrite, whereas Flate-encoded stream data stays Flate-encoded.

Would patches implementing an additional parameter (--decompress) that decodes everything upon rewrite be accepted?

Regards,
Carl-Daniel

-- 
http://www.hailfinger.org/
_______________________________________________
poppler mailing list
http://lists.freedesktop.org/mailman/listinfo/poppler
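
For illustration only, here is a rough sketch of what decoding a single Flate-encoded stream on rewrite would amount to. It is written in Python rather than as a patch against pdf_fullrewrite, and everything in it is an assumption made for the example: the stream dictionary is modelled as a plain dict, the name decompress_stream is hypothetical, and only a lone /FlateDecode filter without /DecodeParms is handled. Anything else is passed through untouched.

```python
import zlib


def decompress_stream(stream_dict, data):
    """Return (new_dict, raw_data) with a lone /FlateDecode filter removed.

    stream_dict is a plain dict standing in for a PDF stream dictionary
    (hypothetical representation, not poppler's API). Streams that use any
    other filter, a filter chain, or /DecodeParms are returned unchanged.
    """
    if stream_dict.get("/Filter") != "/FlateDecode":
        return dict(stream_dict), data
    if "/DecodeParms" in stream_dict:
        # Predictor handling is out of scope for this sketch.
        return dict(stream_dict), data

    raw = zlib.decompress(data)

    # Drop the filter entry and record the length of the decoded data.
    new_dict = {k: v for k, v in stream_dict.items() if k != "/Filter"}
    new_dict["/Length"] = len(raw)
    return new_dict, raw


# Usage: round-trip a small content stream.
content = b"BT /F1 12 Tf (Hello) Tj ET"
packed = zlib.compress(content)
new_dict, raw = decompress_stream(
    {"/Filter": "/FlateDecode", "/Length": len(packed)}, packed
)

# A stream with a different filter is left alone.
other_dict, other_data = decompress_stream(
    {"/Filter": "/DCTDecode", "/Length": 4}, b"\xff\xd8\xff\xd9"
)
```

A real --decompress would also have to deal with filter arrays, predictors and the other filter types, which this sketch deliberately skips.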
