I'd second Luis assumption but having a PDF for further analysis would be good. 
Feel free to open a case at PDFBox - I'll look into it.

BR
Maruan

Am 04.06.2013 um 11:27 schrieb Luis Bernardo <[email protected]>:

> Very likely the pdfbox merge utility does not merge the fonts used in the 
> documents. So if you have the same font used in two documents the font will 
> be embedded twice in the merged document. If you have many fonts the size can 
> grow considerably.
> 
> If you think the difference in size in the individual PDFs is an issue send 
> us a sample example to that we can investigate the reason for the difference. 
> 
> 
> On Tue, Jun 4, 2013 at 9:57 AM, chandone <[email protected]> wrote:
>> Is it indeed? Well, the fact is that I generate PDFs in two different ways:
>> 
>> - individual PDFs,
>> - the same PDFs, but concatenated together into one big PDF, using the
>> Apache pdfbox PDFMergerUtility.mergeDocuments() method.
>> 
>> Compared to the ones generated by FrameMaker, the individual PDFs grow from
>> 50-60 KB to 100-150 KB. In these PDF, the header and footer images are
>> present only once.
>> In the second case, the images are obviously repeated lots of times and the
>> document grow from 1-2 MB to 23 MB. In this case, I assume that, as the
>> images are originally part of separated document that were then merged, they
>> are actually present lots of times in the global document. But, as I said
>> before, images are not responsible for the huge size of the PDF alone, as
>> when I remove them the PDF is still 21 MB.
>> I'm afraid I am not at liberty to divulge the PDFs I'm referring to but I'll
>> see if I can manage to generate a dummy one. That may not be easy.
>> 
>> 
>> 
>> --
>> View this message in context: 
>> http://apache-fop.1065347.n5.nabble.com/reduce-size-of-PDF-generated-with-FOP-tp38619p38648.html
>> Sent from the FOP - Users mailing list archive at Nabble.com.
>> 
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [email protected]
>> For additional commands, e-mail: [email protected]
> 

Reply via email to