I debugged the PDF a little. FWIW, the following code (using iText) takes it to 9MB:
public static void main(String args[]) throws Exception { Document document = new Document(); PdfSmartCopy copy = new PdfSmartCopy(document, new FileOutputStream("/home/rmuir/Downloads/test.pdf")); //copy.setCompressionLevel(9); //copy.setFullCompression(); document.open(); PdfReader reader = new PdfReader("/home/rmuir/Downloads/apache-solr-ref-guide-4.7.pdf"); int pages = reader.getNumberOfPages(); for (int i = 0; i < pages; i++) { PdfImportedPage page = copy.getImportedPage(reader, i+1); copy.addPage(page); } copy.freeReader(reader); reader.close(); document.close(); } On Wed, Mar 5, 2014 at 10:17 AM, Steve Rowe <sar...@gmail.com> wrote: > Not sure if it’s relevant anymore, but a few years ago Atlassian resolved as > "won’t fix” a request to configure exported PDF compression ratio: > <https://jira.atlassian.com/browse/CONF-21329>. Their suggestion: zip the > PDF. I tried that - the resulting zip size is roughly 9MB, so it’s > definitely compressible. > > Steve > > On Mar 5, 2014, at 10:03 AM, Cassandra Targett <casstarg...@gmail.com> wrote: > >> You know, I didn't even notice that. It did go up to 30M. >> >> I've made a note to look into that before we release the 4.8 version to see >> if it can be reduced at all. I suspect the screenshots are causing it to >> balloon - we made some changes to the way they appear in the PDF for 4.7 >> which may be the cause, but also the software was upgraded and maybe the >> newer version is handling them differently. >> >> Thanks for pointing that out. >> >> >> On Tue, Mar 4, 2014 at 6:43 PM, Alexandre Rafalovitch >> <arafa...@gmail.com>wrote: >> >>> Has it really gone up in size from 5Mb for 4.6 version to 30Mb for 4.7 >>> version? Or some mirrors are playing tricks (mine is: >>> http://www.trieuvan.com/apache/lucene/solr/ref-guide/ ) >>> >>> Regards, >>> Alex. >>> Personal website: http://www.outerthoughts.com/ >>> LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch >>> - Time is the quality of nature that keeps events from happening all >>> at once. Lately, it doesn't seem to be working. (Anonymous - via GTD >>> book) >>> >>> >>> On Wed, Mar 5, 2014 at 1:39 AM, Cassandra Targett <ctarg...@apache.org> >>> wrote: >>>> The Lucene PMC is pleased to announce that we have a new version of the >>>> Solr Reference Guide available for Solr 4.7. >>>> >>>> The 395 page PDF serves as the definitive user's manual for Solr 4.7. It >>>> can be downloaded from the Apache mirror network: >>>> >>>> https://www.apache.org/dyn/closer.cgi/lucene/solr/ref-guide/ >>>> >>>> Cassandra >>> >