On 2014.05.12 10:35, Tom Wijsman wrote:
> On Sun, 11 May 2014 19:46:50 +0200
> Michał Górny <mgo...@gentoo.org> wrote:
>
> > Rationale: xz-utils is quite widespread nowadays and it is a part
> > of @system set. It can achieve better compression ratio than bzip2,
> > and faster decompression at the same time.
>
> Some thoughts:
>
> What about putting multiple doc / man / info files in a single .xz
> file for each package? Would that further improve the situation?
>
> (As they can share dictionary, instead of having multiple
> dictionaries)
>
> Some algorithms tend to work better for smaller files, whereas others
> work better for larger files; might this be the case for bzip2 vs.
> xz?
>
> --
> With kind regards,
>
> Tom Wijsman (TomWij)
> Gentoo Developer
>
> E-mail address : tom...@gentoo.org
> GPG Public Key : 6D34E57D
> GPG Fingerprint : C165 AF18 AB4C 400B C3D2 ABF0 95B2 1FCD 6D34 E57D
>
Some more thoughts ...

What about not compressing files smaller than the filesystem block
size at all? In my case it's 4k. Any file gets allocated 4k on disk
anyway, so compressing and decompressing files <= 4k is just a waste
of resources.

I'm not suggesting dynamically determining the output filesystem
block size (unless you really want to); just choose a static limit
below which compression will not be applied. That eliminates the
discussion about small files.

--
Regards,

Roy Bamford
(Neddyseagoon) a member of
elections
gentoo-ops
forum-mods
trustees
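
To make the static-limit idea concrete, here is a rough sketch in Python. It is not how Portage does or would do this; the names SIZE_LIMIT, blocks_used and compress_if_worthwhile are made up for illustration, and the 4096-byte limit is simply the 4k figure from above, assumed rather than detected.

```python
import lzma
import os

# Assumed static limit in bytes; 4096 is the 4k block size mentioned above.
SIZE_LIMIT = 4096


def blocks_used(size, block_size=SIZE_LIMIT):
    """Number of whole blocks a file of `size` bytes occupies (ceiling division)."""
    return -(-size // block_size)


def compress_if_worthwhile(path, limit=SIZE_LIMIT):
    """Compress `path` to `path + '.xz'` only if it is larger than `limit`.

    Returns the path of the file left on disk: the original path when the
    file is at or below the limit, otherwise the new .xz path.
    """
    if os.path.getsize(path) <= limit:
        # The file fits in one block already, so compressing saves nothing.
        return path
    out = path + ".xz"
    with open(path, "rb") as src, lzma.open(out, "wb") as dst:
        dst.write(src.read())
    os.remove(path)
    return out
```

With this limit, a 3000-byte man page is left alone, since blocks_used(3000) is 1 whether or not it is compressed, while a 5000-byte file is compressed because it might drop from two blocks to one.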