richardstartin commented on issue #7973: URL: https://github.com/apache/pinot/issues/7973#issuecomment-1013526820
Here are the compressed sizes of 8KB (1024 doubles) of data following different distributions/patterns, using Snappy and LZ4. Encodings such as XOR or delta encoding could be introduced to reduce the size of metric columns, but making it possible to compress metric columns with general-purpose compression algorithms isn't in the user's interest.

|Compression |Distribution      |Compressed Size (KB)|
|------------|------------------|--------------------|
|Uncompressed|integer increments|8.00                |
|LZ4         |integer increments|4.09                |
|Snappy      |integer increments|4.02                |
|Uncompressed|noisy increments  |8.00                |
|LZ4         |noisy increments  |8.03                |
|Snappy      |noisy increments  |8.00                |
|Uncompressed|sinusoidal        |8.00                |
|LZ4         |sinusoidal        |8.03                |
|Snappy      |sinusoidal        |8.00                |
|Uncompressed|normal(0,1)       |8.00                |
|LZ4         |normal(0,1)       |8.03                |
|Snappy      |normal(0,1)       |8.00                |
|Uncompressed|exp(0.999)        |8.00                |
|LZ4         |exp(0.999)        |7.23                |
|Snappy      |exp(0.999)        |7.16                |
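
For illustration, here is a minimal sketch of the XOR encoding mentioned above, applied to the "integer increments" pattern. It is not Pinot code: the helper names (`double_bits`, `xor_encode`, `xor_decode`, `zero_bytes`) are hypothetical, and counting zero bytes is only a rough proxy for how much redundancy the encoding exposes, not a measurement of compressed size.

```python
import struct


def double_bits(x: float) -> int:
    """Return the IEEE 754 bit pattern of x as an unsigned 64-bit integer."""
    return struct.unpack(">Q", struct.pack(">d", x))[0]


def xor_encode(values):
    """XOR each value's bit pattern with its predecessor's.

    The first value is XORed with 0, so it is stored unchanged.
    """
    encoded = []
    prev = 0
    for v in values:
        bits = double_bits(v)
        encoded.append(bits ^ prev)
        prev = bits
    return encoded


def xor_decode(encoded):
    """Invert xor_encode, returning the original doubles."""
    values = []
    prev = 0
    for e in encoded:
        bits = e ^ prev
        values.append(struct.unpack(">d", struct.pack(">Q", bits))[0])
        prev = bits
    return values


def zero_bytes(words):
    """Count zero bytes across the 8-byte big-endian form of each word."""
    return sum(struct.pack(">Q", w).count(0) for w in words)


# 1024 doubles (8KB), matching the "integer increments" pattern.
increments = [float(i) for i in range(1024)]
raw = [double_bits(v) for v in increments]
encoded = xor_encode(increments)

print("zero bytes, raw:        ", zero_bytes(raw))
print("zero bytes, XOR-encoded:", zero_bytes(encoded))
```

Neighbouring values in this pattern share their sign, exponent, and most mantissa bits, so the XOR of consecutive bit patterns is mostly zeros. That is the kind of structure a type-aware encoding can exploit directly, without relying on a general-purpose compressor to find it.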