Re: a question on cassandra data file size

2012-03-30 Thread Yiming Sun
Hi Ed, the "comp actions" stand for compaction or compression? Also, the size we obtained from the supercolumn schema was also taken many days after the data ingest, so it had to be after compact as well, no? In neither case we issued any nodetool compact commands. you are right that we probabl

Re: a question on cassandra data file size

2012-03-30 Thread Edward Capriolo
Standard columns save size over super columns. Not 50% but depending on the size of the data (3 byte values) the overhead could be significant. I have noticed that post sstable rebuild, 1.0 kicked off some comp actions behind the scenes shrinking some files significantly. On Fri, Mar 30, 2012 at 9

a question on cassandra data file size

2012-03-30 Thread Yiming Sun
Hi, I have a question on the size of cassandra data files. After we upgraded from cassandra 0.8 to 1.0, and changed our schema to use regular columns instead of supercolumns, the aggregated size of cassandra data files reduced by more than half. The source data set is the same, and we didn't set