I am just learning the ropes here so perhaps it is not CEP worthy.  That being said, It felt like there was a lot of information to put into and track in a ticket, particularly when I expected discussion about how to best encode, changes to the algorithms etc.  It feels like it would be difficult to track. But if that is standard for this project I will move the information there.

As to the benchmarking, I had thought that usage and performance measures should be included.  Thank you for calling out the subset of data selected query as being of particular importance.

Claude

On 06/09/2022 03:11, Abe Ratnofsky wrote:
Looking at this link: 
https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-23%3A++Enhancement+for+Sparse+Data+Serialization

Do you have any plans to include benchmarks in your test plan? It would be 
useful to include disk usage / read performance / write performance comparisons 
with the new encodings, particularly for sparse collections where a subset of 
data is selected out of a collection.

I do wonder whether this is CEP-worthy. The CEP says that the changes will not 
impact existing users, will be backwards compatible, and overall is an 
efficiency improvement. The CEP guidelines say a CEP is encouraged “for 
significant user-facing or changes that cut across multiple subsystems”. Any 
reason why a Jira isn’t sufficient?

Abe

On Sep 5, 2022, at 1:57 AM, Claude Warren via dev <dev@cassandra.apache.org> 
wrote:

I have just posted a CEP  covering an Enhancement for Sparse Data Serialzation. 
 This is in response to CASSANDRA-8959

I look forward to responses.


Reply via email to