Re: Time series modeling in C* for range queries

2017-11-19 Thread Jon Haddad
Hi Junaid, I wrote a blog post a few months ago on massively scalable time series, covering a couple of bucketing techniques that you might find helpful. http://thelastpickle.com/blog/2017/08/02/time-series-data-modeling-massive-scale.html

Re: Time series modeling in C* for range queries

2017-11-19 Thread Justin Cameron
Hi Junaid, Using a "bucketing" key ("day") is the recommended way to limit the size of partitions. In your case you would probably need something like: PRIMARY KEY ((deviceid, day), datetime). Have you considered computing a running aggregate as the data comes into Cassandra? Rather than execute
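
A minimal Python sketch of how a range query could be split across the day buckets implied by PRIMARY KEY ((deviceid, day), datetime). The table name `readings`, the helper names, and the use of naive UTC timestamps are assumptions made for illustration; they do not come from the thread. The sketch only builds the per-bucket statements and parameters; it does not talk to a cluster.

```python
from datetime import date, datetime, timedelta

# Hypothetical table name; the column names follow the PRIMARY KEY quoted above.
SELECT_CQL = (
    "SELECT * FROM readings "
    "WHERE deviceid = ? AND day = ? AND datetime >= ? AND datetime <= ?"
)


def day_bucket(ts: datetime) -> date:
    """Return the day bucket a timestamp falls into (timestamps assumed to be UTC)."""
    return ts.date()


def buckets_for_range(start: datetime, end: datetime) -> list:
    """List every day bucket touched by the inclusive range [start, end]."""
    if end < start:
        raise ValueError("end must not be earlier than start")
    span_days = (day_bucket(end) - day_bucket(start)).days
    return [day_bucket(start) + timedelta(days=i) for i in range(span_days + 1)]


def range_queries(deviceid: str, start: datetime, end: datetime) -> list:
    """Build one (statement, parameters) pair per bucket, so each query hits one partition."""
    return [
        (SELECT_CQL, (deviceid, day, start, end))
        for day in buckets_for_range(start, end)
    ]


if __name__ == "__main__":
    queries = range_queries(
        "device-1", datetime(2017, 11, 18, 23, 0), datetime(2017, 11, 20, 1, 0)
    )
    for statement, params in queries:
        print(params[1], statement)
```

Each pair would then be executed separately (possibly concurrently) and the results merged by the client, which keeps every read confined to a single bounded partition.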

Time series modeling in C* for range queries

2017-11-19 Thread Junaid Nasir
We are building an IoT platform where time series data from millions of devices is to be collected and then used for analytics pertaining to Business Intelligence/Analytics (BI/BA). Within the above context, we are running into the issue of having range-based queries, where the granularity of