Re: Avoiding Data Duplication

2015-06-08 Thread Paulo Motta
Some options I can think of: 1 - depending on your data size and stime query frequency, you may use spark to peform queries filtering by server time in the log table, maybe within an device time window to reduce the dataset your spark job will need to go through. more info on the spark connector:

Avoiding Data Duplication

2015-06-05 Thread Abhishek Singh Bailoo
Hello! I have a column family to log in data coming from my GPS devices. CREATE TABLE log( imei ascii, date ascii, dtime timestamp, data ascii, stime timestamp, PRIMARY KEY ((imei, date), dtime)) WITH CLUSTERING ORDER BY (dtime DESC) ; It is the standard schema for modeling time se