Programmable configuration in embedded cassandra
Hello, I understand that currently one must have the configuration file in place in order to start a cassandra node(even if it is embedded in the application, i.e. using only StorageProxy without CassandraServer). Is that correct? It would be practical and very easy to implement a "programmable" configuration object(like making it possible to create an instance of DataBaseDescriptor instance not from a file) in order to make it possible to fully embed cassandra instance into custom application. Thanks -- Jan Virin http://www.linkedin.com/in/yanvirin
Hadoop over Cassandra
Hi, Can someone explain how this works? As long as I know, there is no execution engine in Cassandra alone, so I assume that Hadoop gives the MapReduce execution engine which uses Cassandra as the distributed storage? Is data locality preserved? How mature this "couple" is? How is the performance of this compared to the original Hadoop over HDFS? Thanks, -- Jan Virin http://www.linkedin.com/in/yanvirin
Run several cassandra instances on local machine
It seems to be impossible to run several cassandra instances on a localmachine, due to the fact that the seeds are described as ip addresses and not couples of ip address and port. Is this correct? (In case I want to start a cluster of several nodes on one machine for some testing) -- Jan Virin http://www.linkedin.com/in/yanvirin
Go over super column data and delete the first X rows
Hi I want to use cassandra for storing some data which gets irrelevant with time. There will be a lot of data and I want to run a procedure which will delete the oldest from the repository from time to time. The column name which holds the data is a "timestamp" name, so I will want delete some range of it from the repository. What is the most efficient and the easiest way to do this? Thanks, -- Jan Virin http://www.linkedin.com/in/yanvirin
Monitoring tools for cassandra
Hi Can you please advise a good monitoring tool for cassandra? I am familiar with node-tool, and I heard about some web interface, can someone elaborate? Thanks, -- Jan Virin http://www.linkedin.com/in/yanvirin
Re: Go over super column data and delete the first X rows
I understand, I wanted to know how exactly should I do it manually... is there some "best" practice of going over all the "older" rows and deleting them? On Sun, May 23, 2010 at 5:25 PM, Jonathan Ellis wrote: > Answered on the user@ list. > > On Sun, May 23, 2010 at 3:35 AM, Yan Virin wrote: > > Hi > > I want to use cassandra for storing some data which gets irrelevant with > > time. There will be a lot of data and I want to run a procedure which > will > > delete the oldest from the repository from time to time. > > The column name which holds the data is a "timestamp" name, so I will > want > > delete some range of it from the repository. > > What is the most efficient and the easiest way to do this? > > > > Thanks, > > > > -- > > Jan Virin > > http://www.linkedin.com/in/yanvirin > > > > > > -- > Jonathan Ellis > Project Chair, Apache Cassandra > co-founder of Riptano, the source for professional Cassandra support > http://riptano.com > -- Jan Virin http://www.linkedin.com/in/yanvirin