Programmable configuration in embedded cassandra

2010-05-17 Thread Yan Virin
Hello,

I understand that currently one must have the configuration file in place in
order to start a cassandra node(even if it is embedded in the application,
i.e. using only StorageProxy without CassandraServer).
Is that correct?
It would be practical and very easy to implement a "programmable"
configuration object(like making it possible to create an instance of
DataBaseDescriptor instance not from a file) in order to make it possible to
fully embed cassandra instance into custom application.

Thanks

-- 
Jan Virin
http://www.linkedin.com/in/yanvirin


Hadoop over Cassandra

2010-05-17 Thread Yan Virin
Hi,
Can someone explain how this works? As long as I know, there is no execution
engine in Cassandra alone, so I assume that Hadoop gives the MapReduce
execution engine which uses Cassandra as the distributed storage? Is data
locality preserved? How mature this "couple" is? How is the performance of
this compared to the original Hadoop over HDFS?

Thanks,


-- 
Jan Virin
http://www.linkedin.com/in/yanvirin


Run several cassandra instances on local machine

2010-05-20 Thread Yan Virin
It seems to be impossible to run several cassandra instances on a
localmachine, due to the fact that the seeds are described as ip addresses
and not couples of ip address and port.
Is this correct? (In case I want to start a cluster of several nodes on one
machine for some testing)

-- 
Jan Virin
http://www.linkedin.com/in/yanvirin


Go over super column data and delete the first X rows

2010-05-23 Thread Yan Virin
Hi
I want to use cassandra for storing some data which gets irrelevant with
time. There will be a lot of data and I want to run a procedure which will
delete the oldest from the repository from time to time.
The column name which holds the data is a "timestamp" name, so I will want
delete some range of it from the repository.
What is the most efficient and the easiest way to do this?

Thanks,

-- 
Jan Virin
http://www.linkedin.com/in/yanvirin


Monitoring tools for cassandra

2010-05-23 Thread Yan Virin
Hi

Can you please advise a good monitoring tool for cassandra?
I am familiar with node-tool, and I heard about some web interface, can
someone elaborate?
Thanks,

-- 
Jan Virin
http://www.linkedin.com/in/yanvirin


Re: Go over super column data and delete the first X rows

2010-05-23 Thread Yan Virin
I understand, I wanted to know how exactly should I do it manually... is
there some "best" practice of going over all the "older" rows and deleting
them?

On Sun, May 23, 2010 at 5:25 PM, Jonathan Ellis  wrote:

> Answered on the user@ list.
>
> On Sun, May 23, 2010 at 3:35 AM, Yan Virin  wrote:
> > Hi
> > I want to use cassandra for storing some data which gets irrelevant with
> > time. There will be a lot of data and I want to run a procedure which
> will
> > delete the oldest from the repository from time to time.
> > The column name which holds the data is a "timestamp" name, so I will
> want
> > delete some range of it from the repository.
> > What is the most efficient and the easiest way to do this?
> >
> > Thanks,
> >
> > --
> > Jan Virin
> > http://www.linkedin.com/in/yanvirin
> >
>
>
>
> --
> Jonathan Ellis
> Project Chair, Apache Cassandra
> co-founder of Riptano, the source for professional Cassandra support
> http://riptano.com
>



-- 
Jan Virin
http://www.linkedin.com/in/yanvirin