Re: Cassandra to store 1 billion small 64KB Blobs

2010-07-25 Thread Michael Widmann
Hi Wow that was lot of information... Think about users storing files online (means with their customer name) - each customer maintains his own "hashtable" of files. Each File can consist of some or several thousand entries (depends on the size of the whole file). for example: File Test.doc c

Re: Requesting data model suggestions

2010-07-25 Thread Benjamin Black
Yes: ignore everything you are doing in your MySQL schema and don't try to figure out how to map that to Cassandra. Figure out your queries and work back from there. On Thu, Jul 22, 2010 at 9:56 AM, Scott Mann wrote: > Hello, > > Although,  I've done a bit of reading about Cassandra's data model

Re: Cassandra Chef recipe and EC2 snitch

2010-07-25 Thread Benjamin Black
the link mentions an ec2snitch, but i don't. the cookbook can just generate property files for the propertysnitch (which i much prefer to the use of a dedicated snitch). the algorithm for decided locality is at least as dumb as the one in the rack-aware snitch. on the plus side, it's just a litt

Re: Cassandra to store 1 billion small 64KB Blobs

2010-07-25 Thread Aaron Morton
Some background reading.. http://ria101.wordpress.com/2010/02/22/cassandra-randompartitioner-vs-orderpreservingpartitioner/Not sure on your follow up question, so I'll just wildly blather on about things :)My assumption of your data is you have 64K chunks that are identified by a hash, which can so

Re: Re: Re: What is consuming the heap?

2010-07-25 Thread Benjamin Black
Have you changed the default Memtable settings? Are you running on nodes with a single 1TB drive? Are you monitoring your I/O load on the nodes? On Thu, Jul 22, 2010 at 6:40 PM, 王一锋 wrote: > The version we are using is 0.6.1 > > 2010-07-23 > > >

Re: Can we filter a key or a column name using regular expression?

2010-07-25 Thread Benjamin Black
Nope. On Sun, Jul 25, 2010 at 6:05 AM, Miriam Allalouf wrote: > Hi, > I need to build a mode where I can retrieve an ordered list of objects > sharing the same prefix and contains a certain delimiter. > > For example:  Get all the object names that start with 'root' and > contain '/' in it, > giv

Re: Cassandra to store 1 billion small 64KB Blobs

2010-07-25 Thread Michael Widmann
Thanks for this detailed description ... You mentioned the secondary index in a standard column, would it be better to build several indizes? Is that even possible to build a index on for example 32 columns? The hint with the smaller boxes is very valuable! Mike 2010/7/26 Aaron Morton > For w

Re: Failing to create a 2 Node cluster on a Windows machine

2010-07-25 Thread Jonathan Ellis
I know on a mac you need to explicitly create an alias for 127.0.0.2 before it can be used. Maybe something similar applies to Windows. On Fri, Jul 23, 2010 at 11:36 AM, Alaa Zubaidi wrote: > Hi, > > I am new to Cassandra, and I want to create a 2 node cluster on the SAME > machine running windo

Re: Cassandra and Lucene

2010-07-25 Thread Aaron Morton
Sorry, also moving to User list. AaronOn 26 Jul, 2010,at 12:14 PM, Aaron Morton wrote:You may need to provide a some more information. What's the cluster configuration, what version, what's in the logs etc. AaronOn 24 Jul, 2010,at 03:40 AM, Michelan Arendse wrote:Hi I have recently started worki

Re: Cassandra to store 1 billion small 64KB Blobs

2010-07-25 Thread Aaron Morton
For what it's worth...* Many smaller boxes with local disk storage are preferable to 2 with huge NAS storage. * To cache the hash values look at the KeysCached setting in the storage-config* There are some row size limits see http://wiki.apache.org/cassandra/CassandraLimitations* If you wanted to g

Re: Can we filter a key or a column name using regular expression?

2010-07-25 Thread Aaron Morton
Have a look at how the SliceRange works  for both get_slice and KeyRange works for get_range_slices. You can query for things that start with a sub string but specifying a start value and an empty end value or an end value that is the start value concatenated with the max ascii character. Then just

Re: Cassandra crashes after reboot

2010-07-25 Thread Pieter Maes
Hi, I'm sorry for the lack of information I'm using 0.6.3. The move was moving the data dir and the commitlog dir But i now removed them and let the system bootstrap from the ring. i know i'm lacking in information here.. but i thought i needed to be mentioned overhere this could happen. Pieter

Re: Cassandra crashes after reboot

2010-07-25 Thread Peter Schuller
> I've moved my cassandra to another machine, started it up again, but got > this error Which version of Cassandra exactly? (So that one can look at matching source code) Also, were you running the exact same version of Cassandra on both servers (i.e., both the "source" and the "destination")? W

Cassandra crashes after reboot

2010-07-25 Thread Pieter Maes
Hi, I've moved my cassandra to another machine, started it up again, but got this error INFO 22:06:28,931 Replaying /var/lib/cassandra/commitlog/CommitLog-1279609619367.log, /var/lib/cassandra/commitlog/CommitLog-1279805020866.log, /var/lib/cassandra/commitlog/CommitLog-1279840051243.log INFO

Can we filter a key or a column name using regular expression?

2010-07-25 Thread Miriam Allalouf
Hi, I need to build a mode where I can retrieve an ordered list of objects sharing the same prefix and contains a certain delimiter. For example: Get all the object names that start with 'root' and contain '/' in it, given the names: r...@abc/ddd r...@bbb/c r...@ should return the first two