RE: OrderPreservingPartitioner for get_range_slices

2010-09-15 Thread Marie-Anne
I have tried twice to remove myself from this listserv by following the directions I received when I signed up. Can anyone help me? Marie-Anne From: Phil Stanhope [mailto:stanh...@gmail.com] Sent: Wednesday, September 15, 2010 12:43 PM To: user@cassandra.apache.org Subject: Re: OrderPreservi

Build an index to for join query

2010-09-15 Thread Alvin UW
Hello, I am going to build an index to join two CFs. First, we see this index as a CF/SCF. The difference is I don't materialise it. Assume we have two tables: ID_Address(*Id*, address) , Name_ID(*name*, id) Then,the index is: Name_Address(*name*, address) When the application tries to query on

Re: OrderPreservingPartitioner for get_range_slices

2010-09-15 Thread Phil Stanhope
My experience for the last question is ... it depends. If you have NO changes to the store (which I would argue could be abnormal, it's not in a production environment allowing writes) ... then I you can do a full range/key scan and get no repeats. Factors that will impact scanning all keys includ

Re: OrderPreservingPartitioner for get_range_slices

2010-09-15 Thread Michal Augustýn
And what about uniqueness? Can we be sure that we get each row just once? Thanks. 2010/9/15 Janne Jalkanen > > Correct. You can use get_range_slices with RandomPartitioner too, BUT the > iteration order is non-predictable, that is, you will not know in which > order you get the rows (RandomPart

Re: Minor question on index design

2010-09-15 Thread Janne Jalkanen
Ok, thanks. I'm going with Option 1, and try to steer away from SuperColumns. That also gives me the option to tweak the caches depending on the use pattern (User CF will be accessed in a lot of different ways, not just with relation to Objects). /Janne On Sep 14, 2010, at 23:46 , Aaron

Re: OrderPreservingPartitioner for get_range_slices

2010-09-15 Thread Janne Jalkanen
Correct. You can use get_range_slices with RandomPartitioner too, BUT the iteration order is non-predictable, that is, you will not know in which order you get the rows (RandomPartitioner would probably better be called ObscurePartitioner - it ain't random, but it's as good as if it were

Re: Bootstrapping stays stuck

2010-09-15 Thread Gurpreet Singh
This problem still stays unresolved despite numerous restarts to the cluster. I cant seem to find a way out of this one, and I am not really looking for a workaround, kinda need this to work if i need to go to production. Turned on the ALL logging in log4j, and now I see the following exception (E

OrderPreservingPartitioner for get_range_slices

2010-09-15 Thread Rana Aich
Hi All, I was under the impression that in order to query with get_range_slices one has to have a OrderPreservingPartitioner. Can we do get_range_slices with RandomPartitioner also? I can distinctly remember I read that(OrderPreservingPartitioner for get_range_slices) in Cassnadra WIKI but now so

Re: Large File Storage

2010-09-15 Thread Lucas Nodine
Jonathan, So it is "safe" to use a column to hold the entire data file assuming there is enough heap space? Or are there other considerations of which I should be concerned? As thrift is used and all data must be loaded into memory (see Wiki), I should still expect to benefit from breaking the d

Re: Cassandra performance

2010-09-15 Thread Wayne
If MySQL is faster then use it. I struggled to do side by side comparisons with Mysql for months until finally realizing they are too different to do side by side comparisons. Mysql is always faster out of the gate when you come at the problem thinking in terms of relational databases. Add in repli

Re: Large File Storage

2010-09-15 Thread Jonathan Ellis
the row-in-memory-during-compaction was fixed some time ago for 0.7 (CASSANDRA-16). On Wed, Sep 15, 2010 at 10:03 AM, Lucas Nodine wrote: > Hello Users, > > I am planning a system where both metadata and data will be stored.  Usually > it will be small file such as word documents along with some

Re: how long to wait for action (waiting for bootstrap token)

2010-09-15 Thread Jonathan Ellis
On Wed, Sep 15, 2010 at 10:10 AM, Matthias L. Jugel wrote: > Strange, it needed a few attempts before they actuallyfound each other with > several UP and downs > Now the node started bootstrapping, finally. It should be noted in the docs > that this is a blocker. One expects that they pick up af

Re: Embedded Cassandra in JBoss

2010-09-15 Thread Gary Dusbabek
On Wed, Sep 15, 2010 at 09:17, Sandor Molnar wrote: > Hello guys! > > I just started researching today whether we can use Cassandra in our software > or not and I need to know if it is possible to use embedded Cassandra in > Jboss somehow. I googled it and I found several pages like > http://wik

Re: how long to wait for action (waiting for bootstrap token)

2010-09-15 Thread Matthias L. Jugel
Strange, it needed a few attempts before they actuallyfound each other with several UP and downs Now the node started bootstrapping, finally. It should be noted in the docs that this is a blocker. One expects that they pick up after finding each other again as the docs say be patient. Leo. On

Large File Storage

2010-09-15 Thread Lucas Nodine
Hello Users, I am planning a system where both metadata and data will be stored. Usually it will be small file such as word documents along with some specific data about the file. Sometimes, there will be a large file, possibly a few hundred meg - a gig such as video. I have read a lot about su

Embedded Cassandra in JBoss

2010-09-15 Thread Sandor Molnar
Hello guys! I just started researching today whether we can use Cassandra in our software or not and I need to know if it is possible to use embedded Cassandra in Jboss somehow. I googled it and I found several pages like http://wiki.apache.org/cassandra/StorageProxy http://wiki.apache.org/cassa

Re: how long to wait for action (waiting for bootstrap token)

2010-09-15 Thread Jonathan Ellis
The source node going down (whether it was really down/unavailable or a FD false positive doesn't matter) will prevent bootstrap from completing. You'll have to restart the new node. On Wed, Sep 15, 2010 at 5:09 AM, Matthias L. Jugel wrote: > Hi, > > I would like to extend our cassandra system by

Re: max columns number

2010-09-15 Thread Jonathan Ellis
2 billion. In 0.6 (but not 0.7) there is also a 2GB row size limitation. On Wed, Sep 15, 2010 at 1:43 AM, Mark Zitnik wrote: > HI > > What is the max columns number in a key that cassandra supports. > > Thanks > -Mark Zitnik > -- Jonathan Ellis Project Chair, Apache Cassandra co-founder of R

how long to wait for action (waiting for bootstrap token)

2010-09-15 Thread Matthias L. Jugel
Hi, I would like to extend our cassandra system by another node. It contains about 800GB data right now. I have set up another host and installed cassandra (both 0.6.3). The new cassandra node starts up and contacts the first to receive "load info" and then tries to receive the bootstrap token:

Re: Cassandra performance

2010-09-15 Thread Peter Schuller
> But to be honest I'm pretty disappointed that Cassandra doesn't really > scale linearly (or "semi-linearly" :)) when adding new machines. I It really should scale linearly for this workload unless I have missed something important (in which case I hope someone will chime in). But note that you a