Re: "timestamp" parameter for Thrift "insert" API ??

2010-09-20 Thread Kuan(謝冠生)
By using cassandra-cli tool, we don't have to input timestamp while insertion. Does it mean that Cassandra have time synchronization build-in already? Since cassandra depending on time-stamp parameter very much (both read/write). The most ideal way to deal with timestamp is by cassandra itself, c

Re: 0.7 memory usage problem

2010-09-20 Thread Alaa Zubaidi
Thanks Peter, I decreased the heap size, it did not help, however, it delayed the problem. I noticed that its swapping, so, do you think that I should set windows to Not to swap? Do you think its related to this issue? https://issues.apache.org/jira/browse/CASSANDRA-1014 Thanks, Alaa On 9/18

Re: How to get a list of keys of a CF?

2010-09-20 Thread Scott Mann
Aaron, Thanks. Also for the reminder about tombstones. -- -Scott

Re: Dazed and confused with Cassandra on EC2 ...

2010-09-20 Thread Peter Schuller
>  We think we might have cracked the underlying problem >  though, and it might be similar to the 'behind the scenes >  swap thing' (sadly I suspect that such things might actually >  be happening -- plus I thought that memory overcommit wasn't >  possible with Xen - only with VMware - but I guess

Re: what are ways to keep the SSTable Count down low

2010-09-20 Thread Peter Schuller
> Not in 0.6 I believe, but I believe this recently went into trunk for > the upcoming 0.7. By "not in 0.6" i mean in terms of the configuration. If you are willing to modify the source you can change it fairly easily by changing the private constants towards the beginning of org/apache/cassandra/

Re: anything on the roadmap to know the status return of executing nodetool maintenance commands

2010-09-20 Thread Nirmal Ranganathan
https://issues.apache.org/jira/browse/CASSANDRA-1449 to change nodetool to blocking for all commands. Not necessarily providing more info, but waits till the operations are complete. On Mon, Sep 20, 2010 at 4:40 PM, Dathan Pattishall wrote: > I would love the ability to run nodetool repair and ge

Re: what are ways to keep the SSTable Count down low

2010-09-20 Thread Peter Schuller
> How do you set the compaction threshold from storage-conf.xml? is this > possible? Not in 0.6 I believe, but I believe this recently went into trunk for the upcoming 0.7. > What is the consensus on a basic Key-Value store of setting the > compactionthreshold min/max from > > ./nodetool --host=

anything on the roadmap to know the status return of executing nodetool maintenance commands

2010-09-20 Thread Dathan Pattishall
I would love the ability to run nodetool repair and get status updates on what is being done and when to what, as well as an estimated time of completion Like Grabbing data from nodes populating memtable cleanups etc. Currently this stuff happens asynchronously from the nodetool point of view (fi

Re: How to get a list of keys of a CF?

2010-09-20 Thread Aaron Morton
use get_range_slices with either a SliceRange start="", finish="" and count=0 OR SlicePredicate with column_names = []. This will return no columns. Not sure if the server will detect you are asking for zero columns and take a more efficient path, however it will not return any columns with the res

inter node protocol for 0.7 nightly

2010-09-20 Thread Aaron Morton
Just took a look upgrading from from 31/08 nightly to the 20/09 and noticed the news.txt says..."The Cassandra inter-node protocol is incompatible with 0.6.x releases (and with 0.7 beta1)"Could someone point me to the ticket(s) for this change so I can see if I can do a rolling upgrade. ThanksAaron

Re: what are ways to keep the SSTable Count down low

2010-09-20 Thread Edward Capriolo
On Mon, Sep 20, 2010 at 3:14 PM, Dathan Pattishall wrote: > How do you set the compaction threshold from storage-conf.xml? is this > possible? > > What is the consensus on a basic Key-Value store of setting the > compactionthreshold min/max from > > ./nodetool --host=localhost --port=8181 getcomp

what are ways to keep the SSTable Count down low

2010-09-20 Thread Dathan Pattishall
How do you set the compaction threshold from storage-conf.xml? is this possible? What is the consensus on a basic Key-Value store of setting the compactionthreshold min/max from ./nodetool --host=localhost --port=8181 getcompactionthreshold Current compaction threshold: Min=4, Max=32 to say some

Re: Dazed and confused with Cassandra on EC2 ...

2010-09-20 Thread Jedd Rashbrooke
Hi Peter, We were logging the GC output as per this before, have since taken it out, but will put it back in I think. Apropos logging - I've found that with RMI to our boxes at EC2 I've had to do the ugly thing with this: -Djava.rmi.server.hostname= .. which then renders nodetool useless,

Re: FatClient Gossip error and some other problems

2010-09-20 Thread shimi
I was patient (although it is hard when you have millions of requests which are not served in time). I was waiting for a long time. There was nothing in the Logs and in JMX. Shimi On Mon, Sep 20, 2010 at 6:12 PM, Gary Dusbabek wrote: > On Mon, Sep 20, 2010 at 09:51, shimi wrote: > > I have a c

Re: Connection issue with trunk using .net

2010-09-20 Thread Michal Augustýn
Yeah, the solution is the same. I wanted just to point out the connection lag issue that Morten could face too. Augi 2010/9/20 Michael Greene > Sorry, got threads mixed up, but yes the solution is the same. > > > On Mon, Sep 20, 2010 at 12:40 PM, Michael Greene > wrote: > >> That... is this th

Re: Connection issue with trunk using .net

2010-09-20 Thread Michael Greene
Sorry, got threads mixed up, but yes the solution is the same. On Mon, Sep 20, 2010 at 12:40 PM, Michael Greene wrote: > That... is this thread. I'm glad the solution works for you Morten. > > > On Mon, Sep 20, 2010 at 12:33 PM, Michal Augustýn < > augustyn.mic...@gmail.com> wrote: > >> Check th

Re: Connection issue with trunk using .net

2010-09-20 Thread Michael Greene
That... is this thread. I'm glad the solution works for you Morten. On Mon, Sep 20, 2010 at 12:33 PM, Michal Augustýn wrote: > Check this thread too: > http://www.mail-archive.com/user@cassandra.apache.org/msg06024.html > > Au

Re: "timestamp" parameter for Thrift "insert" API ??

2010-09-20 Thread Michal Augustýn
Consider situation when you have more Cassandra servers. It would require time synchronization in Cassandra itself. So we have to have time synchronization - on applications servers or Cassandra servers. I'm not sure if this is concern of Cassandra but it would be good for me... 2010/9/20 Kuan(謝冠生

How to get a list of keys of a CF?

2010-09-20 Thread Scott Mann
Hello, I need to get a list of all the keys of a particular CF. The API doc indicates that "get_key_range" is gone now (I'm using 0.6.5), and that I need to use "get_range_slices," which returns the list of keys and their associated rows. Is this the only way to get the list of keys? It just seem

Re: Connection issue with trunk using .net

2010-09-20 Thread Michal Augustýn
Check this thread too: http://www.mail-archive.com/user@cassandra.apache.org/msg06024.html Augi 2010/9/20 Morten Wegelbye Nissen > On 19-09-2010 23:50, Michael Greene wrote: > >> Trunk (and 0.7) use Thrift's framed transport,

Re: Dazed and confused with Cassandra on EC2 ...

2010-09-20 Thread Peter Schuller
> Nope - no swap enabled. Something is seriously weird, unless the system clock is broken... Given: INFO [GC inspection] 2010-09-20 15:27:42,046 GCInspector.java (line 129) GC for ParNew: 325411 ms, 84284896 reclaimed leaving 640770336 used; max is 25907560448 INFO [GC inspection] 2010-09-20 15:

Re: a few generic questions

2010-09-20 Thread Mario Micklisch
Hello Scott, Thank you for your help and pointing me into some directions and also for making me aware that some things are more important! :-) I did read most of the wiki and many blogs and websites about Cassandra before deciding to go with it, even did several test installations and small load

Re: Cassandra performance

2010-09-20 Thread Edward Capriolo
On Sat, Sep 18, 2010 at 9:26 AM, Peter Schuller wrote: >>  - performance (it should be not as much less than shard of MySQL and >> scale linearly, we want to have not more that 10K inserts per second >> of writes, and probably not more than 1K/s reads which will be mostly >> random) >>  - ability

Re: Dazed and confused with Cassandra on EC2 ...

2010-09-20 Thread Dave Gardner
One other question for the list: I gather GMFD is "gossip stage" - but what does this actually mean? Is it an issue to have 203 pending operations? Thanks Dave INFO [GC inspection] 2010-09-20 16:56:12,792 GCInspector.java (line 129) GC for ParNew: 127970 ms, 570382800 reclaimed leaving 4606885

Re: Dazed and confused with Cassandra on EC2 ...

2010-09-20 Thread Dave Gardner
Nope - no swap enabled. top - 16:53:14 up 12 days, 6:11, 3 users, load average: 1.99, 2.63, 5.03 Tasks: 133 total, 1 running, 132 sleeping, 0 stopped, 0 zombie Cpu(s): 0.0%us, 0.0%sy, 0.0%ni,100.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Mem: 35840228k total, 33077580k used, 2762648k f

Re: Dazed and confused with Cassandra on EC2 ...

2010-09-20 Thread Peter Schuller
> Can anyone help shed any light on why this might be happening? We've tried a > variety of JVM settings to alleviate this; currently with no luck. Extremely long ParNew (young generations) pause times are almost always due to swapping. Are you swapping? -- / Peter Schuller

Re: commit log question

2010-09-20 Thread Peter Schuller
>    I have one question about the 'commit log' in Cassandra, so imagine > we issue a write with QUORUM, if the write was successful then we are > sure that N/2 +1 replicas have the new data. If one of these replicas > fail, no state is lost because the state is also available from > another machin

Re: Building a local cluster in Linux

2010-09-20 Thread Gary Dusbabek
On Mon, Sep 20, 2010 at 10:02, Rodrigo Hjort wrote: > Hello all, > > In order to build a Cassandra cluster exclusively for availability and > replication testings, I thought of a simple solution, based on a single > Linux instance, with no virtualization at all. > > The idea was to initialize ever

Re: FatClient Gossip error and some other problems

2010-09-20 Thread Gary Dusbabek
On Mon, Sep 20, 2010 at 09:51, shimi wrote: > I have a cluster with 6 nodes on 2 datacenters (3 on each datacenter). > I replaced all of the servers in the cluster (0.6.4) with new ones (0.6.5). > My old cluster was unbalanced since I was using Random Partitioner and I > bootstrapped all the nodes

Re: Dazed and confused with Cassandra on EC2 ...

2010-09-20 Thread Dave Gardner
As a follow up to this conversation; we are still having issues with our Cassandra cluster on EC2. It *looks* to be related to Garbage Collection; however we aren't sure what the root cause of the problem is. Here is an extract from logs: INFO [GMFD:1] 2010-09-20 15:22:00,242 Gossiper.java (line

Building a local cluster in Linux

2010-09-20 Thread Rodrigo Hjort
Hello all, In order to build a *Cassandra cluster exclusively for availability and replication testings*, I thought of a simple solution, based on a single Linux instance, with no virtualization at all. The idea was to initialize every node, run a testing client, and manually kill some nodes proc

FatClient Gossip error and some other problems

2010-09-20 Thread shimi
I have a cluster with 6 nodes on 2 datacenters (3 on each datacenter). I replaced all of the servers in the cluster (0.6.4) with new ones (0.6.5). My old cluster was unbalanced since I was using Random Partitioner and I bootstrapped all the nodes without specifying their tokens. Since I wanted the

commit log question

2010-09-20 Thread Даниел Симеонов
Hello,    I have one question about the 'commit log' in Cassandra, so imagine we issue a write with QUORUM, if the write was successful then we are sure that N/2 +1 replicas have the new data. If one of these replicas fail, no state is lost because the state is also available from another machine i

Re: Schema question

2010-09-20 Thread Juho Mäkinen
We have built a facebook style "messenger" into our web site which uses cassandra as storage backend with two column families: TalkMessages and TalkLastMessages. I've uploaded a screenshot showing the feature in action to http://img138.imageshack.us/img138/3807/talkexample.jpg TalkMessages contain

Re: Schema question

2010-09-20 Thread vineet daniel
Hi Morten Simplest appraoch that comes to my mind (without considering any other use -cases just read and unread messages) is to use two CF's 'read' and 'unread', put all new messages in 'unread' and once user reads any one one them shift the same to 'read' and mark original for deletion. Regard

Re: Schema question

2010-09-20 Thread aaron morton
Here is a discussion about implementing twitter with Cassandra http://www.rackspacecloud.com/blog/2010/05/12/cassandra-by-example/ An example of the same on github http://github.com/ericflo/twissandra If you have not done already checkout the articles page on the wiki http://wiki.apache.org/cas

Schema question

2010-09-20 Thread Morten Wegelbye Nissen
Hello List, No matter where you read, you almost every-where read the the noSQL datascema is completely different from the relational way - and after a little insight in cassandra everyone can 2nd that. But I miss to see some real-life examples on how a real system can be modelled. Lets tak

"timestamp" parameter for Thrift "insert" API ??

2010-09-20 Thread Kuan(謝冠生)
To whom it may concern, While timestamp parameter is auto generated by using cassandra-cli tool, why user have to manually insert the parameter in Cassandra-Thrift "insert" API?? In my point of view, leaving time-stamp to be inserted by client side will lead data into chaos, since clients' loc