avoid log spam with 0 HH rows delivered

2012-03-01 Thread Radim Kolar
Can be something made to remove these empty delivery attempts from log? Its just tombstoned row. [default@system] list HintsColumnFamily; Using default limit of 100 --- RowKey: 00 1 Row Returned. Elapsed time: 234 msec(s). INFO [HintedHandoff:1] 2012-03-02 05:44:32,359 Hinte

Re: Combining Cassandra with some SQL language

2012-03-01 Thread Data Craftsman
Agree. That is Transactional database API. Orthogonality: Modular programming, implement Transactional database API as database access interface. Thanks, Charlie | DBA On Sun, Feb 26, 2012 at 6:21 AM, Adam Haney wrote: > I've been using a combination of MySQL and Cassandra for about a year now

Composite primary key does not work on Cassandra 1.1.0-beta1

2012-03-01 Thread Data Craftsman
Howdy, Here is the the CQL and error. Did I do something wrong? /home/cassandra>cqlsh Connected to Test Cluster at localhost:9160. [cqlsh 2.1.0 | Cassandra 1.1.0-beta1 | CQL spec 2.0.0 | Thrift protocol 19.28.0] Use HELP for help. cqlsh> use demo; cqlsh:demo> CREATE TABLE timeline ( ... user_id

composite types in CQL

2012-03-01 Thread Bayle Shanks
hi, i'm wondering how to do composite data storage types in CQL. I am trying to mimic the Composite Types functionality of the Pycassa client: http://pycassa.github.com/pycassa/assorted/composite_types.html In short, in Pycassa you can do something like: --- itemTimeCompositeType = CompositeTy

Re: Is this the correct data model thinking?

2012-03-01 Thread Data Craftsman
Yes. Think in queries. • Break your normalization habit • Roughly ~one CF per query • Denormalize! •

Re: Schema change causes exception when adding data

2012-03-01 Thread Tharindu Mathew
There are 2. I'd like to wait till there are one, when I insert the value. Going through the code, calling client.describe_schema_versions() seems to give a good answer to this. And I discovered that if I wait till there is only 1 version, I will not get this error. Is this the best practice if I

Re: can't find rows

2012-03-01 Thread Casey Deccio
On Thu, Mar 1, 2012 at 9:33 AM, aaron morton wrote: > What RF were you using and had you been running repair regularly ? > > RF 1 *sigh*. Waiting until I have more/better resources to use RF > 1. Hopefully soon. In the mean time... Oddly (to me), when I removed the most recently added node, all

sstable2json timestamp format

2012-03-01 Thread Lele Gaifax
Hi all, I'm rethinking the layout of my data (currently served by cassandra 0.8.x thru pycassa 1.1, but reasonably I'm gonna use cassandra 1.x with pycassa 1.4). My current data contains a few columns storing timestamps: given that at the time there was no native pycassa support I used an ISO 860

Re: Impact of old data on performance

2012-03-01 Thread Dan Retzlaff
I've never had to deal with GC tuning since our cluster has relatively few (but large) columns. So I'll leave further comment to others, but it sounds like you're on the right track. On Thu, Mar 1, 2012 at 2:13 AM, Stefan Reek wrote: > ** > Swap is disabled on the machines, so I'm sure the JVM i

Re: Schema change causes exception when adding data

2012-03-01 Thread aaron morton
use describe cluster in the CLI to see how many schema versions there are. Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 2/03/2012, at 12:25 AM, Tharindu Mathew wrote: > > > On Thu, Mar 1, 2012 at 11:47 AM, Tharindu Mathew wrote: > Je

Re: can't find rows

2012-03-01 Thread aaron morton
What RF were you using and had you been running repair regularly ? Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 1/03/2012, at 5:51 AM, Casey Deccio wrote: > On Wed, Feb 29, 2012 at 5:29 AM, Casey Deccio wrote: > On Wed, Feb 29, 2012 at

Re: nodetool decommission unable to do its job successfully

2012-03-01 Thread aaron morton
Do you have the full stack trace and maybe a few log lines before and after ? Also what version are you using ? As a work around you can shutdown the leaving node and use nodetool removetoken from one of the others. Cheers - Aaron Morton Freelance Developer @aaronmorton http:

Re: NullPointerException on upgradesstables

2012-03-01 Thread Casey Deccio
On Thu, Mar 1, 2012 at 2:39 AM, aaron morton wrote: > I am guessing you are running low on disk space. Can you check and try to > free some up ? > > Okay, I've freed some up and am trying again. > Looks like a bug in CompactionTask.execute() see > https://issues.apache.org/jira/browse/CASSANDRA-

Re: hadoop map join with ColumnFamilyInputFormat

2012-03-01 Thread Jeremy Hanna
I haven't used that in particular, but it's pretty trivial to do that with Pig and I would imagine it would just do the right thing under the covers. It's a simple join with Pig. We use pygmalion to get data from the Cassandra bag. A simple example would be: DEFINE FromCassandraBag org.pygmal

Secondary indexes don't go away after metadata change

2012-03-01 Thread Frisch, Michael
I have a few column families that I decided to get rid of the secondary indexes on. I see that there aren't any new index SSTables being created, but all of the old ones remain (some from as far back as September). Is it safe to just delete then when the node is offline? Should I run clean-up

Re: Huge amount of empty files in data directory.

2012-03-01 Thread Henrik Schröder
Great, thanks! /Henrik On Thu, Mar 1, 2012 at 13:08, Sylvain Lebresne wrote: > It's a bug, namely: https://issues.apache.org/jira/browse/CASSANDRA-3616 > You'd want to upgrade. > > -- > Sylvain > > On Thu, Mar 1, 2012 at 1:01 PM, Henrik Schröder wrote: > > Hi, > > > > We're running Cassandra

Re: Using cassandra at minimal expenditures

2012-03-01 Thread Hontvári József Levente
For Cassandra testing I am using a very old server with a one core Celeron processor and 1GiB RAM, and another one with 4GiB and 4 cores, both with two consumer SATA hard disks. Both works, i.e. there is no out of memory error etc. There are about 10 writes and reads per second, maybe more, but

Re: Huge amount of empty files in data directory.

2012-03-01 Thread Sylvain Lebresne
It's a bug, namely: https://issues.apache.org/jira/browse/CASSANDRA-3616 You'd want to upgrade. -- Sylvain On Thu, Mar 1, 2012 at 1:01 PM, Henrik Schröder wrote: > Hi, > > We're running Cassandra 1.0.6 on Windows, and noticed that the amount of > files in the datadirectory just keeps growing. We

Huge amount of empty files in data directory.

2012-03-01 Thread Henrik Schröder
Hi, We're running Cassandra 1.0.6 on Windows, and noticed that the amount of files in the datadirectory just keeps growing. We have about 60GB of data per node, we do a major compaction about once a week, but after compaction there's a lot of 0-byte temp files and old files that are kept for some

Re: Schema change causes exception when adding data

2012-03-01 Thread Tharindu Mathew
On Thu, Mar 1, 2012 at 11:47 AM, Tharindu Mathew wrote: > Jeremiah, > > Thanks for the reply. > > This is what we have been doing, but it's not reliable as we don't know a > definite time that the schema would get replicated. Is there any way I can > know for sure that changes have propagated? > [

hadoop map join with ColumnFamilyInputFormat

2012-03-01 Thread Benoit Mathieu
Hi all, I want to write a MapReduce job with a Map task taking its data from 2 CFs. Those 2 CFs have the same row keys and are in same keyspace, so they are partionned the same way across my cluster and it would be nice that the Map task reads the both column families locally. In hadoop package o

Re: NullPointerException on upgradesstables

2012-03-01 Thread aaron morton
I am guessing you are running low on disk space. Can you check and try to free some up ? Looks like a bug in CompactionTask.execute() see https://issues.apache.org/jira/browse/CASSANDRA-3985 Hope that helps. - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpi

Re: Using cassandra at minimal expenditures

2012-03-01 Thread Ertio Lew
expensive :-) I was expecting to start with 2GB nodes, if not 1GB for intial. On Thu, Mar 1, 2012 at 3:43 PM, aaron morton wrote: > As others said, depends on load and traffic and all sorts of thins. > > if you want a number, 4Gb would me a reasonable minimum IMHO. (You may get > by with less).

Re: Using cassandra at minimal expenditures

2012-03-01 Thread aaron morton
As others said, depends on load and traffic and all sorts of thins. if you want a number, 4Gb would me a reasonable minimum IMHO. (You may get by with less). 8Gb is about the tops. Any memory not allocated to Cassandra will be used to map files into memory. If you can get machines with 8GB

Re: Failed to join ring (NAT)

2012-03-01 Thread aaron morton
> were not immediately picked up They should be re-read on startup. if they were not let us know. Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 29/02/2012, at 10:27 PM, Richard Evans wrote: > Spot on Aaron! > > Of course when I set u