Re: portability between enterprise and community version

2012-06-13 Thread R. Verlangen
@Viktor: I've read/heard this many times before, however I've never seen a real explanation. Java is cross platform. If Cassandra runs properly on both Linux and Windows clusters: why would it be impossible to communicate? Of course I understand the disadvantages of having a combined cluster. 2012/

Re: Offset in slicequeries for pagination

2012-06-11 Thread R. Verlangen
I solved this by creating a manual index with integers as column keys and the UUIDs of the results as column values. Then run a slice query to determine the batch to fetch. 2012/6/11 Cyril Auburtin > using 10 results maximum per page, > > to go directly to 14th page, there is no offset=141 possi
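
The manual index from the reply above can be sketched in Python. This is only an illustration under assumptions: the index row is modelled as a plain dict from integer position to UUID, and `page_bounds` and `fetch_page` are hypothetical helper names, not part of any Cassandra client.

```python
import uuid


def page_bounds(page, page_size=10):
    """Return the first and last integer column key of a 1-based page."""
    if page < 1 or page_size < 1:
        raise ValueError("page and page_size must be >= 1")
    start = (page - 1) * page_size
    return start, start + page_size - 1


def fetch_page(index_row, page, page_size=10):
    """Simulate a column slice on the index row for positions start..end."""
    start, end = page_bounds(page, page_size)
    return [index_row[pos] for pos in range(start, end + 1) if pos in index_row]


# Stand-in for the index row: position -> UUID of the result row (assumed data).
index_row = {pos: uuid.uuid4() for pos in range(200)}
page_14 = fetch_page(index_row, 14)
```

With a real client the dict lookup would become one slice query on the index row, followed by a multiget of the returned UUIDs.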

Re: Problem in getting data from a 2 node cluster

2012-06-06 Thread R. Verlangen
a from a 2 node cluster > > What does repair do? > > Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | > www.mu-sigma.com > > *From:* R. Verlangen [mailto:ro...@us2.nl] > *Sent:* Wednesday, June 06, 2

Re: Problem in getting data from a 2 node cluster

2012-06-06 Thread R. Verlangen
Did you run repair on the new node? 2012/6/6 Prakrati Agrawal > Dear all, > > I had a 1 node cluster. Then I added 1 more node to it. > > When I ran my query on 1 node cluster I got all my data but when I ran my > query on the 2 node cluster (Hector code) I am not getting th

Re: Why Hector is taking more time than Thrift

2012-06-06 Thread R. Verlangen
Hector is a higher-level client that provides some abstraction and an easy to use interface. The Thrift API is pretty raw. So for most cases the Hector client would be the best choice; except for use-cases where the ultimate performance is a requirement (resulting in lots of more maintenance betwee

Re: about multitenant datamodel

2012-06-05 Thread R. Verlangen
Every CF has a certain amount of overhead in memory. It's just not how Cassandra is designed to be used. Maybe you could think of a way to smash data down to indices and entities. With an abstraction layer you can store practically anything in Cassandra. 2012/6/5 Toru Inoko > IMHO a model that a

Re: nodetool repair -pr enough in this scenario?

2012-06-05 Thread R. Verlangen
In your case -pr would be just fine (see Viktor's explanation). 2012/6/5 Viktor Jevdokimov > Understand simple mechanics first, decide how to act later. > > Without –PR there’s no difference from which host to run repair, it runs > for the whole 100% range, from start to end, the

Re: repair

2012-06-04 Thread R. Verlangen
The "repair -pr" only repairs the node's primary range, so it is only useful in day-to-day use. When you're recovering from a crash use it without -pr. 2012/6/4 Romain HARDOUIN > > Run "repair -pr" in your cron. > > Tamar Fraenkel wrote on 04/06/2012 13:44:32 : > > > Thanks. > > > > I actually

Re: Adding a new node to Cassandra cluster

2012-06-04 Thread R. Verlangen
> > If you use thrift API, you have to maintain lot of low level code by > yourself which is already being polished by HLC hector, pycassa also with > HLC your can easily switch between thrift and growing CQL. > > On Mon, Jun 4, 2012 at 3:00 PM, R. Verlangen wrote: > > Y

Re: Adding a new node to Cassandra cluster

2012-06-04 Thread R. Verlangen
> > I am using Thrift API and I am not able to find anything on the internet > about how to configure it for multiple nodes. I am not using any proper > client like Hector. > > ** ** > > Prakrati Agrawal | Developer - Big Data(I&D)| 9731648376 | > www.mu-sigm

Re: Adding a new node to Cassandra cluster

2012-06-04 Thread R. Verlangen
Hi there, When you speak to one node it will internally redirect the request to the proper node (local / external): but you won't be able to failover on a crash of the localhost. For adding another node to the connection pool you should take a look at the documentation of your java client. Good l

Re: Finding whether a new node is successfully added or not

2012-06-04 Thread R. Verlangen
Hi there, You can check the ring info with nodetool. Furthermore you can take a look at the streaming statistics: lots of pending indicates a node that is still receiving data from its seed(s). As far as I'm aware, the seed value is read at startup, so a restart is required. Good luck. 2

Re: Data Versioning Support

2012-05-24 Thread R. Verlangen
Hi Felipe, There recently was a thread about ( http://www.mail-archive.com/user@cassandra.apache.org/msg22298.html ). The answer in short: no. However you can build your own data model to support it. Cheers! 2012/5/24 Felipe Schmidt > Doe's Cassandra support data versioning? > > I'm trying to

Re: Number of keyspaces

2012-05-22 Thread R. Verlangen
ww.thelastpickle.com > > On 22/05/2012, at 6:58 PM, R. Verlangen wrote: > > Yes, it does. However there's no real answer what's the limit: it depends > on your hardware and cluster configuration. > > You might even want to search the archives of this mailinglist

Re: Number of keyspaces

2012-05-21 Thread R. Verlangen
Yes, it does. However there's no real answer what's the limit: it depends on your hardware and cluster configuration. You might even want to search the archives of this mailinglist, I remember this has been asked before. Cheers! 2012/5/21 Luís Ferreira > Hi, > > Does the number of keyspaces af

Re: is it possible to run cassandra process in client mode as smart proxy

2012-05-16 Thread R. Verlangen
Yes, I'm aware of those issues however in our use case they don't cause any problems. But ... If there's something better out there I'm really curious: so I'll keep up with this thread. 2012/5/16 Piavlo > On 05/16/2012 01:24 PM, R. Verlangen wrote: > > Hi the

Re: is it possible to run cassandra process in client mode as smart proxy

2012-05-16 Thread R. Verlangen
Hi there, I'm using HAProxy for PHP projects to take care of this. It improved connection pooling enormously on the client side while preserving failover capabilities. Maybe that is something for you to use in combination with PHP. Good luck! 2012/5/16 Piavlo > > Hi, > > I'm interested in using

Re: get dinamicsnith info from php

2012-05-14 Thread R. Verlangen
I struggled with this before and decided to use HAProxy which suits my needs, you can read a little more about it at my personal blog: http://www.robinverlangen.nl/index/view/4fa902c1596cb-44a627/how-to-solve-the-pain-of-stateless-php-with-cassandra.html Good luck with it! 2012/5/14 Viktor Jevd

Re: Use-case: multi-instance webshop

2012-05-10 Thread R. Verlangen
@Aaron: Solr will probably be the solution to our problem. Thank you! @Radim: We already have a Cassandra cluster, we do not want to add an extra MongoDB cluster. At this moment the data would fit easily in SQL, but we don't know how our platform grows and we want to be prepared for the future. W

Use-case: multi-instance webshop

2012-05-08 Thread R. Verlangen
Hi there, I'm working on a datamodel for a multi-website, multi-customer system. Things we would like to do: - search products (lucene / solr / solandra) - multi-filter (e.g. categories) - reviews - voting I can't really see how to do the filtering of the products by categories and even things li

Re: Bad Request: No indexed columns present in by-columns clause with "equals" operator

2012-04-24 Thread R. Verlangen
I read a while ago that a "compaction" would rebuild the index. You can trigger this by running "repair" with the nodetool. 2012/4/24 > De : mdione@orange.com [mailto:mdione@orange.com] > > [default@avatars] describe HBX_FILE; > > ColumnFamily: HBX_FILE > > Key Validation Class

Re: blob fields, bynary or hexa?

2012-04-19 Thread R. Verlangen
PHPCassa does support binaries, so that should not be the problem. 2012/4/19 phuduc nguyen > Well, I'm not sure exactly how you're passing a blob to the CLI. It would > be > helpful if you pasted your commands/code and maybe there is a simple > oversight. > > With that said, Cassandra can most d

Re: swap grows

2012-04-14 Thread R. Verlangen
here > still about 3GB (on node with JAVA_HEAP=6GB free memory also 3GB) of free > memory but there JAVA_HEAP=5G, so this looks like some sysctl > (/proc/sys/vm???) ratio (about 10%(3 / 24 * 100)), i don't known which, > anybody can explain this situation > > 2012/4/14 R.

Re: swap grows

2012-04-14 Thread R. Verlangen
It's recommended to disable swap entirely when you run Cassandra on a server. 2012/4/14 ruslan usifov > I forgot to say that system have 24GB of phis memory > > > 2012/4/14 ruslan usifov > >> Hello >> >> We have 6 node cluster (cassandra 0.8.10). On one node i increase java >> heap size to 6GB,

Re: Trouble with wrong data

2012-04-13 Thread R. Verlangen
It sounds like the commitlog has been replayed however I have really no idea whether this could have happened. Anyone? 2012/4/13 Alain RODRIGUEZ > The commitlog_total_space_in_mb was not set, I set it to avoid having the > same problem in the future. > > I am aware of the over-counting problem i

Re: cassandra 0.8.7 + hector 0.8.3: All Quorum reads result in writes?

2012-04-11 Thread R. Verlangen
Are you sure this isn't read-repair? http://wiki.apache.org/cassandra/ReadRepair 2012/4/11 Thibaut Britz > Also executing the same multiget rangeslice query over the same range > again will trigger the same writes again and again. > > On Wed, Apr 11, 2012 at 5:41 PM, Thibaut Britz < > thibaut.br

Re: need of regular nodetool repair

2012-04-11 Thread R. Verlangen
> all HH works, and you never delete your data. > > > On 04/11/2012 11:34 AM, ruslan usifov wrote: > > Sorry fo my bad english, so QUORUM allow doesn't make repair regularity? > But form your anser it does not follow > > 2012/4/11 R. Verlangen > >> Yes, I

Re: need of regular nodetool repair

2012-04-11 Thread R. Verlangen
Yes, I personally have configured it to perform a repair once a week, as the GCGraceSeconds is at 10 days. This is also what's in the manual http://wiki.apache.org/cassandra/Operations#Repairing_missing_or_inconsistent_data (point 2) 2012/4/11 ruslan usifov > Hello > > I have follow question, i

Re: Nodetool snapshot, consistency and replication

2012-04-02 Thread R. Verlangen
Ok, thank you. 2012/4/2 Rob Coli > On Mon, Apr 2, 2012 at 9:19 AM, R. Verlangen wrote: > > - 3 node cluster > > - RF = 3 > > - fully consistent (not measured, but let's say it is) > > > > Is it true that when I take a snaphot at only one of the 3 nodes

Nodetool snapshot, consistency and replication

2012-04-02 Thread R. Verlangen
Hi there, I have a question about the nodetool snapshot. Situation: - 3 node cluster - RF = 3 - fully consistent (not measured, but let's say it is) Is it true that when I take a snapshot at only one of the 3 nodes this contains all the data in the cluster (at least 1 replica)? With kind regards

Re: another DataStax OpsCenter question

2012-03-31 Thread R. Verlangen
Nick, would that also result in useless duplicates of the statistics? 2012/3/30 Nick Bailey > Unfortunately at the moment OpsCenter only really supports having one > instance per cluster. It may be possible to set up an instance in each > datacenter, however it has not been tested and each opsce

Re: Any improvements in Cassandra JDBC driver ?

2012-03-29 Thread R. Verlangen
The best would to not use update / insert at all but "set" / "put" / "save". Cheers! 2012/3/29 Dinusha Dilrukshi > What I want to tell was this driver does not use INSERT key word. Since > CQL support for using INSERT keyword and it is more generic key word used > to add new records, it's more

Re: opscenter

2012-03-29 Thread R. Verlangen
As far as I'm aware, that is not possible using the opscenter. I recommend you use the cassandra-cli and perform an update column family query. 2012/3/29 puneet loya > I m currently using the the datastax opscenter. > > How do we add column to the column families in opscenter?? > > > -- Wi

Re: How to store a list of values?

2012-03-28 Thread R. Verlangen
> > > On Wed, Mar 28, 2012 at 12:34 AM, R. Verlangen wrote: > >> Yes, that is one of the possible solutions to your problem. >> >> When you want to retrieve only the skills of a particular row just get >> the columns with "skill:" as start value. >>

Re: How to store a list of values?

2012-03-28 Thread R. Verlangen
'software engineer', > 'company': 'google', > 'location': 'orange county', > 'skill:java': '', > 'skill:html': '', > 'skill:javascript': '' > } > > > O
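
The "skill:" prefix layout shown in this thread can be illustrated with a small Python sketch. It only simulates a column slice over sorted column names; `slice_by_prefix` is a hypothetical helper, and the row below reuses just the complete columns visible in the snippet (the truncated first column is left out).

```python
from bisect import bisect_left


def slice_by_prefix(sorted_names, prefix):
    """Return the contiguous run of sorted column names starting with prefix."""
    lo = bisect_left(sorted_names, prefix)
    hi = lo
    while hi < len(sorted_names) and sorted_names[hi].startswith(prefix):
        hi += 1
    return sorted_names[lo:hi]


# Columns of one user row; a UTF8Type comparator keeps names in sorted order.
row = {
    "company": "google",
    "location": "orange county",
    "skill:java": "",
    "skill:html": "",
    "skill:javascript": "",
}
column_names = sorted(row)
skills = [name[len("skill:"):] for name in slice_by_prefix(column_names, "skill:")]
```

Because the columns are stored sorted, all "skill:" columns sit next to each other, which is why one slice with "skill:" as start value is enough.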

Re: Graveyard compactions, when do they occur?

2012-03-28 Thread R. Verlangen
Cassandra "graveyard" sounds like a lot of tombstones that will be compacted during a normal compaction. You can trigger that manually using the nodetool. 2012/3/28 Erik Forsberg > Hi! > > I was trying out the "truncate" command in cassandra-cli. > > http://wiki.apache.org/cassandra/CassandraCli0

Re: import

2012-03-27 Thread R. Verlangen
You can write your own script to parse the excel file (export as csv) and import it with batch inserts. Should be pretty easy if you have experience with those techniques. 2012/3/27 puneet loya > I want to import files from excel to cassandra? Is it possible?? > > Any tool that can help?? > > W
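
A minimal Python sketch of the CSV approach suggested above, under stated assumptions: `insert_batch` is a hypothetical callback standing in for whatever batch insert your client offers, and the sample data is made up for illustration.

```python
import csv
import io


def read_batches(csv_file, batch_size=100):
    """Yield lists of rows (as dicts keyed by the CSV header) of at most batch_size."""
    reader = csv.DictReader(csv_file)
    batch = []
    for row in reader:
        batch.append(row)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        # Flush the final, possibly smaller, batch.
        yield batch


def import_csv(csv_file, insert_batch, batch_size=100):
    """Feed every batch to insert_batch and return the number of rows imported."""
    total = 0
    for batch in read_batches(csv_file, batch_size):
        insert_batch(batch)
        total += len(batch)
    return total


# Illustrative data; a real run would open the file exported from Excel.
sample = io.StringIO("id,name\n1,alice\n2,bob\n3,carol\n")
received = []
imported = import_csv(sample, received.append, batch_size=2)
```

Keeping the batches small avoids sending one huge mutation to the cluster.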

Re: counter column family

2012-03-27 Thread R. Verlangen
'auto increment' in other > databases rite? > > > > I mean we have a column of type integer which increments with every > insert. > > > > Am i goin the rite way?? > > > > please reply :) > > On Tue, Mar 27, 2012 at 5:50 PM, R. Verlangen wrote: >

Re: counter column family

2012-03-27 Thread R. Verlangen
*create column family MyCounterColumnFamily with default_validation_class=CounterColumnType and key_validation_class=UTF8Type and comparator=UTF8Type;* There you go! Keys must be utf8, as well as the column names. Of course you can change those validators. Cheers! 2012/3/27 puneet loya > Can u

Re: Schema advice/help

2012-03-27 Thread R. Verlangen
You can just get a slice range with as start "userId:" and no end. 2012/3/27 Maciej Miklas > multiget would require Order Preserving Partitioner, and this can lead to > unbalanced ring and hot spots. > > Maybe you can use secondary index on "itemtype" - is must have small > cardinality: > http:/

Re: Fwd: information on cassandra

2012-03-27 Thread R. Verlangen
Thank you Maki, wasn't aware of that. 2012/3/27 Maki Watanabe > auto_bootstrap has been removed from cassandra.yaml and always enabled > since 1.0. > fyi. > > maki > > 2012/3/26 R. Verlangen : > > Yes, you can add nodes to a running "cluster". It'

Re: How to store a list of values?

2012-03-27 Thread R. Verlangen
That's true, but it does not sound like a real problem to me.. Maybe someone else can shed some light upon this. 2012/3/27 samal > > > On Tue, Mar 27, 2012 at 1:47 AM, R. Verlangen wrote: > >> " but any schema change will break it " >> >> How do yo

Re: unbalanced ring

2012-03-27 Thread R. Verlangen
Cassandra is built to store tons and tons of data. In my opinion roughly ~ 6MB per node is not enough data to allow it to become a fully balanced cluster. 2012/3/27 Tamar Fraenkel > This morning I have > nodetool ring -h localhost > Address DC RackStatus State Load >

Re: problem in create column family

2012-03-26 Thread R. Verlangen
> so can i use comparator='utf8' or not?? > > > Please reply > > > On Mon, Mar 26, 2012 at 9:17 PM, R. Verlangen wrote: > >> You should use the full type names, e.g. >> >> create column family MyColumnFamily with comparator=UTF8Type; >> >>

Re: cassandra 1.08 on java7 and win7

2012-03-26 Thread R. Verlangen
Ben Coverston wrote earlier today: " Use a version of the Java 6 runtime, Cassandra hasn't been tested at all with the Java 7 runtime" So I think that might be a good way to start. 2012/3/26 Frank Hsueh > I think I have cassandra the server started > > In another window: > > > cassandra-cl

Re: Performance overhead when using start and end columns

2012-03-26 Thread R. Verlangen
@Aaron: Very interesting article! Mentioned it on my Dutch blog. 2012/3/26 Mohit Anchlia > Thanks! > > > On Mon, Mar 26, 2012 at 10:53 AM, aaron morton wrote: > >> See the test's in the article. >> >> The code I used for profiling is also available. >> >> Cheers >> >>- >> Aar

Re: How to store a list of values?

2012-03-26 Thread R. Verlangen
ething along the lines of adding a skills super column to the User table: >>> >>> skills: { >>> 'java': null, >>> 'c++': null, >>> 'cobol': null >>> } >>> >>> However, I'm still not

Re: How to store a list of values?

2012-03-26 Thread R. Verlangen
null, > 'cobol': null > } > > However, I'm still not sure yet how to accomplish this with Astyanax. > I've only figured out how to make composite columns with predefined column > names with it and not dynamic column names like this. > > > > On Mon,

Re: How to store a list of values?

2012-03-26 Thread R. Verlangen
In this case you only need the columns for values. You don't need the column-values to hold multiple columns (the super-column principle). So a normal CF would work. 2012/3/26 Ben McCann > Thanks for the reply Samal. I did not realize that you could store a > column with null value. Do you kno

Re: problem in create column family

2012-03-26 Thread R. Verlangen
You should use the full type names, e.g. create column family MyColumnFamily with comparator=UTF8Type; 2012/3/26 puneet loya > It is giving errors like " Unable to find abstract-type class > 'org.apache.cassandra.db.marshal.utf8' " > > and java.lang.RuntimeException: > org.apache.cassandra.db.m

Re: Fwd: information on cassandra

2012-03-26 Thread R. Verlangen
Yes, you can add nodes to a running "cluster". It's very simple: configure the cluster name and seed node(s) in cassandra.yaml, set auto_bootstrap to true and start the node. 2012/3/26 puneet loya > 5n.. consider i m starting on a single node. can I add nodes later?? plz > reply :) > > > On Sun,

Re: Error in FAQ?

2012-03-26 Thread R. Verlangen
If you want to modify a column family, just open the command line interface (cassandra-cli), connect to a node (probably: "connect localhost/9160;"). When you have to create your first keyspace type: "create keyspace MyKeyspace;" For modifying an existing keyspace type: "use MyKeyspace;" If you

Re: cassandra-cli and "uncreachable" status confusion

2012-03-20 Thread R. Verlangen
That's correct. If you run describe cluster normally you'll see something like: Cluster Information: Snitch: org.apache.cassandra.locator.SimpleSnitch Partitioner: org.apache.cassandra.dht.RandomPartitioner Schema versions: 3a0f6a80-7140-11e1--511aec3785ff: [IP_OF_NODE, IP_OF

Re: 0.8.1 Vs 1.0.7

2012-03-17 Thread R. Verlangen
Check your log for messages about rebuilding indices: that might grow your dataset some. One thing is for sure: the data import removed all the crap that lasted in the 0.8.1 cluster (duplicates, tombstones etc). The decrease is fairly dramatic but not illogical at all. 2012/3/16 Jeremiah Jordan

Re: Single Node Cassandra Installation

2012-03-17 Thread R. Verlangen
aybe even used off >>> ebay??) so I could have at least two of them. >>> >>> If you do go with a one node solution, altho I haven't tried it myself >>> Priam looks like a good place to start for backups, otherwise roll your own >>> with inc

Re: Single Node Cassandra Installation

2012-03-15 Thread R. Verlangen
Hi Drew, One other disadvantage is the lack of "consistency level" and "replication". Both are part of the high availability / redundancy. So you would really need to backup your single-node-"cluster" to some other external location. Good luck! 2012/3/15 Drew Kutcharian > Hi, > > We are worki

Re: Node joining / unknown

2012-03-08 Thread R. Verlangen
e current state? And if so, > how to solve it? > > 2012/3/7 > >> Maybe it wait for verification compaction on other node? >> >> >> >> >> >> -Original Message- >> From: "R. Verlangen" >> To: user@cassandra.apache.o

Re: Node joining / unknown

2012-03-07 Thread R. Verlangen
@Brandon: Thank you for the information. I'll do that next time. @Igor: Is there any way to find out whether that is the current state? And if so, how to solve it? 2012/3/7 > Maybe it wait for verification compaction on other node? > > > > > > -Original Message- >

Re: Node joining / unknown

2012-03-07 Thread R. Verlangen
> Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 7/03/2012, at 9:11 PM, R. Verlangen wrote: > > Hi there, > > I'm currently in a really weird situation. > - Nodetool ring says node X is joining

Node joining / unknown

2012-03-07 Thread R. Verlangen
Hi there, I'm currently in a really weird situation. - Nodetool ring says node X is joining (this already takes 12 hours, with no activity) - When I try to remove the token, it says: Exception in thread "main" java.lang.UnsupportedOperationException: Token not found. - Removetoken status = No toke

Re: TimeUUID

2012-02-28 Thread R. Verlangen
For querying purposes it would be better to use readable strings because you can really get information out of that. TimeUUID is just a unique value based on time; but not only the time. 2012/2/28 Tamar Fraenkel > Hi! > I have a column family where I use rows as "time buckets". > What I do is t
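
To illustrate that a TimeUUID is based on time but not only on time, here is a small Python sketch using the standard `uuid` module (version 1 UUIDs). The helper name `uuid1_to_datetime` is made up for this example.

```python
import uuid
from datetime import datetime, timezone

# 100-nanosecond intervals between 1582-10-15 (UUID epoch) and 1970-01-01.
UUID_EPOCH_OFFSET = 0x01B21DD213814000


def uuid1_to_datetime(u):
    """Extract the embedded timestamp of a version 1 UUID as a UTC datetime."""
    if u.version != 1:
        raise ValueError("not a version 1 (time-based) UUID")
    unix_seconds = (u.time - UUID_EPOCH_OFFSET) / 1e7
    return datetime.fromtimestamp(unix_seconds, tz=timezone.utc)


# Two UUIDs created right after each other still differ, because the value
# also contains a clock sequence and a node identifier.
a = uuid.uuid1()
b = uuid.uuid1()
created_at = uuid1_to_datetime(a)
```

The timestamp can be recovered, but the raw value is not human readable, which is the point made above about preferring readable strings for querying.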

Re: Combining Cassandra with some SQL language

2012-02-26 Thread R. Verlangen
> you're planning on creating a data model that spans multiple databases make > sure you encapsulate the logic to read/write/delete information in a good > data model library and only use that library to access your data. This is > good practice anyway but when you add the ext

Combining Cassandra with some SQL language

2012-02-26 Thread R. Verlangen
Hi there, I'm currently busy with the technical design of a new project. Of course it will depend on your needs, but is it weird to combine Cassandra with a SQL language like MySQL? In my usecase it would be nice because we have some tables/CF's with lots and lots of data that does not really hav

Re: Please advise -- 750MB object possible?

2012-02-22 Thread R. Verlangen
I would suggest you chunk them down into small pieces (~ 10-50MB) and just fetch all the parts you need. A problem might be that if fetching one fails, the whole blob is useless. 2012/2/22 Rafael Almeida > Keep them where? > > -- > *From:* Mohit Anchlia > *To:* use
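
The chunking idea above could look roughly like this in Python. It is a sketch under assumptions: the chunks are kept in a list here, whereas in practice each chunk would be stored as its own column or row, and `split_blob` / `join_chunks` are hypothetical names.

```python
def split_blob(blob, chunk_size):
    """Split a bytes object into consecutive chunks of at most chunk_size bytes."""
    if chunk_size < 1:
        raise ValueError("chunk_size must be >= 1")
    return [blob[i:i + chunk_size] for i in range(0, len(blob), chunk_size)]


def join_chunks(chunks, expected_count):
    """Reassemble the blob; fail loudly if any chunk is missing."""
    if len(chunks) != expected_count or any(c is None for c in chunks):
        raise ValueError("missing chunk: the whole blob is unusable")
    return b"".join(chunks)


blob = bytes(range(256)) * 4          # 1024 bytes of illustrative data
chunks = split_blob(blob, 300)        # 300 + 300 + 300 + 124 bytes
restored = join_chunks(chunks, expected_count=4)
```

Storing the expected chunk count next to the chunks makes the failure case mentioned above detectable instead of silently returning a truncated blob.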

Re: List all keys with RandomPartitioner

2012-02-22 Thread R. Verlangen
You can leave the end key empty. 1) Start with "startkey" = "" 2) Next iteration start with "startkey" = "last key of the previous batch" 3) Keep on going until you ran out of results 2012/2/22 Rafael Almeida > > > > From: Franc Carter > >To: user@cassandra.apac
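
The three steps above can be sketched in Python. Assumptions: `fetch_range(start_key, count)` is a hypothetical stand-in for a range query that returns keys in ring order with an inclusive start key, and the fake cluster below exists only to make the example runnable.

```python
def iterate_keys(fetch_range, batch_size=100):
    """Yield every row key by repeatedly asking for the next batch."""
    if batch_size < 2:
        # The start key is inclusive, so a batch of 1 would never advance.
        raise ValueError("batch_size must be >= 2")
    start_key = ""
    first_batch = True
    while True:
        batch = fetch_range(start_key, batch_size)
        if not first_batch:
            # Drop the first key: it was the last key of the previous batch.
            batch = batch[1:]
        if not batch:
            return
        for key in batch:
            yield key
        start_key = batch[-1]
        first_batch = False


def make_fake_cluster(keys_in_ring_order):
    """Build a fetch_range over a fixed list (order is by token, not by key)."""
    def fetch_range(start_key, count):
        start = 0 if start_key == "" else keys_in_ring_order.index(start_key)
        return keys_in_ring_order[start:start + count]
    return fetch_range


ring_order = ["kiwi", "apple", "mango", "cherry", "banana"]
all_keys = list(iterate_keys(make_fake_cluster(ring_order), batch_size=2))
```

Note that with the RandomPartitioner the keys come back in token order, so the result is complete but not alphabetically sorted.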

Re: Newbie Question: Cassandra consuming 100% CPU on ubuntu server

2012-02-18 Thread R. Verlangen
You might want to check your Cassandra logs, they contain important information that might lead you to the actual cause of the problems. 2012/2/18 Aditya Gupta > Thanks! But what about the 100% cpu consumption that is causing the server > to hang? > > > On Sat, Feb 18, 2012 at 6:19 PM, Watanabe

Re: Replication factor per column family

2012-02-17 Thread R. Verlangen
Ok, that's clear, thank you for your time! 2012/2/16 aaron morton > yes. > > - > Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 16/02/2012, at 10:15 PM, R. Verlangen wrote: > > Hmm ok. This means

Re: Replication factor per column family

2012-02-16 Thread R. Verlangen
er reasons as well. In short it's baked in. > > Cheers > > - > Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 16/02/2012, at 9:54 PM, R. Verlangen wrote: > > Hi there, > > As the subject states: "I

Replication factor per column family

2012-02-16 Thread R. Verlangen
Hi there, As the subject states: "Is it possible to set a replication factor per column family?" Could not find anything of recent releases. I'm running Cassandra 1.0.7 and I think it should be possible on a per CF basis instead of the whole keyspace. With kind regards, Robin

Re: Wide row column slicing - row size shard limit

2012-02-16 Thread R. Verlangen
Things you should know: - Thrift has a limit on the amount of data it will accept / send, you can configure this in Cassandra: 64 MB should still work fine (1) - Rows should not become huge: this will make "perfect" load balancing impossible in your cluster - A single row should fit on a disk - T

Re: CQL query issue when fetching data from Cassandra

2012-02-16 Thread R. Verlangen
I'm not sure about your first 2 questions. The third might be an exception: check your Cassandra logs. About the "like"-thing: there's no such query possibility in Cassandra / CQL. You can take a look at Hadoop / Hive to tackle those problems. 2012/2/16 Roshan > Hi > > I am using Cassandra 1.0.

Re: timed-out retrieving a giant row.

2012-02-14 Thread R. Verlangen
Of course you should set your limit to 100 or something like that, not Integer.MAX_VALUE ;-) 2012/2/14 R. Verlangen > I'm familiar to this in PHPCassa, but with Hector it would be something > like this: > > Query you CF with a range.setStart(lastColName) and > range.setFin

Re: timed-out retrieving a giant row.

2012-02-14 Thread R. Verlangen
I'm familiar with this in PHPCassa, but with Hector it would be something like this: Query your CF with a range.setStart(lastColName) and range.setFinish(StringUtils.byte("")), where "lastColName" is the name of the column from the previous read. You can continue this until you run out of result

Re: deleting rows and tombstones

2012-02-14 Thread R. Verlangen
Are you planning to insert rows with keys that existed before? If that's true, there will be no tombstones (as far as I understand Cassandra). If that's not, then you will get tombstones that might slow down the reads because they have to be skipped until the next compaction. 2012/2/14 Todd Burr

Re: Querying for rows without a particular column

2012-02-14 Thread R. Verlangen
One option might be to maintain an index containing the keys of the rows. The index would then have the same TTL as the row itself so when you iterate over the index columns you'll find exactly the same results. Although I'm not really sure whether this is the best option. Another might be to use

Re: Deleting a column vs setting it's value to empty

2012-02-14 Thread R. Verlangen
<< Setting to "" may cause you less headaches as you won't have to deal with tombstones >> You won't have to deal with tombstones manually, the Thrift API will take care of this. Deleting an empty column value will always be better; with one exception, when "empty" does actually mean something e

Re: keycache persisted to disk ?

2012-02-13 Thread R. Verlangen
I also noticed that Cassandra appears to perform better under a continuous load. Are you sure the rows you're querying are actually in the cache? 2012/2/13 Franc Carter > 2012/2/13 R. Verlangen > >> This is because of the "warm up" of Cassandra as it starts.

Re: keycache persisted to disk ?

2012-02-13 Thread R. Verlangen
This is because of the "warm up" of Cassandra as it starts. On a start it will start fetching the rows that were cached: this will have to be loaded from the disk, as there is nothing in the cache yet. You can read more about this at http://wiki.apache.org/cassandra/LargeDataSetConsiderations 201

Re: Need database to log and retrieve sensor data

2012-02-06 Thread R. Verlangen
As far as I'm familiar with Cassandra, I gave my opinion for every requirement on your list: 1) 10k inserts / seconds should be no problem at all for Cassandra 2) Cassandra should scale to that 3) As the homepage of Cassandra states that amount of data should be able to fit (source: http://cassan

Re: nodetool hangs and didn't print anything with firewall

2012-02-06 Thread R. Verlangen
Do you allow both outbound and inbound traffic? You might also try allowing both TCP and UDP. 2012/2/6 Roshan > Yes, If the firewall is disable it works. > > -- > View this message in context: > http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/nodetool-hangs-and-didn-t-print-anythi

Re: Best way to know the cluster status

2012-02-06 Thread R. Verlangen
You might consider writing some kind of php script that runs nodetool "ring" and parse the output? 2012/2/6 Tamil selvan R.S > Hi, > What is the best way to know the cluster status via php? > Currently we are trying to connect to individual cassandra instance with > a specified timeout and if

Re: yet a couple more questions on composite columns

2012-02-05 Thread R. Verlangen
Yiming, I am using 2 CF's. Performance wise this should not be an issue. I use it for small files data store. My 2 CF's are: FilesMeta FilesData 2012/2/5 Yiming Sun > Interesting idea, Jim. Is there a reason you don't you use > "metadata:{accountId}" instead? For performance reasons? > > > On

Re: yet a couple more questions on composite columns

2012-02-04 Thread R. Verlangen
I just kept both row keys the same. This was very trivial for fetching them both. When you have A, you can fetch B, and vice versa. 2012/2/4 Yiming Sun > Interesting idea, R.V. But what did you do with the row keys? > > > On Sat, Feb 4, 2012 at 2:29 PM, R. Verlangen wrote: >

Re: yet a couple more questions on composite columns

2012-02-04 Thread R. Verlangen
I also made something like this a while ago. I decided to go for the 2-rows-solution: by doing that you don't have the need for super columns. Cassandra is really good at reading, so this should not be an issue. Cheers! 2012/2/4 Yiming Sun > Thanks Andrey and Chris. It sounds like we don't nec

Re: Can you query Cassandra while it's doing major compaction

2012-02-02 Thread R. Verlangen
It will have a performance penalty, so it would be better to spread the compactions over a period of time. But Cassandra will still take care of any reads/writes (within the given timeout). 2012/2/3 myreasoner > If every node in the cluster is running major compaction, would it be able > to > an

Re: Restart cassandra every X days?

2012-02-02 Thread R. Verlangen
-- > Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 2/02/2012, at 9:55 PM, R. Verlangen wrote: > > Yes, I already did a repair and cleanup. Currently my ring looks like this: > > Address DC RackSt

Re: Restart cassandra every X days?

2012-02-02 Thread R. Verlangen
hen > 2) nodetool cleanup if you have changed the tokens at any point finally > > Cheers > > - > Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 31/01/2012, at 11:56 PM, R. Verlangen wrote: > > After running

Re: Restart cassandra every X days?

2012-01-31 Thread R. Verlangen
ww.thelastpickle.com > > On 29/01/2012, at 4:14 AM, Maxim Potekhin wrote: > > Sorry if this has been covered, I was concentrating solely on 0.8x -- > can I just d/l 1.0.x and continue using same data on same cluster? > > Maxim > > > On 1/28/2012 7:53 AM, R. Verlangen wr

Re: Any tools like phpMyAdmin to see data stored in Cassandra ?

2012-01-30 Thread R. Verlangen
You might run it from a VM? 2012/1/30 Ertio Lew > > > On Mon, Jan 30, 2012 at 7:16 AM, Frisch, Michael < > michael.fri...@nuance.com> wrote: > >> OpsCenter? >> >> http://www.datastax.com/products/opscenter >> >> - Mike >> >> >> I have tried Sebastien's phpmyAdmin For >> Cassandra

Re: Restart cassandra every X days?

2012-01-28 Thread R. Verlangen
Ok, seems that it's clear what I should do next ;-) 2012/1/28 aaron morton > There are no blockers to upgrading to 1.0.X. > > A > - > Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 28/01/2012, at 7:48 AM

Re: How to create a table in Cassandra

2012-01-27 Thread R. Verlangen
A "table" is called a "column family" in Cassandra. From the CLI you can just create one by typing: create column family MyApplication; -- Forwarded message -- > From: > Date: Fri, Jan 27, 2012 at 2:36 PM > Subject: How to create a table in Cassandra > To: d...@cassandra.apac

Re: Restart cassandra every X days?

2012-01-27 Thread R. Verlangen
andra process. You will > lose all the run time optimizations the JVM has made (i think the mapped > files pages will stay resident). As well as adding additional entropy to > the system which must be repaired via HH, RR or nodetool repair. > > If you want to see compacted files p

Re: Restart cassandra every X days?

2012-01-25 Thread R. Verlangen
ssandra.apache.org > Subject: Re: Restart cassandra every X days? > > > On 01/25/12 19:18, R. Verlangen wrote: > > Ok thank you for your feedback. I'll add these tasks to our daily > > cassandra maintenance cronjob. Hopefully this will keep things under > > c

Re: Restart cassandra every X days?

2012-01-25 Thread R. Verlangen
Thanks for the reminder. I'm going to start by adding the cleanup & compact to the chain of maintenance tasks. In my opinion Java should determine by itself when to start a GC: it doesn't feel natural to do this manually. 2012/1/25 Karl Hiramoto > > On 01/25/12 19:18, R. Verl

Re: Restart cassandra every X days?

2012-01-25 Thread R. Verlangen
Ok thank you for your feedback. I'll add these tasks to our daily cassandra maintenance cronjob. Hopefully this will keep things under control. 2012/1/25 Karl Hiramoto > On 01/25/12 16:09, R. Verlangen wrote: > >> Hi there, >> >> I'm currently running a 2-no

Restart cassandra every X days?

2012-01-25 Thread R. Verlangen
Hi there, I'm currently running a 2-node cluster for some small projects that might need to scale-up in the future: that's why we chose Cassandra. The actual problem is that one of the node's harddrive usage keeps growing. For example: - after a fresh restart ~ 10GB - after a couple of days runni

Re: Enable thrift logging

2012-01-24 Thread R. Verlangen
Pick a "custom" loglevel and redirect them with the /etc/syslog.conf ? 2012/1/24 ruslan usifov > Hello > > I try to log thrift log message (this need to us for solve communicate > problem between Cassandra daemon and php client ), so in > log4j-server.properties i write follow lines: > > log4j.l

Re: Tips for using OrderedPartitioner

2012-01-24 Thread R. Verlangen
If you would like to index your rows in an "index-row", you could also choose for indexing the "index-rows". This will scale up for any needs and create a tree structure. 2012/1/24 aaron morton > Nothing I can thin of other than making the keys uniform. > > Having a single index row with the RP

Re: Data Model Question

2012-01-21 Thread R. Verlangen
A couple of days ago I came across Countandra ( http://countandra.org/ ). It seems that it might be a solution for you. Regards, Robin 2012/1/20 Tamar Fraenkel > > Hi! > > I am a newbie to Cassandra and seeking some advice regarding the data > model I should use to best address my needs. > >

Re: nodetool ring question

2012-01-19 Thread R. Verlangen
p://www.thelastpickle.com > > On 18/01/2012, at 10:58 PM, Michael Vaknine wrote: > > I did restart the cluster and now it is normal 5GB. > ** ** > *From:* R. Verlangen [mailto:ro...@us2.nl] > *Sent:* Wednesday, January 18, 2012 11:32 AM > *To:* user@cassandra.apache.org
