RE: dropping keyspace in cassandra

2011-03-02 Thread Sagar Kohli
Thanks Aaron, I also restarted the Cassandra instance, but the data is still there. Is there any mechanism to reclaim the storage? If so, can you please give some pointers on that? Sagar -Original Message- From: Joshua Partogi [mailto:joshua.j...@gmail.com] Sent: Thursday, March 03, 2011 4:45 AM T

How to use JConsole to connect to a Cassandra cluster in Amazon EC2?

2011-03-02 Thread Sameer Farooqui
I want to use JConsole to look at Cassandra's MBean attributes (like Memtable stats). But since my Cassandra nodes are running on Amazon EC2 instances, I have to use an ssh login with a ppk authentication file (when connecting via PuTTY, anyway). It looks like two popular approaches to esta

Re: Defrag

2011-03-02 Thread Robert Coli
On Wed, Mar 2, 2011 at 2:51 PM, Peter Schuller wrote: > Cassandra is about as friendly as it can be with respect to > fragmentation. All data is written sequentially and in bulk. That said, there can be more than one thread writing at a time, which doesn't help fragmentation. On the plus side, "m

Re: dropping keyspace in cassandra

2011-03-02 Thread Joshua Partogi
Hi Aaron, Does that mean the data will be deleted eventually? Does this also depend on the compaction configuration? Thanks. On Thu, Mar 3, 2011 at 7:23 AM, Aaron Morton wrote: > What exactly was the folder that was left in place? Dropping a keyspace does > not physically delete the data immed

Re: Defrag

2011-03-02 Thread Peter Schuller
> Are there any details on how much of an issue fragmentation is (with > Cassandra ) ? > With all the merging and deletes that happen with during compactions, > how does the disk fragmentation look like over time ? Any thumb-rules > on how frequently and how to defrag ? Cassandra is about as frien

Defrag

2011-03-02 Thread A J
Are there any details on how much of an issue fragmentation is with Cassandra? With all the merging and deletes that happen during compactions, what does the disk fragmentation look like over time? Any rules of thumb on how frequently and how to defrag? Thanks.

Re: Seed Nodes

2011-03-02 Thread Aaron Morton
The seed list is also used as part of the Gossip round http://wiki.apache.org/cassandra/ArchitectureGossip I agree it's not critical to ensure all running nodes have the same running seed list. But it's definitely a good idea. Disclaimer: Learning more about the gossip protocol is on my to-do list

Re: Seed Nodes

2011-03-02 Thread Nick Bailey
You do not need to restart when changing the seedlist of nodes that are up. The seedlist is really only used when a node first starts. So if you modify the list of seeds the changes will take effect whenever that node restarts, exactly when they need to. In 0.7 you can basically get away with igno

Re: cassandra.yaml

2011-03-02 Thread Aaron Morton
All nodes should have the correct configuration files. The only difference should be the initial token. Aaron On 3/03/2011, at 6:18 AM, A J wrote: > Hello, > I am trying to set up a cluster (for the first time) of a few nodes. Had > a few questions related to that. > > I want the following prop

Re: dropping keyspace in cassandra

2011-03-02 Thread Aaron Morton
What exactly was the folder that was left in place? Dropping a keyspace does not physically delete the data immediately. Aaron On 3/03/2011, at 1:01 AM, Sagar Kohli wrote: > Hi , > > I am using Cassandra .70, I have dropped a sample keyspace using command > “drop keyspac

Re: Storing photos, images, docs etc.

2011-03-02 Thread mcasandra
Thanks! Please let me know if others have more suggestions. Overall, the feeling I get is to keep the images/docs out of Cassandra. Flickr and Facebook seem to use MySQL for metadata, with the actual photos stored somewhere else. Looks like I need to search for a hosting platform where data can be stored

Integrating Cassandra with other projects/products

2011-03-02 Thread Jeremy Hanna
I started a wiki page for those wishing to let people in the community know about projects/products that integrate with Cassandra. http://wiki.apache.org/cassandra/IntegrationPoints So far listed there are projects like Hadoop (including Pig and Hive), Solr/Lucene, Flume, and Scribe. If you wo

cassandra.yaml

2011-03-02 Thread A J
Hello, I am trying to set up a cluster (for the first time) of a few nodes. Had a few questions related to that. I want the following properties in my cluster: 1. Not to use RP but BOP 2. Specify initial token myself on each node. 3. Change a few memtable defaults. 4. For Keyspaces to use NetworkTop

Re: Issues connecting from outside of localhost

2011-03-02 Thread David McNelis
In case anyone is interested: our problem came down to one machine having the phpcassa Thrift patch while the other did not. It's resolved now. On Wed, Mar 2, 2011 at 10:25 AM, David McNelis wrote: > It looks like we are having an issue with the Thrift installation on the > 'other' machine. W

Re: Issues connecting from outside of localhost

2011-03-02 Thread David McNelis
It looks like we are having an issue with the Thrift installation on the 'other' machine. We spun up a cassandra instance on that machine and were unable to connect locally through php, but I was able to connect from the original cassandra machine with PHP. That sounds convoluted...but in short,

Re: Storing photos, images, docs etc.

2011-03-02 Thread A J
>>What are other options then << Several. 1. Mogilefs. Stores on filesystem but metadata in database (MySQL or Postgres). Also has redundancy built in. Does not require RAID. No SPOF. But I think it has too many moving parts and requires a few more boxes than cassandra. 2. Ofcourse the good old Blo

Re: Is it possible to get list of row keys?

2011-03-02 Thread Eric Charles
OK, got it. Tks for the explanation, - Eric On 2/03/2011 14:45, Ching-Cheng Chen wrote: Stable order mean the keys will return in some kind of ordering (not alphanumeric, not numeric,), just in certain order. For example, if you use setKeys("","") as first batch call and you got following res

Re: Issues connecting from outside of localhost

2011-03-02 Thread David McNelis
We are able to telnet to port 9160, and didn't have any issues resolving the target along those lines. So at this point, I don't think we're looking at a firewall / network topology issue. On Wed, Mar 2, 2011 at 8:54 AM, Sasha Dolgy wrote: > i had a similar issue on windows and it came down to

Compaction Thresholds Discrepancies In Tools

2011-03-02 Thread Jeremy.Truelove
When I do a bin/nodetool -h localhost getcompactionthreshold MyKeySpace MyColumnFamily I get the following (which is what I set the CF to): Current compaction thresholds for MyKeySpace / MyColumnFamily: min = 4, max = 24. When I run cassandra-cli for my column family I see Compaction min/max

Re: Issues connecting from outside of localhost

2011-03-02 Thread Sasha Dolgy
i had a similar issue on windows and it came down to php not being able to resolve the target ... the solution was to add an entry to the hosts file ... of course, if there is a firewall blocking ... that's your problem. can you telnet from remote server to cassandra server on port 9160? On Wed,

Issues connecting from outside of localhost

2011-03-02 Thread David McNelis
Morning folks, I'm not sure if this is an issue with my setup of Cassandra, or with phpCassa. There is an open question on StackOverflow that is very similar to what I am experiencing (1). In short, I can connect using PHP from the same machine that Cassandra is running on. I can connect with

Re: Is it possible to get list of row keys?

2011-03-02 Thread Ching-Cheng Chen
Stable order means the keys will be returned in some fixed ordering (not alphanumeric, not numeric), just in a certain order. For example, if you use setKeys("","") as the first batch call and you get the following result: keyabc key456 keydsg key8jkg keyag87 key45s ... Then if you call with setKeys("keydsg"
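The paging idea described above can be sketched roughly as follows. This is only an illustration, assuming that a range query started at a given key returns that key first and then continues in the same stable order. The names `page_keys`, `fake_fetch` and `STORED_ORDER` are hypothetical stand-ins, not part of any Cassandra client API.

```python
from typing import Callable, Iterator, List


def page_keys(fetch: Callable[[str, int], List[str]], page_size: int = 3) -> Iterator[str]:
    """Yield every key once by restarting each page at the last key seen."""
    if page_size < 2:
        raise ValueError("page_size must be at least 2 to make progress")
    start = ""  # empty start key means "from the beginning"
    first_page = True
    while True:
        page = fetch(start, page_size)
        # Every page after the first begins with the key already yielded,
        # so skip that one overlapping entry.
        fresh = page if first_page else page[1:]
        for key in fresh:
            yield key
        if len(page) < page_size:
            return  # a short page means the range is exhausted
        start = page[-1]
        first_page = False


# Hypothetical in-memory stand-in for the cluster: a fixed, non-sorted order.
STORED_ORDER = ["keyabc", "key456", "keydsg", "key8jkg", "keyag87", "key45s"]


def fake_fetch(start: str, count: int) -> List[str]:
    """Return up to `count` keys beginning at `start` (inclusive)."""
    begin = STORED_ORDER.index(start) if start else 0
    return STORED_ORDER[begin:begin + count]


all_keys = list(page_keys(fake_fetch, page_size=3))
```

Because the order is stable between calls, restarting at the last key and dropping the one overlapping entry visits each key exactly once, even though the order itself is not meaningful to the caller.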

dropping keyspace in cassandra

2011-03-02 Thread Sagar Kohli
Hi, I am using Cassandra .70. I dropped a sample keyspace using the command "drop keyspace ". The command executed successfully, but when I checked, the keyspace folder still exists. Is there any other process to delete the data from the file system, or does Cassandra do it automatically? Thanks in advan

Re: Is it possible to get list of row keys?

2011-03-02 Thread Eric Charles
Hi, I'm also facing the need to retrieve all row keys. What do you mean by "stable" order? From this thread, I understand that the paging method with RandomPartitioner will return all keys (shuffled, but with no missing key and no duplicate key). This seems to have been said already, but I prefer to double-check... Tks

Re: Advice on a design

2011-03-02 Thread Burc Sade
You can use the PHP Solr Extension. It is a fully featured and light-weight client. http://www.php.net/manual/en/book.solr.php Without secondary indexes on columns in CFs within SCFs, the best approach at the moment is to create query-specific CFs. In the end it all comes down to how simple you can

Re: limit on rows in a cf

2011-03-02 Thread Sylvain Lebresne
On Tue, Mar 1, 2011 at 10:36 PM, Shaun Cutts wrote: > This isn't quite true, I think. RandomPartitioner uses MD5. So if you had > 10^16 rows, you would have a 10^-6 chance of a collision, according to > http://en.wikipedia.org/wiki/Birthday_attack ... and apparently MD5 isn't > quite balanced, so
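The collision estimate quoted above comes from the birthday bound. A minimal sketch of that approximation follows, assuming an ideal, uniformly distributed 128-bit hash (the quoted message notes that MD5 is not quite balanced, so real figures would differ). The function name and the row count are illustrative only.

```python
import math


def collision_probability(n_items: float, hash_bits: int) -> float:
    """Approximate chance of at least one collision among n_items uniform hashes."""
    space = 2.0 ** hash_bits
    # Birthday approximation: p ~= 1 - exp(-n(n-1) / (2 * space)).
    exponent = -n_items * (n_items - 1) / (2.0 * space)
    # expm1 keeps precision when the probability is very small.
    return -math.expm1(exponent)


# Illustrative row count taken from the quoted message.
p_rows = collision_probability(1e16, 128)
print(f"approximate collision probability: {p_rows:.3e}")
```

The probability grows roughly with the square of the row count, so a tenfold increase in rows makes a collision about a hundred times more likely while the probability is still small.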

Re: Storing photos, images, docs etc.

2011-03-02 Thread Norman Maurer
2011/3/2 Peter Schuller : >> Is it advisable or ok to store photos, images and docs in cassandra where you >> expect high volume of uploads and views? > > To diverge a bit from the direction the thread is going: You can > definitely store large files in Cassandra. I would recommend against > doing

Re: backup strategies

2011-03-02 Thread Sasha Dolgy
ah, not sure how i didn't see this. sorry! Is anyone actively using this approach? Have you recovered from a failure ? Was it pain free? -sd On Tue, Mar 1, 2011 at 8:58 PM, Aaron Morton wrote: > Does this help http://wiki.apache.org/cassandra/Operations#Backing_up_data > > Aaron > > On 2/03/

Re: Storing photos, images, docs etc.

2011-03-02 Thread Sasha Dolgy
I took the advice from previous threads and use cassandra to hold pointers to the files that are uploaded and other meta information. Amazon S3 can be quite simple and pain free at times and was a great cost-effective place for me to keep the large files... i have had some great success already wit

Re: Storing photos, images, docs etc.

2011-03-02 Thread Peter Schuller
> Is it advisable or ok to store photos, images and docs in cassandra where you > expect high volume of uploads and views? To diverge a bit from the direction the thread is going: You can definitely store large files in Cassandra. I would recommend against doing so by simply smacking entire files

Re: Advice on a design

2011-03-02 Thread Vodnok
I also think it'll be easier via Solr. Just need to google it. (if you have links about Solr in PHP...) I realize that I have to remove some dimension from my CF... I thought it was possible to have SCF -> CF -> SC -> C:value having secondary index on C but as I understood, secondary index on C on