Re: Cassandra event notification on INSERT/DELETE of records

2016-05-24 Thread Mark Reddy
+1 to what Eric said, a queue is a classic C* anti-pattern. Something like Kafka or RabbitMQ might fit your use case better. Mark On 24 May 2016 at 18:03, Eric Stevens wrote: > It sounds like you're trying to build a queue in Cassandra, which is one > of the classic anti-pattern use cases for

Re: Record deletion job

2015-06-21 Thread Mark Reddy
No, this is what TTL is for. If you want to schedule a cron to delete data this will need to be an external task. On 19 June 2015 at 07:41, anil_ah wrote: > Hi >Is their any way to schedule a job in cassandra to delete the recrods > which are older than a specific time period. > > Excludin

Re: connection refused error for cassandra-cli

2015-05-26 Thread Mark Reddy
By default the cli will connect to localhost so you don't even need to specify a host (-h). You should only need to specify a host if connecting to a remote host or locally on a different ip than localhost (127.0.0.1). Mark On 26 May 2015 at 15:37, Mahmood Naderan wrote: > Thanks. Instead of c

Re: connection refused error for cassandra-cli

2015-05-26 Thread Mark Reddy
Have you configured the rpc_address to listen on that ip? You can check your nodes casandra.yaml for the rpc_address setting. On a side node why are you testing on such an old version of Cassandra? The latest versions of 1.2.x is 1.2.19. There are also two branches of 2.x which are 2.0.15 and 2.1.

Re: java.io.IOException: Failed during snapshot creation

2015-05-25 Thread Mark Reddy
Can you check your logs for any other other error message around the time of the repair? Something to look for would be "Error occurred during snapshot phase". Regards, Mark On 25 May 2015 at 14:56, Sachin PK wrote: > Hey I'm new to Cassandra ,I have 4 node cluster with each node 16GB VPS, > i

Re: Drop/Create table with same CF Name

2015-05-21 Thread Mark Reddy
Yes, it's a known issue. For more information on the topic see this support post from DataStax: https://support.datastax.com/hc/en-us/articles/204226339-How-to-drop-and-recreate-a-table-in-Cassandra-versions-older-than-2-1 Mark On 21 May 2015 at 15:31, Ken Hancock wrote: > > We've been running

Re: Disabling auto snapshots

2015-05-21 Thread Mark Reddy
To disable auto snapshots, set the property auto_snapshot: false in your cassandra.yaml file. Mark On 21 May 2015 at 08:30, Ali Akhtar wrote: > Is there a config setting where automatic snapshots can be disabled? I > have a use case where a table is truncated quite often, and would like to > no

Re: Steps to do after schema changes

2015-03-12 Thread Mark Reddy
It's always good to run "nodetool describecluster" after a schema change, this will show you all the nodes in your cluster and what schema version they have. If they have different versions you have a schema disagreement and should follow this guide to resolution: http://www.datastax.com/documentat

Re: unsubscribe

2015-02-22 Thread Mark Reddy
Raman, To unsubscribe send a mail to user-unsubscr...@cassandra.apache.org On 22 February 2015 at 03:59, Raman wrote: > unsubscribe >

Re: Why no virtual nodes for Cassandra on EC2?

2015-02-20 Thread Mark Reddy
Hey Clint, Someone for DataStax can correct me here, but I'm assuming that they have disabled vnodes because the AMI is built to make it easy to set up a pre-configured mixed workload cluster. A mixture of Real-Time/Transactional (Cassandra), Analytics (Hadoop), or Search (Solr). If you take a loo

Re: Node joining take a long time

2015-02-19 Thread Mark Reddy
What is a long time in your scenario? What is the data size in your cluster? I'm sure Rob will be along shortly to say that 2.1.2 is, in his opinion, broken for production use...an opinion I'd agree with. So bare that in mind if you are running a production cluster. Regards, Mark On 19 February

Re: Cancel subscription

2015-02-19 Thread Mark Reddy
Please use user-unsubscr...@cassandra.apache.org to unsubscribe from this mailing list. Thanks Regards, Mark On 19 February 2015 at 09:14, Hilary Albutt - CEO < hil...@incrediblesoftwaresolutions.com> wrote: > Cancel subscription >

Re: run cassandra on a small instance

2015-02-19 Thread Mark Reddy
What does your schema look like, your total data size and your read/write patterns? Maybe you are simply doing a heavier workload than a small instance can handle. Regards, Mark On 19 February 2015 at 08:40, Carlos Rolo wrote: > I have Cassandra instances running on VMs with smaller RAM (1GB e

Re: Cassandra install on JRE vs JDK

2015-02-18 Thread Mark Reddy
t; > > On Wed, Feb 18, 2015 at 11:58 AM, Mark Reddy > wrote: > >> Yes you can use Oracle JDK if your prefer, I've been using the JDK with >> Cassandra in production for years without issue. >> >> Regards, >> Mark >> >> On 18 February 2015

Re: Cassandra install on JRE vs JDK

2015-02-18 Thread Mark Reddy
Yes you can use Oracle JDK if your prefer, I've been using the JDK with Cassandra in production for years without issue. Regards, Mark On 18 February 2015 at 19:49, cass savy wrote: > Can we install Oracle JDK instead of JRE in Cassandra servers? We have > few clusters running JDK when we upgr

Re: High GC activity on node with 4TB on data

2015-02-08 Thread Mark Reddy
Hey Jiri, While I don't have any experience running 4TB nodes (yet), I would recommend taking a look at a presentation by Arron Morton on large nodes: http://planetcassandra.org/blog/cassandra-community-webinar-videoslides-large-nodes-with-cassandra-by-aaron-morton/ to see if you can glean anythin

Re: Compacted_at timestamp

2015-02-08 Thread Mark Reddy
Hi Santo, If you are seeing the compacted_at value as timestamp and want to convert it to a human readable date, this is not possible via nodetool. You will always write a script to make the compactionhistory call and then convert the output (the fourth column - compacted_at) to a readable date.

Re: Case Study for Learning Cassandra

2015-02-05 Thread Mark Reddy
Hey Krish, Here is a list of use cases from the industry: http://planetcassandra.org/apache-cassandra-use-cases/ You may find some examples on how people/companies are using and running their C* clusters. Also DataStax provide online training that will get you up to speed on several aspects of C*

Re: Error while starting Cassandra for the first time

2015-02-05 Thread Mark Reddy
rectory and saved_caches > directory and after that it worked. > > Thanks a lot for the help ... > > On Wed, Feb 4, 2015 at 2:32 PM, Mark Reddy wrote: > >> INFO 22:17:19 Loading settings from file:/home/csduser/cassandra/ >>> conf/cassandra.yaml

Re: Newly added column not visible

2015-02-05 Thread Mark Reddy
Hey Saurabh, I can't say that I have experienced this before, however if you can reliably reproduce the issue it would be worth commenting on the JIRA issue you linked to or alternatively creating a new JIRA with as much info (setup, test case, debug logs, etc) as possible. Regards, Mark On 5 Fe

Re: Error while starting Cassandra for the first time

2015-02-04 Thread Mark Reddy
> > INFO 22:17:19 Loading settings from file:/home/csduser/cassandra/ > conf/cassandra.yaml > ERROR 22:17:20 Fatal configuration error > org.apache.cassandra.exceptions.ConfigurationException: Invalid yaml You have an malformed cassandra.yaml config file that is resulting in Cassandra not being

Re: Question about use scenario with fulltext search

2015-02-01 Thread Mark Reddy
> > If you have a list of usernames stored in your cassandra database, > how could you find all usernames starting with 'Jo'? Cassandra does not support full text search on its own, if you are looking into DataStax enterprise Cassandra there is an integration with Slor that gives you this functio

Re: max ttl for column

2014-11-21 Thread Mark Reddy
Hi Rajanish, Cassandra imposes a max TTL of 20 years. public static final int MAX_TTL = 20 * 365 * 24 * 60 * 60; // 20 years in > seconds See: https://github.com/apache/cassandra/blob/8d8fed52242c34b477d0384ba1d1ce3978efbbe8/src/java/org/apache/cassandra/db/ExpiringCell.java#L37 Mark On 21

Re: [jira] Akhtar Hussain shared a search result with you

2014-11-21 Thread Mark Reddy
I believe you were attempting to share: https://issues.apache.org/jira/browse/CASSANDRA-8352 Your cassandra logs outputs the following: > DEBUG [Thrift:4] 2014-11-20 15:36:50,653 CustomTThreadPoolServer.java > (line 204) Thrift transport error occurred during processing of message. > org.apache.t

Re: A questiion to adding a new data center

2014-11-21 Thread Mark Reddy
Hi Boying, I'm not sure I fully understand your question here, so some clarification may be needed. However, if you are asking what steps need to be performed on the current datacenter or on the new datacenter: Step 1 - Current DC Step 2 - New DC Step 3 - Depending on the snitch you may need to m

Re: any way to get nodetool proxyhistograms data for an entire cluster?

2014-11-20 Thread Mark Reddy
> > To be clear, I don't believe DSE is required to use OpsCenter. My > understanding is that it can be used with vanilla Apache Cassandra, but I > have never actually tried to do so. Correct Rob, you can use Opscenter with vanilla Apache Cassandra. Mark On 20 November 2014 16:10, Robert Coli

Re: logging over multi-datacenter

2014-11-20 Thread Mark Reddy
November 2014 14:40, Adil wrote: > cassandra version 2.1.2 > with the default user > we create another user and with this one we could login even if only one > node is up > > 2014-11-20 15:16 GMT+01:00 Mark Reddy : > >> Hi Adil, >> >> What Cassandra versio

Re: logging over multi-datacenter

2014-11-20 Thread Mark Reddy
Hi Adil, What Cassandra version are you using? Are you using the default user or a non-default user? Mark On 20 November 2014 08:20, Adil wrote: > Hi, > We have two data-center, we configured PasswordAuthenticator on each node, > we increment the RF of system_auth to the number of nodes (each

Re: What time range does nodetool cfhistograms use?

2014-11-16 Thread Mark Reddy
Hi Clint, The values of SSTables, Write Latency and Read Latency will be reset on node start/restart and after running the cfhistograms command itself. The values of Row Size and Column Count are calculated at startup and then re-evaluated during compaction. Mark On 16 November 2014 17:12, Cl

Re: Netstats > 100% streaming

2014-11-03 Thread Mark Reddy
HI Eric, It looks like you are running into CASSANDRA-7878 which is fixed in 2.0.11 / 2.1.1 Mark On 1 November 2014 14:08, Eric Stevens wrote: > We've been commissioning some new nodes on a 2.0.10 community edition > cluster, and we're se

Re: OldGen saturation

2014-10-28 Thread Mark Reddy
Hi Adrià, We have about 50.000 CFs of varying size Before I read any further, having 50,000 CFs is something that I would highly discourage. Each column family is allocated 1MB of available memory ( CASSANDRA-2252 ) so having anything over a

Re: EC2 Snitch load imbalance

2014-10-28 Thread Mark Reddy
Oleg, If you are running nodetool status, be sure to specify the keyspace also. If you don't specify the keyspace the results will be nonsense. https://issues.apache.org/jira/browse/CASSANDRA-7173 Regards, Mark On 28 October 2014 10:35, Oleg Dulin wrote: > I have a setup with 6 cassandra nod

Re: Bulk load in cassandra

2014-08-27 Thread Mark Reddy
Hi, You can use the COPY command to load in data from a local file, however it needs to be a csv formatted file. For more info see http://www.datastax.com/documentation/cql/3.0/cql/cql_reference/copy_r.html Regards, Mark On 27 August 2014 13:13, Malay Nilabh wrote: > Hi > > I installed Cass

Re: Can't Add AWS Node due to /mnt/cassandra/data directory

2014-08-27 Thread Mark Reddy
Hi stephen, I have never added a node via OpsCenter, so this may be a short coming of that process. However in non OpsCenter installs you would have to create the data directories first: sudo mkdir -p /mnt/cassandra/commitlog sudo mkdir -p /mnt/cassandra/data sudo mkdir -p /mnt/cassandra/saved_ca

Re: Installing Cassandra Multinode on CentOs coming up with exception

2014-08-26 Thread Mark Reddy
.CassandraDaemon.activate(CassandraDaemon.java:496) > at > org.apache.cassandra.service.CassandraDaemon.main(CassandraDaemon.java:585) > Missing required directive CommitLogSync > Fatal configuration error; unable to start. See log for stacktrace. > > Do you have any idea about this. > &g

Re: OpsCenter_rollups*

2014-08-26 Thread Mark Reddy
> > Why is there so much OpsCenter work happening? Opscenter stores a lot of information regarding all aspects of your cluster, such as OS, cluster, keyspace and individual table metrics, after a set period of time these granular data point are rolled up into aggregates. This is what you are seei

Re: Installing Cassandra Multinode on CentOs coming up with exception

2014-08-26 Thread Mark Reddy
It is telling you that your yaml is invalid, from looking at the snippet you have provided it looks like the seed_provider.parameters is not correctly indented, it should look something like: seed_provider: - class_name: org.apache.cassandra.locator.SimpleSeedProvider parameters: -

Re: Secondary index or dedicated CF?

2014-08-22 Thread Mark Reddy
Hi, As a general rule of thumb I would steer clear of secondary indexes, this is also the official stand that DataStax take (see p5 of their best practices doc: http://www.datastax.com/wp-content/uploads/2014/04/WP-DataStax-Enterprise-Best-Practices.pdf). “It is best to avoid using Cassandra's

Re: 答复: can not query data from cassandra

2014-08-20 Thread Mark Reddy
Hi, It looks like you are running into a known issue: https://issues.apache.org/jira/browse/CASSANDRA-7780 and its being worked on here: https://issues.apache.org/jira/browse/CASSANDRA-7145 Mark On 20 August 2014 09:06, 鄢来琼 wrote: > Is it related to MAX_HEAP_SIZE and HEAP_NEWSIZE? > > My sy

Re: Cassandra Consistency Level

2014-08-19 Thread Mark Reddy
Hi, As you are writing as CL.ONE and cqlsh by default reads at CL.ONE, there is a probability that you are reading stale data i.e. the node you have contacted for the read may not have the most recent data. If you have a higher consistency requirement, you should look at increasing your consistenc

Re: Cassandra Wiki Immutable?

2014-08-18 Thread Mark Reddy
Hi Otis, On the front page, https://wiki.apache.org/cassandra/FrontPage there are instructions on how to get edit permissions: "If you would like to contribute to this wiki, please send an email to the mailing list dev.at.cassandra.apache-dot-org with your wiki username and we will be happy to ad

Re: Bootstrap failures: unable to find sufficient sources for streaming range

2014-08-15 Thread Mark Reddy
Hi Peter, At the time of the IllegalStateException, do you see the node that it should be streaming from marked as down by the failure detector? Mark On Fri, Aug 15, 2014 at 5:45 AM, Peter Haggerty wrote: > When adding nodes via bootstrap to a 27 node 2.0.9 cluster with a > cluster-wide phi_

Re: question about OpsCenter agent

2014-08-14 Thread Mark Reddy
Hi Clint, You need to configure the datastax-agents so they know what machine OpsCenter is running on. To do this you will need to edit the address.yaml of the datastax-agent, located in /var/lib/datastax-agent/conf/. In this file you need to add the following line: stomp_interface: This will

Re: A question to nodetool removenode command

2014-08-14 Thread Mark Reddy
Hi, Gossip will propagate to all nodes in a cluster. So if you have a cluster spanning DC1, DC2 and DC3 and you then remove all nodes in DC3 via nodetool removenode from a node in DC1, all nodes in both DC1 and DC2 will be informed of the nodes removal so no need to run it from a node in DC2. Ma

Re: How to prevent the removed DC comes back automactically?

2014-08-14 Thread Mark Reddy
> > How can we prevent a disconnected DC from coming back automatically? You could use firewall rules to prevent the disconnected DC from contacting your live DCs when it becomes live again Mark On Thu, Aug 14, 2014 at 6:48 AM, Lu, Boying wrote: > Hi, All, > > > > We are using Cassandra 2.0

Re: Nodetool Repair questions

2014-08-12 Thread Mark Reddy
Hi Vish, 1. This tool repairs inconsistencies across replicas of the row. Since > latest update always wins, I dont see inconsistencies other than ones > resulting from the combination of deletes, tombstones, and crashed nodes. > Technically, if data is never deleted from cassandra, then nodetool

Re: Cassandra corrupt column family

2014-08-12 Thread Mark Reddy
Hi, Without more information (Cassandra version, setup, topology, schema, queries performed) this list won't be able to assist you. If you can provide a more detailed explanation of the steps you took to reach your current state that would be great. Mark On Tue, Aug 12, 2014 at 12:21 PM, Batra

Re: Cassandra bootstrap error: java.lang.IllegalStateException: No sources found

2014-08-11 Thread Mark Reddy
Hi, When you configured the new node, did you correctly configure your seeds and listen address? Mark On Mon, Aug 11, 2014 at 6:12 PM, Rahul Gupta wrote: > I have a 3 node Cassandra cluster and I am adding a new node when I get > below error during bootstrap. > > Running DataStax ver 4.5 (C

Re: clarification on 100k tombstone limit in indexes

2014-08-10 Thread Mark Reddy
means quantitatively. As written, the docs make it sound dangerous to > create an index on a column that is *ever* deleted or updated since there > is no sense of how frequent is "too frequent". > > Cheers, > Ian > > > > > > > > On Sun, Aug 10

Re: clarification on 100k tombstone limit in indexes

2014-08-10 Thread Mark Reddy
Hi Ian, The issues here, which relates to normal and index column families, is scanning over a large number of tombstones can cause Cassandra to fall over due to increased GC pressure. This pressure is caused because tombstones will create DeletedColumn objects which consume heap. Also these Delet

Re: Unable to connect to cassandra server on EC2?

2014-08-10 Thread Mark Reddy
Hi, While I have no direct experience with the Python driver itself I took a quick and it uses Cassandra's native transport protocol, so setting the port to 9160 (Thrift protocol) won't work. You will need to set it to the native transport port, which is 9042. Also make sure that you have the rpc

Re: Node stuck during nodetool rebuild

2014-08-05 Thread Mark Reddy
Hi Vasilis, To further on what Rob said I believe you might be able to tune the phi detector threshold to help this > operation complete, hopefully someone with direct experience of same will > chime in. I have been through this operation where streams break due to a node falsely being marked d

Re: Node bootstrap

2014-08-05 Thread Mark Reddy
Compaction iprod customerorder 1682804084 >> 17956558077 bytes 9.37% >>Compactionprodgatecustomerorder >> 1664239271 1693502275 bytes98.27% >>Compaction qa_config_bkupfixsessionconfig_hist >> 2443

Re: A question about using 'update keyspace with strategyoptions' command

2014-08-05 Thread Mark Reddy
> > So the ‘strategy’ change may not be seen by all nodes when the ‘upgrade > keyspace …’ command returns and I can use ’describe cluster’ to check if > the change has taken effect on all nodes right? Correct, the change may take time to propagate to all nodes. As Rahul said you can check describ

Re: Node bootstrap

2014-08-05 Thread Mark Reddy
Active Pending Completed Blocked >>> All time blocked >>> FlushWriter 0 0 1136 0 >>> 512 >>> >>> Looks like about 50% of flushes are blocked. >>> >>>

Re: Node bootstrap

2014-08-05 Thread Mark Reddy
> > My understanding was that if initial_token is left empty on the new node, > it just contacts the heaviest node and bisects its token range. If you are using vnodes and you have num_tokens set to 256 the new node will take token ranges dynamically. What is the configuration of your other nodes

Re: Make an existing cluster multi data-center compatible.

2014-08-05 Thread Mark Reddy
Yes, you must run a full repair for the reasons stated in the yaml file. Mark On Tue, Aug 5, 2014 at 11:52 AM, Rene Kochen wrote: > Hi all, > > I want to add a data-center to an existing single data-center cluster. > First I have to make the existing cluster multi data-center compatible. > >

Re: Reasonable range for the max number of tables?

2014-08-05 Thread Mark Reddy
Hi Phil, In theory, the max number of column families would be in the low number of hundreds. In practice the limit is related the amount of heap you have, as each column family will consume 1 MB of heap due to arena allocation. To segregate customer data, you could: - Use customer specific colum

Re: `system` keyspace replication

2014-07-31 Thread Mark Reddy
Hi Jens, The system keyspace is configured with LocalStrategy, this strategy only stores data on the the local node. This strategy is reserved for internal use only and is used for other things such as SecondaryIndexes. You cannot change the replication factor of the 'system' keyspace. If you att

Re: bootstrapping new nodes on 1.2.12

2014-07-30 Thread Mark Reddy
y, what negative side effects can there be? > > > > Parag > > > > > > *From:* Mark Reddy [mailto:mark.re...@boxever.com] > *Sent:* Wednesday, July 30, 2014 9:06 AM > > *To:* user@cassandra.apache.org > *Subject:* Re: bootstrapping new nodes on 1.2.12 > > &

Re: bootstrapping new nodes on 1.2.12

2014-07-30 Thread Mark Reddy
n future versions? > > 2) We have too much data to migrate to run on non-production hours. > How do we make it such that full GC’s don’t impact bootstrapping? Should > we increase phi_convict_threshold ? > > > > Parag > > > > > > *From:* Mark R

Re: bootstrapping new nodes on 1.2.12

2014-07-30 Thread Mark Reddy
al order, it’s currently streaming from the last 2. > > > > *From:* Mark Reddy [mailto:mark.re...@boxever.com] > *Sent:* Wednesday, July 30, 2014 4:42 AM > *To:* user@cassandra.apache.org > *Subject:* Re: bootstrapping new nodes on 1.2.12 > > > > Hi Parag, > > > > 1)

Re: bootstrapping new nodes on 1.2.12

2014-07-30 Thread Mark Reddy
Hi Parag, 1) Would anyone be able to help me interrupt this information from > OpsCenter? At a high level bootstrapping a new node has two phases, streaming and secondary index builds. I believe OpsCenter will only report active streams, the pending stream will be listed as such in OpsCente

Re: nodetool status returning negative load values on 2.1.0-rc4

2014-07-29 Thread Mark Reddy
Looks like you are running into this issue: https://issues.apache.org/jira/browse/CASSANDRA-7239 Mark On Tue, Jul 29, 2014 at 5:32 PM, Jeronimo de A. Barros < jeronimo.bar...@gmail.com> wrote: > Hi everyone, > > After upgrade and clean install on a 5 nodes cluster: > > Datacenter: datacenter

Re: Full GC in cassandra

2014-07-28 Thread Mark Reddy
What is your data size and number of columns in Cassandra. Do you do many deletions? On Mon, Jul 28, 2014 at 2:53 PM, Ruchir Jha wrote: > Really curious to know what's causing the spike in Columns and > DeletedColums below : > > > 2014-07-28T09:30:27.471-0400: 127335.928: [Full GC 127335.928: [

Re: Cassandra version upgrade path

2014-07-26 Thread Mark Reddy
Check https://github.com/apache/cassandra/blob/trunk/NEWS.txt to see the steps required, if any, when upgrading. As a general practice you should also reviw the changes at https://github.com/apache/cassandra/blob/trunk/CHANGES.txt If upgrading major versions it is often recommended to be on the la

Re: do all nodes actually send the data to the coordinator when doing a read?

2014-07-25 Thread Mark Reddy
data (cell value) from both nodes but >> if the cell name have different timestamp would it still request a full >> data read to the node having the most recent time ? >> >> >> On Fri, Jul 25, 2014 at 11:25 PM, Mark Reddy >> wrote: >> >>> Hi Brian

Re: do all nodes actually send the data to the coordinator when doing a read?

2014-07-25 Thread Mark Reddy
Hi Brian, A read request will be handled in the following manner: Once the coordinator receives a read request it will firstly determine the replicas responsible for the data. From there those replicas are sorted by "proximity" to the coordinator. The closest node as determined by proximity sorti

Re: How to get rid of stale info in gossip

2014-07-25 Thread Mark Reddy
After removing a node, it's information can persist in the Gossiper for up to 3 days, after which time it should be removed. Are you having issues with a removed node state persisting for longer? Mark On Fri, Jul 25, 2014 at 11:33 AM, Rahul Neelakantan wrote: > Is there a way to get rid of s

Re: What is C*?

2014-07-24 Thread Mark Reddy
Yes you are correct, Cassandra is often abbreviated as C*. With most languages and applications being referenced by their acronym / abbreviation, I guess you just have to pick one that is available. I assume if someone creates a new language and wants to name it C*, they will see that it is taken a

Re: When will a node's host ID change?

2014-07-23 Thread Mark Reddy
It won't. Cassandra stores a node's host ID in an end-point to host ID mapping, each ID must be unique and cannot be changed after the fact. Mark On Mon, Jul 21, 2014 at 4:44 PM, John Sanda wrote: > Under what circumstances, if any, will a node's host ID change? > > - John >

Re: Why is the cassandra documentation such poor quality?

2014-07-23 Thread Mark Reddy
Hi Kevin, The difference here is that the Apache Cassandra site is maintained by the community whereas the DataStax site is maintained by paid employees with a vested interest in producing documentation. With DataStax having some comprehensive docs, I guess the desire for people to maintain the A

Re: Case Study from Migrating from RDBMS to Cassandra

2014-07-22 Thread Mark Reddy
PlanetCassandra has a collection of migration use cases: http://planetcassandra.org/mysql-to-cassandra-migration/ http://planetcassandra.org/oracle-to-cassandra-migration/ If they don't contain the information you need I'm sure you could reach out to the companies involved or DataStax itself to g

Re: Only one node in Cassandra AMI cluster

2014-07-15 Thread Mark Reddy
.com/documentation/cassandra/2.0/cassandra/install/installAMISecurityGroup.html > > > > > > On Tuesday, July 15, 2014 6:43 PM, Mark Reddy > wrote: > > > Have you setup your security group correctly so that all nodes can > communicate. Is there anything in the logs to

Re: Only one node in Cassandra AMI cluster

2014-07-15 Thread Mark Reddy
Have you setup your security group correctly so that all nodes can communicate. Is there anything in the logs to suggest nodes could not communicate to each other? When you log into the other instances, is Cassandra running correctly? On Tue, Jul 15, 2014 at 6:28 PM, Lars Schouw wrote: > When

Re: Small bug on Cassandra 2.1.0-rc3 and fix

2014-07-11 Thread Mark Reddy
gt; CLASSPATH="$CLASSPATH:$EXTRA_CLASSPATH" > > > # set JVM javaagent opts to avoid warnings/errors > if [ "$JVM_VENDOR" != "OpenJDK" -o "$JVM_VERSION" \> "1.6.0" ] \ > || [ "$JVM_VERSION" = "1.6.0" -a &q

Re: UnavailableException

2014-07-11 Thread Mark Reddy
Can you post the output of nodetool status and your Astyanax connection settings? On Fri, Jul 11, 2014 at 9:06 PM, Ruchir Jha wrote: > This is how we create our keyspace. We just ran this command once through > a cqlsh session on one of the nodes, so don't quite understand what you > mean by "c

Re: Small bug on Cassandra 2.1.0-rc3 and fix

2014-07-11 Thread Mark Reddy
Hi Carlos, I have just downloaded 2.1.0-rc3 to validate this and the cassandra.in.sh contains the correct reference to the jamm-0.2.6.jar This can also been seen on the trunk: https://github.com/apache/cassandra/blob/trunk/bin/cassandra.in.sh I am unsure why you are seeing jamm-0.2.5.jar in your

Re: TTransportException (java.net.SocketException: Broken pipe)

2014-07-08 Thread Mark Reddy
Hi Bhaskar, Can you check your limits using 'ulimit -a'? The default is 1024, which needs to be increased if you have not done so already. Here you will find a list of recommended production settings: http://www.datastax.com/documentation/cassandra/2.0/cassandra/install/installRecommendSettings.h

Re: Disable vnode

2014-07-04 Thread Mark Reddy
. >> Tried a cold restart of the cluster. That also didn't work. >> >> I will try the nodetool rebuild and see whether any change. >> >> Thanks, >> rameez >> >> >> >> On Fri, Jul 4, 2014 at 7:19 PM, Mark Reddy >> wrote: >>

Re: Disable vnode

2014-07-04 Thread Mark Reddy
Hi Rameez, I have never done a migration from vnodes to non-vnodes however I would imagine that the procedure would be the same as its counterpart. As always testing in dev should be done first. To move from vnodes to non-vodes I would add a new datacenter to the cluster with vnodes disabled and

Re: why aren't my nodes show same percentage of ownership

2014-06-30 Thread Mark Reddy
You should run nodetool ring and specify a keyspace, otherwise the ownership information will be nonsense. https://issues.apache.org/jira/browse/CASSANDRA-7173 Mark On Mon, Jun 30, 2014 at 6:13 PM, S C wrote: > When I run "nodetoool ring" I see the ownership with different > percentages. Howe

Re: Index with same Name but different keyspace

2014-05-17 Thread Mark Reddy
Can you share your schema and the commands you are running? On Thu, May 15, 2014 at 7:54 PM, mahesh rajamani wrote: > Hi, > > I am using Cassandra 2.0.5 version. I trying to setup 2 keyspace with same > tables for different testing. While creating index on the tables, I > realized I am not able

Re: Mutation messages dropped

2014-05-16 Thread Mark Reddy
Yes, please see http://wiki.apache.org/cassandra/FAQ#dropped_messages for further details. Mark On Fri, May 9, 2014 at 12:52 PM, Raveendran, Varsha IN BLR STS < varsha.raveend...@siemens.com> wrote: > Hello, > > I am writing around 10Million records continuously into a single node > Cassandra

Re: JDK 8

2014-04-29 Thread Mark Reddy
> > Datastax recommended during a long time (still now ?) to use Java 6 Java 6 is recommended for version 1.2 Java 7 is required for version 2.0 Mark On Tue, Apr 29, 2014 at 10:19 AM, Alain RODRIGUEZ wrote: > Looks like it will be like with the version 7... Cassandra has been > compatible wi

Re: Can the seeds list be changed at runtime?

2014-04-29 Thread Mark Reddy
Hi Boying, >From Datastax documentation: http://www.datastax.com/documentation/cassandra/1.2/cassandra/architecture/architectureGossipAbout_c.html > The seed node designation has no purpose other than bootstrapping the > gossip process for new nodes joining the cluster. Seed nodes are not a > sin

Re: Bootstrap Timing

2014-04-18 Thread Mark Reddy
If all streams have completed, the node could be still rebuilding secondary indexes? Try looking at 'nodetool compactionstats' for this. On Fri, Apr 18, 2014 at 1:05 PM, Phil Burress wrote: > nodetool netstats shows 84 files. They are all at 100%. Nothing showing in > Pending or Active for Read

Re: GC histogram analysis

2014-04-16 Thread Mark Reddy
Do you delete and/or set TTLs on your data? On Wed, Apr 16, 2014 at 2:14 PM, Ruchir Jha wrote: > Hi, > > I am trying to investigate ParNew promotion failures happening routinely > in production. As part of this exercise, I enabled > -XX:PrintHistogramBeforeFullGC and saw the following output. A

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Mark Reddy
k Mishra" wrote: > >> Thanks Mark. does this mean with RF=3, all 3 nodes must be up and running >> for CAS updates? >> >> -Vivek >> >> >> On Wed, Apr 16, 2014 at 6:22 PM, Mark Reddy wrote: >> >>> create keyspace twitter with replic

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Mark Reddy
> > create keyspace twitter with replication = {'class':'SimpleStrategy', > 'replication_factor' : 3} Your replication factor is your issue here, you have a single node and a RF=3. For a single node setup your RF should be 1. You can find more info about replication here: http://www.datastax.com/

Re: How to clear all data using CQL?

2014-04-16 Thread Mark Reddy
To get a list of all keyspaces via CQL you can run: *describe keyspaces *or *select keyspace_name from system.schema_keyspaces;* On Wed, Apr 16, 2014 at 9:23 AM, Sebastian Schmidt wrote: > Hey > > I want to do unit tests for software that uses Cassandra as storage > backend. Using the Hector cl

Re: Datastax OPSCENTER

2014-04-15 Thread Mark Reddy
Datastax have a comprehensive installation and configuration guide here: http://www.datastax.com/documentation/opscenter/4.1/opsc/install/opscInstallation_g.html On Tue, Apr 15, 2014 at 9:13 AM, Aravindan T wrote: > Hi, > > Could anyone please guide me on installation procedure of Cassandra usi

Re: bloom filter + suddenly smaller CF

2014-04-14 Thread Mark Reddy
Hi Will, You can run 'nodetool upgradesstables', this will rewrite the SSTables and regenerate the bloom filters for those tables, This will reduce their usage. Mark On Mon, Apr 14, 2014 at 3:16 PM, William Oberman wrote: > Ah, so I could change the chance value to "poke it". Good to know! >

Re: Cassandra disk usage

2014-04-13 Thread Mark Reddy
ncrease server numbers? >> 2) Still it has 1.5X of overall data how can this be resolved and what is >> reason for that? >> 3) Also i see that data is in different size on all nodes , does that >> means that servers are out of sync??? >> >> Thanks and best regards >>

Re: Cassandra disk usage

2014-04-13 Thread Mark Reddy
What are you storing these 15 chars as; string, int, double, etc.? 15 chars does not translate to 15 bytes. You may be mixing up replication factor and quorum when you say "Cassandra cluster has 3 servers, and data is stored in quorum ( 2 servers )." You read and write at quorum (N/2)+1 where N=to

Re: clearing tombstones?

2014-04-11 Thread Mark Reddy
4-04-11, at 11:20 AM, Paulo Ricardo Motta Gomes < >> paulo.mo...@chaordicsystems.com> wrote: >> >> I have a similar problem here, I deleted about 30% of a very large CF >> using LCS (about 80GB per node), but still my data hasn't shrinked, even if >> I used 1

Re: clearing tombstones?

2014-04-11 Thread Mark Reddy
t;> *horrible*. Now, if I only deleted 10% of the data, waiting to compact >> 450GB until I had another 1.3TB would be rough... >> >> I think your advice is great for people looking for "normal" answers in >> the forum, but I don't think my use case is very

Re: clearing tombstones?

2014-04-11 Thread Mark Reddy
til the SSTable minor compactions build up enough > to consider including the big SSTable in a compaction, making it likely > I'll have to self manage compactions. > > > > On Fri, Apr 11, 2014 at 10:31 AM, Mark Reddy wrote: > >> Correct, a tombstone will only be rem

Re: clearing tombstones?

2014-04-11 Thread Mark Reddy
Correct, a tombstone will only be removed after gc_grace period has elapsed. The default value is set to 10 days which allows a great deal of time for consistency to be achieved prior to deletion. If you are operationally confident that you can achieve consistency via anti-entropy repairs within a

Re: How to replace cluster name without any impact?

2014-04-09 Thread Mark Reddy
What version are you running? As of 1.2.x you can do the following: 1. Start the cqlsh connected locally to the node. 2. Run: update system.local set cluster_name='$CLUSTER_NAME' where key='local'; 3. Run nodetool flush on the node. 4. Update the cassandra.yaml file on the node, changing the clust

  1   2   >