Re: Data Distribution / Replication

2010-08-14 Thread Benjamin Black
#546 #1076 #1169 #1377 etc... On Sat, Aug 14, 2010 at 12:05 PM, Bill de hÓra wrote: > That data suggests the inbuilt tools are a hazard and manual workarounds > less so. > > Can you point me at the bugs? > > Bill > > > On Fri, 2010-08-13 at 20:30 -0700, Benjamin Black wrote: >> Number of bugs I'

Re: Data Distribution / Replication

2010-08-14 Thread Benjamin Black
On Fri, Aug 13, 2010 at 10:13 PM, Stefan Kaufmann wrote: >> My recommendation is to leave Autobootstrap disabled, copy the >> datafiles over, and then run cleanup.  It is faster and more reliable >> than streaming, in my experience. > > I thought about copying da Data manually. However if I have a

Re: Data Distribution / Replication

2010-08-14 Thread Bill de hÓra
That data suggests the inbuilt tools are a hazard and manual workarounds less so. Can you point me at the bugs? Bill On Fri, 2010-08-13 at 20:30 -0700, Benjamin Black wrote: > Number of bugs I've hit doing this with scp: 0 > Number of bugs I've hit with streaming: 2 (and others found more) >

Re: Data Distribution / Replication

2010-08-13 Thread Stefan Kaufmann
> My recommendation is to leave Autobootstrap disabled, copy the > datafiles over, and then run cleanup.  It is faster and more reliable > than streaming, in my experience. I thought about copying da Data manually. However if I have a running environment and add a node (or replace a broken one), h

Re: Data Distribution / Replication

2010-08-13 Thread Benjamin Black
Number of bugs I've hit doing this with scp: 0 Number of bugs I've hit with streaming: 2 (and others found more) Also easier to monitor progress, manage bandwidth, etc. I just prefer using specialized tools that are really good at specific things. This is such a case. b On Fri, Aug 13, 2010 at

Re: Data Distribution / Replication

2010-08-13 Thread Bill de hÓra
On Fri, 2010-08-13 at 09:51 -0700, Benjamin Black wrote: > My recommendation is to leave Autobootstrap disabled, copy the > datafiles over, and then run cleanup. It is faster and more reliable > than streaming, in my experience. What is less reliable about streaming? Bill

Re: Data Distribution / Replication

2010-08-13 Thread Benjamin Black
On Fri, Aug 13, 2010 at 9:48 AM, Oleg Anastasjev wrote: > Benjamin Black b3k.us> writes: > >> > 3. I waited for the data to replicate, which didn't happen. >> >> Correct, you need to run nodetool repair because the nodes were not >> present when the writes came in.  You can also use a higher >> c

Re: Data Distribution / Replication

2010-08-13 Thread Oleg Anastasjev
Benjamin Black b3k.us> writes: > > 3. I waited for the data to replicate, which didn't happen. > > Correct, you need to run nodetool repair because the nodes were not > present when the writes came in. You can also use a higher > consistency level to force read repair before returning data, whi

Re: Data Distribution / Replication

2010-08-12 Thread Benjamin Black
On Thu, Aug 12, 2010 at 8:30 AM, Stefan Kaufmann wrote: > Hello again, > > last day's I started several tests with Cassandra and learned quite some > facts. > > However, of course, there are still enough things I need to > understand. One thing is, how the data replication works. > For my Testing

Data Distribution / Replication

2010-08-12 Thread Stefan Kaufmann
Hello again, last day's I started several tests with Cassandra and learned quite some facts. However, of course, there are still enough things I need to understand. One thing is, how the data replication works. For my Testing: 1. I set the replication Factor to 3, started with 1 active node (the