Re: migration to a new model

2014-06-05 Thread Marcelo Elias Del Valle
Michael, I will try to test it by tomorrow and will let you know the results. Thanks a lot! Best regards, Marcelo. 2014-06-04 22:28 GMT-03:00 Laing, Michael : > BTW you might want to put a LIMIT clause on your SELECT for testing. -ml > > > On Wed, Jun 4, 2014 at 6:04 PM, Laing, Micha

Re: migration to a new model

2014-06-04 Thread Laing, Michael
BTW you might want to put a LIMIT clause on your SELECT for testing. -ml On Wed, Jun 4, 2014 at 6:04 PM, Laing, Michael wrote: > Marcelo, > > Here is a link to the preview of the python fast copy program: > > https://gist.github.com/michaelplaing/37d89c8f5f09ae779e47 > > It will copy a table fr
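
A minimal sketch of the LIMIT suggestion above, assuming the source table is the entity_lookup table from the original post. The helper name build_select is hypothetical and is not taken from the gist; it only shows how a test run could cap the number of rows read.

```python
def build_select(table, columns, limit=None):
    """Return a CQL SELECT string, optionally capped with LIMIT for test runs.

    Hypothetical helper for illustration; table and column names are
    inserted as-is, so pass only trusted identifiers.
    """
    if limit is not None and limit <= 0:
        raise ValueError("limit must be a positive integer")
    query = "SELECT {} FROM {}".format(", ".join(columns), table)
    if limit is not None:
        # Cap the result set so a trial copy touches only a few rows.
        query += " LIMIT {}".format(int(limit))
    return query


# Trial run: read only the first 100 rows.
test_query = build_select("entity_lookup", ["name", "value", "entity_id"], limit=100)
# Full run: no LIMIT clause.
full_query = build_select("entity_lookup", ["name", "value", "entity_id"])
```

Dropping the limit argument once the trial looks right gives the full-table query without touching the rest of the copy logic.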

Re: migration to a new model

2014-06-04 Thread Laing, Michael
Marcelo, Here is a link to the preview of the python fast copy program: https://gist.github.com/michaelplaing/37d89c8f5f09ae779e47 It will copy a table from one cluster to another with some transformation; the source and destination can be the same cluster. It has 3 main throttles to experiment with: 1. fetch_si

Re: migration to a new model

2014-06-04 Thread Laing, Michael
OK Marcelo, I'll work on it today. -ml On Tue, Jun 3, 2014 at 8:24 PM, Marcelo Elias Del Valle < marc...@s1mbi0se.com.br> wrote: > Hi Michael, > > For sure I would be interested in this program! > > I am new both to python and for cql. I started creating this copier, but > was having problems wi

Re: migration to a new model

2014-06-03 Thread Marcelo Elias Del Valle
Hi Michael, I would certainly be interested in this program! I am new to both Python and CQL. I started creating this copier, but was having problems with timeouts. Alex solved my problem here on the list, but I think I will still have a lot of trouble making the copy work properly. I open sou

Re: migration to a new model

2014-06-03 Thread Laing, Michael
Hi Marcelo, I could create a fast copy program by repurposing some python apps that I am using for benchmarking the python driver - do you still need this? With high levels of concurrency and multiple subprocess workers, based on my current actual benchmarks, I think I can get well over 1,000 row
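
As a rough illustration of copying with a high but bounded level of concurrency, here is a sketch that uses threads rather than the subprocess workers mentioned above, purely to keep it short. The names copy_rows, write_row, max_in_flight and workers are assumptions made for this example and do not come from the benchmark apps; write_row stands in for whatever call inserts one row into the destination table.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


def copy_rows(rows, write_row, max_in_flight=32, workers=8):
    """Call write_row(row) for every row, capping the number of pending writes.

    Returns (copied_count, errors), where errors is a list of (row, exception).
    """
    slots = threading.BoundedSemaphore(max_in_flight)
    lock = threading.Lock()
    errors = []
    copied = 0

    def task(row):
        nonlocal copied
        try:
            write_row(row)
            with lock:
                copied += 1
        except Exception as exc:  # record the failure and keep copying
            with lock:
                errors.append((row, exc))
        finally:
            slots.release()

    # Leaving the with-block waits for all submitted writes to finish.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for row in rows:
            slots.acquire()  # blocks the reader while too many writes are pending
            pool.submit(task, row)
    return copied, errors


# Usage with an in-memory list standing in for the destination table.
destination = []
copied, errors = copy_rows(range(100), destination.append)
```

The semaphore acts as a throttle: the reader cannot run ahead of the writers by more than max_in_flight rows, which keeps memory use flat and avoids flooding the destination.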

Re: migration to a new model

2014-06-02 Thread Marcelo Elias Del Valle
Hi Jens, Thanks for trying to help. Indeed, I know I can't do it using just CQL. But what would you use to migrate data manually? I tried to create a python program using auto paging, but I am getting timeouts. I also tried Hive, but without success. I only have two nodes and less than 200 GB in this c

Re: migration to a new model

2014-06-02 Thread Jens Rantil
Hi Marcelo, Looks like you can't do this without migrating your data manually: https://stackoverflow.com/questions/18421668/alter-cassandra-column-family-primary-key-using-cassandra-cli-or-cql Cheers, Jens On Mon, Jun 2, 2014 at 7:48 PM, Marcelo Elias Del Valle < marc...@s1mbi0se.com.br> wrote:

migration to a new model

2014-06-02 Thread Marcelo Elias Del Valle
Hi, I have some CQL column families (CFs) in a 2-node Cassandra 2.0.8 cluster. I realized I created my column family with the wrong partition key. Instead of: CREATE TABLE IF NOT EXISTS entity_lookup ( name varchar, value varchar, entity_id uuid, PRIMARY KEY ((name, value), entity_id)) WITH caching=all;