Recommender system

2012-07-06 Thread Carlos Sanchez
All, Is anyone aware of a [distributed] Recommender system (i.e. amazon recommendations) that uses cassandra as a back-end? Thanks a lot Carlos

Re: Python Client

2011-06-11 Thread Carlos Sanchez
client language options are found here: > http://wiki.apache.org/cassandra/ClientOptions > > On Jun 10, 2011, at 9:58 PM, Carlos Sanchez wrote: > >> All, >> >> I was wondering if there are Cassandra python clients and which one would be >> the best to use >> >> Thanks a lot, >> >> Carlos >

Python Client

2011-06-10 Thread Carlos Sanchez
All, I was wondering if there are Cassandra python clients and which one would be the best to use Thanks a lot, Carlos

Bloom filter

2011-01-13 Thread Carlos Sanchez
All, Could someone tell me where (what classes) or what library is Cassandra using for its bloom filters? Thanks Carlos This email message and any attachments are for the sole use of the intended recipients and may contain proprietary and/or confidential information which may be privileged o

Merkle trees

2010-11-05 Thread Carlos Sanchez
All, Is there a good resource to learn about merkle trees and its implementation in Cassandra? What classes should I be looking at. Thx a lot Carlos This email message and any attachments are for the sole use of the intended recipients and may contain proprietary and/or confidential informatio

Index package

2010-08-14 Thread Carlos Sanchez
All, Could I know what package or classes are used in Cassandra for secondary column indices? Thanks Carlos This email message and any attachments are for the sole use of the intended recipients and may contain proprietary and/or confidential information which may be privileged or otherwise

Index feature in 0.7

2010-08-13 Thread Carlos Sanchez
All, I was wondering if I could get some information (link / pdf) about the new [column] indices in Cassandra for version 0.7 Thanks a lot, Carlos This email message and any attachments are for the sole use of the intended recipients and may contain proprietary and/or confidential informatio

RE: Map Reduce support

2010-06-28 Thread Carlos Sanchez
Drew, I was wondering if you care to share your map-reduce code Thanks Carlos From: Drew Dahlke [drew.dah...@bronto.com] Sent: Monday, June 28, 2010 7:17 AM To: user@cassandra.apache.org Subject: Re: Map Reduce support The difference is noticeable but sm

KeyRange deletion

2010-06-23 Thread Carlos Sanchez
Would there be any support for a KeyRange (start/end key) deletion in 0.7? Carlos This email message and any attachments are for the sole use of the intended recipients and may contain proprietary and/or confidential information which may be privileged or otherwise protected from disclosure. An

RE: Lucandra issues

2010-06-20 Thread Carlos Sanchez
Jake, I will be interested in this functionality Carlos From: Jake Luciani [jak...@gmail.com] Sent: Friday, June 18, 2010 10:57 PM To: user@cassandra.apache.org Subject: Re: Lucandra issues Hi Maxim, Lucandra doesn't support numeric queries quite yet. A

RE: Range Slices timing question

2010-06-10 Thread Carlos Sanchez
at 11:04 AM, Carlos Sanchez wrote: > I have about a million rows (each row with 100 cols) of the form > domain/!date/!id (e.g. gwm.com/!20100430/!CFRA4500) So I am interested in > getting all the ids (all cols) for a particular domain/date (e.g. > "gwm.ml.com/!20100430/!A&quo

Range Slices timing question

2010-06-09 Thread Carlos Sanchez
I have about a million rows (each row with 100 cols) of the form domain/!date/!id (e.g. gwm.com/!20100430/!CFRA4500) So I am interested in getting all the ids (all cols) for a particular domain/date (e.g. "gwm.ml.com/!20100430/!A" "gwm.ml.com/!20100430/!D"). I am looping in chunks of 6000 rows

RE: high-scale-lib & clhm-production jars

2010-05-25 Thread Carlos Sanchez
om/p/concurrentlinkedhashmap/ On Tue, May 25, 2010 at 10:17 AM, Carlos Sanchez mailto:carlos.sanc...@riskmetrics.com>> wrote: Do anyone know if there are repositories for high-scale-lib & clhm-production jars? Is the source available somewhere? Thanks Carlos This email message and any attachments ar

high-scale-lib & clhm-production jars

2010-05-25 Thread Carlos Sanchez
Do anyone know if there are repositories for high-scale-lib & clhm-production jars? Is the source available somewhere? Thanks Carlos This email message and any attachments are for the sole use of the intended recipients and may contain proprietary and/or confidential information which may be

Key distribution

2010-04-29 Thread Carlos Sanchez
All, Does anyone know of a program (series of classes) that can capture the key distribution of the rows in a ColumnFamily, sort of a [sub] string-histogram. Thanks, Carlos This email message and any attachments are for the sole use of the intended recipients and may contain proprietary and/o

RE: Trove maps

2010-04-24 Thread Carlos Sanchez
maps On Fri, Apr 23, 2010 at 1:22 PM, Carlos Sanchez wrote: > I will try to modify the code... what I like about Trove is that even for > regular maps (non primitive) there are no Entry objects created so there are > much less references to be gced This could help, but how is itera

Re: Trove maps

2010-04-23 Thread Carlos Sanchez
ress.py.) > > -Jonathan > > On Fri, Apr 23, 2010 at 2:50 PM, Carlos Sanchez > wrote: >> Jonathan, >> >> Have you thought of using Trove collections instead of regular java >> collections (HashMap / HashSet) in Cassandra? Trove maps are faster and >&g

Trove maps

2010-04-23 Thread Carlos Sanchez
Jonathan, Have you thought of using Trove collections instead of regular java collections (HashMap / HashSet) in Cassandra? Trove maps are faster and require less memory Carlos This email message and any attachments are for the sole use of the intended recipients and may contain proprietary an

Row deletion and get_range_slices (cassandra 0.6.1)

2010-04-22 Thread Carlos Sanchez
I have a curious question.. I am doing some testing where I insert 500 rows to a super column family and then delete one row, I make sure the row was indeed deleted (NotFoundException in the get call) and then I ran a get_range_slices and the row indeed returned. The shutdown Cassandra and rest

RE: Batch row deletion

2010-04-20 Thread Carlos Sanchez
, 2010 at 10:45 PM, Carlos Sanchez wrote: > All, > > Is there or will there be a feature to batch delete rows? (KeyRange delete?) > > Thanks > > Carlos > > This email message and any attachments are for the sole use of the intended > recipients and may contain p

Batch row deletion

2010-04-20 Thread Carlos Sanchez
All, Is there or will there be a feature to batch delete rows? (KeyRange delete?) Thanks Carlos This email message and any attachments are for the sole use of the intended recipients and may contain proprietary and/or confidential information which may be privileged or otherwise protected fro

Insertion time question

2010-03-30 Thread Carlos Sanchez
I was wondering if I could have a bit more insight as why we are seeing different insertion times between regular column families and super columns. We have a group object (with its name) that may have a series of attributes (name/value). There can be up a million group object and different grou

Re: Write times

2010-03-29 Thread Carlos Sanchez
de > is a gateway to do anything, anywhere in the cluster. > > On 2010-03-29 23:31, Carlos Sanchez wrote: >> Would it be best then for the client to select the node to write to when >> using OPP in order to evenly distributes the keys? >> >> On Mar 29, 201

Re: Write times

2010-03-29 Thread Carlos Sanchez
nodes to be overworked. > > In other words, OPP vs. RP on a single node system should have no real effect. > > -Original Message- > From: Carlos Sanchez > Date: Mon, 29 Mar 2010 18:58:50 > To: user@cassandra.apache.org > Subject: Write times > > Are writes on Orde

Write times

2010-03-29 Thread Carlos Sanchez
Are writes on OrderPreservingPartitioner always slower than RandomPartitioner? Is the replication factor a 'factor' in the write times? Thanks, Carlos This email message and any attachments are for the sole use of the intended recipients and may contain proprietary and/or confidential informat

Re: updates on hector, a java cassandra client

2010-03-23 Thread Carlos Sanchez
cassandra and thrift) that aren't in a public repository. Any suggestion of to deal with it? On Tue, Mar 23, 2010 at 12:58 PM, Carlos Sanchez mailto:carlos.sanc...@riskmetrics.com>> wrote: Ran, Are you planning to submit hector to a maven reposito

RE: updates on hector, a java cassandra client

2010-03-23 Thread Carlos Sanchez
Ran, Are you planning to submit hector to a maven repository? Carlos From: Ran Tavory [ran...@gmail.com] Sent: Tuesday, March 23, 2010 4:36 AM To: user@cassandra.apache.org Subject: Re: updates on hector, a java cassandra client Download ready jars with a

RE: Storing large blobs

2010-03-17 Thread Carlos Sanchez
@cassandra.apache.org Subject: Re: Storing large blobs My question would be how large is large? Perhaps you could compress the blobs and then store them. But it depends on the answer to the first question. Cheers Avinash On Wed, Mar 17, 2010 at 5:10 PM, Carlos Sanchez mailto:carlos.sanc

Storing large blobs

2010-03-17 Thread Carlos Sanchez
Has anyone had experience storing large blobs in Cassandra? Is really Cassandra tailored for large content? Carlos This email message and any attachments are for the sole use of the intended recipients and may contain proprietary and/or confidential information which may be privileged or other