Bad versioning in Yum repository

2011-10-19 Thread David Strauss
The current version scheme in the "riptano" repository puts 1.0.0rc2 as the successor to 1.0.0 final. I think this will resolve itself with the 1.0.1 release (or later), but it's broken right now. [root@david-onebox ~]# yum -y update Loaded plugins: presto Setting up Update Process Resolving Depen

Re: CassandraFS in 1.0?

2011-07-12 Thread David Strauss
? > > If you store large files, how large is your store per node and how does it > handle compactions (any performance issues while compacting large data)? > > Also would be interesting to hear some benchmarks and performance stats for > read/writes. > > Regards, > Rus

Re: CassandraFS in 1.0?

2011-07-11 Thread David Strauss
It's not, currently, but I'm happy to answer questions about its architecture. On Thu, Jul 7, 2011 at 10:35, Norman Maurer wrote: > May I ask if its opensource by any chance ? > > bye > norman > > Am Donnerstag, 7. Juli 2011 schrieb David Strauss : >> I'm

Re: CassandraFS in 1.0?

2011-07-07 Thread David Strauss
d_ > (or mongodb->gridfs) > happy to help as I am moving down this road in general > Thanks! > > /* > Joe Stein > http://www.linkedin.com/in/charmalloc > Twitter: @allthingshadoop > */ > -- David Strauss    | da...@davidstrauss.net    | +1 512 577 5827 [mobile]

Re: Sharing Cassandra with Solandra

2011-06-27 Thread David Strauss
On Mon, 2011-06-27 at 15:06 -0600, AJ wrote: > Would anyone care to talk about their experiences with using Solandra > along side another application that uses Cassandra (also on the same > node)? I'm curious about any resource contention issues or > compatibility between C* versions and Sol.

Re: Apt repositories

2011-04-27 Thread David Strauss
On Tue, 2011-04-26 at 19:03 -0500, Eric Evans wrote: > There was already a repo for cassandra-0.6 (called 06x), it just fell > through the cracks with the last release. > > There is one for each version now (06x, 07x, and 08x). The unstable > suite continues to point to latest-and-greatest. The w

Re: Apt repositories

2011-04-27 Thread David Strauss
On Tue, 2011-04-26 at 19:03 -0500, Eric Evans wrote: > There is one for each version now (06x, 07x, and 08x). The unstable > suite continues to point to latest-and-greatest. The wiki has been > updated. Where, exactly, is this on the wiki? I had been using the CloudConfig page [1], which still on

Re: encryption_options & 0.8

2011-04-27 Thread David Strauss
On Wed, 2011-04-27 at 12:56 +0200, Sasha Dolgy wrote: > "IBM WebSphere applies a hardcoded XOR. Each caracter is XOR'd with > the caracter ‘_’, and the resulting string is encoded in base64. This > is not cryptography, it is just enough encoding so that a casual > glance at the file will not reveal

Re: encryption_options & 0.8

2011-04-26 Thread David Strauss
On Tue, 2011-04-26 at 08:57 +0200, Sasha Dolgy wrote: > Is it possible to store an encrypted keystore_password and > truststore_password in the cassandra.yaml? I see that the defaults > allow cleartext which isn't suitable when negotiating with security > specialists for sign-off of a solution...

Re: Manual Conflict Resolution in Cassandra

2011-04-25 Thread David Strauss
On Mon, 2011-04-25 at 03:50 -0700, Milind Parikh wrote: > I suppose the term 'silently dropped' is a matter of perspective. C > makes an explicit automated choice of latest-timestamp-wins. In > certain situations, this is not the appropriate choice. I would still insist that using Cassandra and ex

RE: Apt repositories

2011-04-25 Thread David Strauss
On Mon, 2011-04-25 at 15:25 -0700, Gregory Szorc wrote: > If you don't want your APT-sourced packages to upgrade automatically, > I suggest pinning the package. I'm aware that I can pin the package, but it's still a workaround for the Cassandra Apt repository not being set up according to best pra

Re: Manual Conflict Resolution in Cassandra

2011-04-25 Thread David Strauss
On Fri, 2011-04-22 at 13:31 -0700, Milind Parikh wrote: > Is there a chance of getting manual conflict resolution in Cassandra? > Please see attachment for why this is important in some cases. You can actually already perform "manual conflict resolution" in Cassandra by naming your columns so that

Apt repositories

2011-04-23 Thread David Strauss
I just noticed that, following the Cassandra 0.8 beta release, the Apt repository is encouraging servers in my clusters to upgrade. Beta releases should probably be on different channels (or named differently) than stable ones. Better yet would be naming the packages based on the major release in

Re: Understand eventually consistent

2011-02-20 Thread David Strauss
On Fri, 2011-02-18 at 12:01 -0600, Anthony John wrote: > Writes will go thru w/hinted handoff, read will fail That is not correct. Hinted handoffs do not count toward reaching QUORUM counts.[1] [1] http://wiki.apache.org/cassandra/HintedHandoff -- David Strauss | da...@davidstrauss.

Re: Replacing Redis

2011-02-20 Thread David Strauss
e slower growing part. The > lpush/ltrim usage was in here. It's not clear just from your use of lpush/ltrim what you're doing. Some uses can map to Cassandra's APIs with some creative reinterpretation; others can't. -- David Strauss | da...@davidstrauss.net | +

Cassandra-related post by a friend

2010-07-16 Thread David Strauss
ead on the status of the evolving non-relational space. -- David Strauss | da...@fourkitchens.com | +1 512 577 5827 [mobile] Four Kitchens | http://fourkitchens.com | +1 512 454 6659 [office] | +1 512 870 8453 [direct] signature.asc Description: OpenPGP digital signature

Re: A very short summary on Cassandra for a book

2010-07-15 Thread David Strauss
n 0.7 as a result of Digg's work), it will be possible to support increment and decrement operations, just not ones that give you an instant, unique result. The actual inc and dec support probably won't be in 0.7, though. > "Also sorting happens on insert time" &

Re: Digg 4 Preview on TWiT

2010-07-06 Thread David Strauss
t; > On Tue, Jul 6, 2010 at 3:34 AM, David Strauss <mailto:da...@fourkitchens.com>> wrote: > > On 2010-07-05 15:40, Eric Evans wrote: > > On Sun, 2010-07-04 at 13:14 +0100, Bill de hÓra wrote: > >> This person's understanding is that Fa

Re: Digg 4 Preview on TWiT

2010-07-06 Thread David Strauss
right question for Facebook's purposes. That decision isn't an indictment of Cassandra's capability; it's confirmation that Cassandra isn't everything to everyone. But we already knew that. :-) -- David Strauss | da...@fourkitchens.com | +1 512 577 5827 [mobile] Four

Re: how does cassandra compare with mongodb?

2010-05-14 Thread David Strauss
On 2010-05-13 19:48, Steve Lihn wrote: > Now the java community is developing a database like Mongo that is > schema-less. Mongo is written in C++. -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 512 454 6659 [office] | +1 512 870 8453 [

Re: Extremly slow inserts on LAN

2010-05-09 Thread David Strauss
utes. > > In both cases Cassandra server contains a single node. > > > > We use Cassandra version 0.6.0 running on Windows server 2008. > > The client is .NET c# application. -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 51

Re: bloom filter

2010-05-07 Thread David Strauss
On 2010-05-07 10:58, vineet daniel wrote: > Is there any way to view the content of this file. Which file? -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 512 454 6659 [office] | +1 512 870 8453 [direct] signature.asc Description: Open

Re: bloom filter

2010-05-07 Thread David Strauss
indicate mutually exclusive ranges of addresses, while the bloom filters may indicate the possible presence of a key in *several* files.) -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 512 454 6659 [office] | +1 512 870 8453 [direct] signature.asc Description: OpenPGP digital signature

Re: bloom filter

2010-05-07 Thread David Strauss
es a false > positive. > > See: > > http://spyced.blogspot.com/2009/01/all-you-ever-wanted-to-know-about.html It is also important for identifying which SSTable files to look inside even when a key is present. -- David Strauss | da...@fourkitchens.com Four Kitchens | http://f

Re: bloom filter

2010-05-07 Thread David Strauss
On 2010-05-07 10:51, vineet daniel wrote: > what is the benefit of creating bloom filter when cassandra writes data, > how does it helps ? http://wiki.apache.org/cassandra/ArchitectureOverview -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com |

Re: Appropriate use for Cassandra?

2010-05-05 Thread David Strauss
re a entity per row, > with the time-series data as (or in?) super columns (and maybe it > would make sense to break those out in column families by date range). > I'd have to think through a little more what that might mean for our > secondary indexing needs. > > Thanks, &g

Re: Appropriate use for Cassandra?

2010-05-04 Thread David Strauss
n Cassandra. > Thoughts? Pitfalls? Gotchas? Are we completely whacked? Does the random partitioner support what you need? -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 512 454 6659 [office] | +1 512 870 8453 [direct] signature.asc Description: OpenPGP digital signature

Re: Is this sentence slightly inaccurate

2010-04-08 Thread David Strauss
r. I just want the ability to walk all rows and prune stuff. -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 512 454 6659 [office] | +1 512 870 8453 [direct] signature.asc Description: OpenPGP digital signature

Re: Write consistency

2010-04-08 Thread David Strauss
I think the general answer here is don't use R=1 if you can't > tolerate > inconsistency? Still the point of confusion -- if W=3 and the write > succeeds on 2 nodes but fails the 3rd, the write fails (to the > updating client), but is the

Re: Is this sentence slightly inaccurate

2010-04-07 Thread David Strauss
iscover them. Cold comfort, imo. Is it possible to iterate through RP keys? I've gotten mixed answers. -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 512 454 6659 [office] | +1 512 870 8453 [direct] signature.asc Description: OpenPGP digital signature

Re: how to store list data in Apache Cassndra ?

2010-04-06 Thread David Strauss
(a python lib of cassandra) >> >> import pycassa >> client = pycassa.connect() >> cf = pycassa.ColumnFamily(client, 'mygame', 'user') >> >> key = '1234567890' >> value = { >> 'name': 'Lee Li', >> >> 'age'; '21', >> 'girls': ['java', 'actionscript', 'python'], >> } >> >> cf.insert(key, value) >> >> >> Oops, get error while save a `value` like above. >> >> So, how to store list data in Apache Cassndra ? >> >> >> Thanks for reply. >> >> >> >> >> -- >> Shuge Lee | Lee Li >> -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 512 454 6659 [office] | +1 512 870 8453 [direct] signature.asc Description: OpenPGP digital signature

Re: Cassandra Design or another solution

2010-04-05 Thread David Strauss
//row key > mark_userId: ""// (column name : value), > mark2_userId2: "" > }, > gameId2:{//row key > mark_userId: "" > } > } > > > On Sun, Apr 4, 2010 at 11:44 PM, David Strauss <mailto:da...@fourkitch

Re: Memcached protocol?

2010-04-05 Thread David Strauss
On 2010-04-05 07:47, Paul Prescod wrote: > On Mon, Apr 5, 2010 at 12:01 AM, David Strauss wrote: >> On 2010-04-05 03:42, Paul Prescod wrote: >> ... >> >> There is a difference between Cassandra allowing inc/dec on values and >> actually *knowing* the resultant val

Re: Memcached protocol?

2010-04-05 Thread David Strauss
hich are the basis of counters. There is a difference between Cassandra allowing inc/dec on values and actually *knowing* the resultant value at the time of the write. It's likely that inc/dec support will still feature blind writes if at all possible. The memcached protocol returns a resultant val

Re: Cassandra Design or another solution

2010-04-04 Thread David Strauss
key/row, and column? > When user's mark changes, we remove old mark_userId and add new > mark_userId. > Because user's mark change frequently and with large amount of user, I > think Cassandra can not satisfy. On the contrary, Cassandra excels at tracking rapidly cha

Re: Read Performance

2010-03-31 Thread David Strauss
Or, if faking memcached misses is too high a price to pay, queue some proportion of the reads to replay asynchronously against Cassandra. On Wed, 2010-03-31 at 11:04 -0500, Jonathan Ellis wrote: > Can you redirect some of the reads from memcache to cassandra? Sounds > like the cache isn't getting

Re: Replicating data over the wan?

2010-03-30 Thread David Strauss
write to for any row in either DC without need for hinted hand-off. Writes should then succeed if you use SL.ONE, even with a disconnect between DCs. -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 512 454 6659 [office] | +1 512 870 8453 [dir

Re: Poor performance; PHP & Thrift to blame

2010-03-30 Thread David Strauss
enerated > with thrift compiler The generated Thrift client automatically makes use of the PHP extension, if available, for certain small, computationally intense portions of the code. You have to use TBinaryProtocolAccelerated for this to work. -- David Strauss | da...@fourkitchens.com

Re: Large data files and no "edit in place"?

2010-03-30 Thread David Strauss
original set) on a designated street corner. You'd want to pick the bucket option -- despite the higher volume -- and your hard disk would agree. -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 512 454 6659 [office] | +1 512 870 8453 [direct] signature.asc Description: OpenPGP digital signature

Re: Poor performance; PHP & Thrift to blame

2010-03-29 Thread David Strauss
ver, I was testing with many threads making many small requests (under 10 columns each). -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 512 454 6659 [office] | +1 512 870 8453 [direct] signature.asc Description: OpenPGP digital signature

Re: Write times

2010-03-29 Thread David Strauss
t;> To: user@cassandra.apache.org >> Subject: Write times >> >> Are writes on OrderPreservingPartitioner always slower than >> RandomPartitioner? Is the replication factor a 'factor' in the write times? -- David Strauss | da...@fourkitchens.com | +1 512 577 5

Re: How reliable is cassandra?

2010-03-29 Thread David Strauss
are very nice LDAP systems available that support multi-master replication and are commonly used as key stores. Also, I imagine reads are more important than writes for you, and LDAP tools tend to be optimized for reads. -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fo

Re: Digg's data model

2010-03-19 Thread David Strauss
r a site like Digg, immediate consistency just isn't that important. (2) Treat one direction as the canonical source. When reading from the other direction, verify that each item exists in the other direction. (3) Add locking above Cassandra, using a system like ZooKeeper. (4) Don't use C

Re: serialized vector clock as global counter?

2010-03-14 Thread David Strauss
g from Cassandra. You could even use memcached to maintain the counters. You'd have to be careful to avoid re-using values on memcached restart or clearing, but there are many ways to do that. -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 5

Re: serialized vector clock as global counter?

2010-03-14 Thread David Strauss
e interesting responses from this list if you share the original problem you're trying to solve instead of asking us how to implement a component of your chosen solution. -- David Strauss | da...@fourkitchens.com Four Kitchens | http://fourkitchens.com | +1 512 454 6659 [offi