Re: NoSE: Automated schema design for Cassandra

2017-05-11 Thread Michael Mior
nk it’s possible to generate test code to validate the workload? > > > > *--* > > *Jacques-Henri Berthemet* > > > > *From:* michael.m...@gmail.com [mailto:michael.m...@gmail.com] *On Behalf > Of *Michael Mior > *Sent:* mardi 9 mai 2017 17:30 > *To:* user > *Subject:* NoSE: Au

RE: NoSE: Automated schema design for Cassandra

2017-05-10 Thread Jacques-Henri Berthemet
generate test code to validate the workload? -- Jacques-Henri Berthemet From: michael.m...@gmail.com [mailto:michael.m...@gmail.com] On Behalf Of Michael Mior Sent: mardi 9 mai 2017 17:30 To: user Subject: NoSE: Automated schema design for Cassandra Hi all, I wanted to share a tool I've

NoSE: Automated schema design for Cassandra

2017-05-09 Thread Michael Mior
Hi all, I wanted to share a tool I've been working on that tries to help automate the schema design process for Cassandra. The short description is that you provide information on the kind of data you want to store and the queries and updates you want to issue, and NoSE will perform a cost-

Re: Read Performance / Schema Design

2011-10-26 Thread David Jeske
On Wed, Oct 26, 2011 at 7:35 PM, Ben Gambley wrote: > Our requirement is to store per user, many unique results (which is > basically an attempt at some questions ..) so I had thought of having the > userid as the row key and the result id as columns. > > The keys for the result ids are maintaine

Re: Read Performance / Schema Design

2011-10-26 Thread Tyler Hobbs
On Wed, Oct 26, 2011 at 9:35 PM, Ben Gambley wrote: > > Hi Everyone > > I have a question with regards read performance and schema design if > someone could help please. > > > Our requirement is to store per user, many unique results (which is > basically an attempt at

Read Performance / Schema Design

2011-10-26 Thread Ben Gambley
Hi Everyone I have a question with regards read performance and schema design if someone could help please. Our requirement is to store per user, many unique results (which is basically an attempt at some questions ..) so I had thought of having the userid as the row key and the result id

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-03 Thread Aditya Narayan
Thanks Tyler! On Thu, Feb 3, 2011 at 12:06 PM, Tyler Hobbs wrote: > On Wed, Feb 2, 2011 at 3:27 PM, Aditya Narayan wrote: >> >> Can I have some more feedback about my schema perhaps somewhat more >> criticisive/harsh ? > > It sounds reasonable to me. > > Since you're writing/reading all of the

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread Tyler Hobbs
On Wed, Feb 2, 2011 at 3:27 PM, Aditya Narayan wrote: > Can I have some more feedback about my schema perhaps somewhat more > criticisive/harsh ? > It sounds reasonable to me. Since you're writing/reading all of the subcolumns at the same time, I would opt for a standard column with the tags se

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread Aditya Narayan
for a user, for >>>>>> a >>>>>> day, that would only contain sub column names with the tags you're >>>>>> looking >>>>>> for? Then based upon the column names returned, you'd look-up the >>>>>> remi

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread Aditya Narayan
t; That seems like a solid schema to me. >>>>> >>>>> Bill- >>>>> >>>>> On 02/02/2011 09:37 AM, Aditya Narayan wrote: >>>>>> >>>>>> Actually, I am trying to use Cassandra to display to users on my >>>

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread William R Speirs
as the supercolumn names and the subcolumns inside that supercolumns could contain the list of tags associated with particular reminder. All tags set at once during first write. The no of tags(subcolumns) will be around 8 maximum. Any comments, suggestions and feedback on the schema design are requested..

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread Aditya Narayan
; presented to the user in a chronological order like a news feed. >>>> Each reminder has got certain tags associated with it(so that, at >>>> times, user may also choose to see the reminders filtered by tags in >>>> chronological order). >>>>

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread William R Speirs
n names and the subcolumns inside that supercolumns could contain the list of tags associated with particular reminder. All tags set at once during first write. The no of tags(subcolumns) will be around 8 maximum. Any comments, suggestions and feedback on the schema design are requested.. T

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread Aditya Narayan
he supercolumn names and the subcolumns >> inside that supercolumns could contain the list of tags associated >> with particular reminder. All tags set at once during first write. The >> no of tags(subcolumns) will be around 8 maximum. >> >> Any comments, suggestions and

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread William R Speirs
ce during first write. The no of tags(subcolumns) will be around 8 maximum. Any comments, suggestions and feedback on the schema design are requested.. Thanks Aditya Narayan On Wed, Feb 2, 2011 at 7:49 PM, Aditya Narayan wrote: Hey all, I need to store supercolumns each with around 8 subcolumn

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread Aditya Narayan
of tags associated with particular reminder. All tags set at once during first write. The no of tags(subcolumns) will be around 8 maximum. Any comments, suggestions and feedback on the schema design are requested.. Thanks Aditya Narayan On Wed, Feb 2, 2011 at 7:49 PM, Aditya Narayan wrote: >

Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread Aditya Narayan
Hey all, I need to store supercolumns each with around 8 subcolumns; All the data for a supercolumn is written at once and all subcolumns need to be retrieved together. The data in each subcolumn is not big, it just contains keys to other rows. Would it be preferred to have a supercolumn family o

Re: Schema Design

2011-01-30 Thread Jonathan Ellis
r a host in a single row is not a good choice. 2 >> reason: >> 1, too few keys, so your data will not distributing well. >> 2, data under a key will always increase. So Cassandra have to do more >> SSTable compaction. >> >> -邮件原件- >> 发件人: Wil

Re: Schema Design

2011-01-29 Thread aaron morton
t; > -邮件原件- > 发件人: William R Speirs [mailto:bill.spe...@gmail.com] > 发送时间: 2011年1月27日 9:15 > 收件人: user@cassandra.apache.org > 主题: Re: Schema Design > > It makes sense that the single row for a system (with a growing number of > columns) will reside on a single mac

Re: Schema Design

2011-01-26 Thread Wangpei (Peter)
.@gmail.com] 发送时间: 2011年1月27日 9:15 收件人: user@cassandra.apache.org 主题: Re: Schema Design It makes sense that the single row for a system (with a growing number of columns) will reside on a single machine. With that in mind, here is my updated schema: - A single column family for all the mess

Re: Schema Design

2011-01-26 Thread William R Speirs
Ah, sweet... thanks for the link! Bill- On 01/26/2011 08:20 PM, buddhasystem wrote: Bill, it's all explained here: http://wiki.apache.org/cassandra/MemtableThresholds#JVM_Heap_Size,the Watch the number of CFs and the memtable sizes. In my experience, this all matters.

Re: Schema Design

2011-01-26 Thread buddhasystem
Bill, it's all explained here: http://wiki.apache.org/cassandra/MemtableThresholds#JVM_Heap_Size,the Watch the number of CFs and the memtable sizes. In my experience, this all matters. -- View this message in context: http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/S

Re: Schema Design

2011-01-26 Thread William R Speirs
ich you means you'll definitely not be able to distribute your load very well. From: Bill Speirs [bill.spe...@gmail.com] Sent: Wednesday, January 26, 2011 1:23 PM To: user@cassandra.apache.org Subject: Re: Schema Design I like this approach, but I have 2 q

RE: Schema Design

2011-01-26 Thread Shu Zhang
very well. From: Bill Speirs [bill.spe...@gmail.com] Sent: Wednesday, January 26, 2011 1:23 PM To: user@cassandra.apache.org Subject: Re: Schema Design I like this approach, but I have 2 questions: 1) what is the implications of continually adding columns t

Re: Schema Design

2011-01-26 Thread buddhasystem
065146.n2.nabble.com/Schema-Design-tp5964167p5964326.html Sent from the cassandra-u...@incubator.apache.org mailing list archive at Nabble.com.

Re: Schema Design

2011-01-26 Thread Nick Santini
here are memory considerations. > > > > Cheers > > > > Maxim > > > > -- > > View this message in context: > http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Schema-Design-tp5964167p5964227.html > > Sent from the cassandra-u...@incubator.apache.org mailing list archive > at Nabble.com. > > >

Re: Schema Design

2011-01-26 Thread David McNelis
My cli knowledge sucks so far, so I'll leave that to othersI'm doing most of my reading/writing through a thrift client (hector/java based) As for the implications, as of the latest version of Cassandra there is not theoretical limit to the number of columns that a particular row can hold. O

Re: Schema Design

2011-01-26 Thread Bill Speirs
s of using OPP. > > Whether to shard data as per system depends on how many you have. If more > than a few, don't do that, there are memory considerations. > > Cheers > > Maxim > > -- > View this message in context: > http://cassandra-user-incubator-apa

Re: Schema Design

2011-01-26 Thread Bill Speirs
I like this approach, but I have 2 questions: 1) what is the implications of continually adding columns to a single row? I'm unsure how Cassandra is able to grow. I realize you can have a virtually infinite number of columns, but what are the implications of growing the number of columns over time

Re: Schema Design

2011-01-26 Thread buddhasystem
ator-apache-org.3065146.n2.nabble.com/Schema-Design-tp5964167p5964227.html Sent from the cassandra-u...@incubator.apache.org mailing list archive at Nabble.com.

Re: Schema Design

2011-01-26 Thread David McNelis
I would say in that case you might want to try a single column family where the key to the column is the system name. Then, you could name your columns as the timestamp. Then when retrieving information from the data store you can can, in your slice request, specify your start column as X and

Schema Design

2011-01-26 Thread Bill Speirs
I'm looking to use Cassandra to store log messages from various systems. A log message only has a message (UTF8Type) and a data/time. My thought is to create a column family for each system. The row key will be a TimeUUIDType. Each row will have 7 columns: year, month, day, hour, minute, second, an

Re: Is this a good schema design to implement a social application..

2011-01-08 Thread Edward Capriolo
indexed >> currently may suck here, whenever a small no (10-20 ) of subcolumns >> need to be retreived from a large list of subcolumns of a supercolumn >> like MyPostsIdKeysList. >> >> On Fri, Jan 7, 2011 at 9:58 PM, Raj wrote: >>> My question is in context of a

Re: Is this a good schema design to implement a social application..

2011-01-07 Thread Rajkumar Gupta
like MyPostsIdKeysList. > > On Fri, Jan 7, 2011 at 9:58 PM, Raj wrote: >> My question is in context of a social network schema design >> >> I am thinking of following schema for storing a user's data that is >> required as he logs in & is led to his homepage:-

Re: Is this a good schema design to implement a social application..

2011-01-07 Thread Rajkumar Gupta
context of a social network schema design > > I am thinking of following schema for storing a user's data that is > required as he logs in & is led to his homepage:- > (I aimed at a schema design such that through a single row read query > all the data that would be require

Is this a good schema design to implement a social application..

2011-01-07 Thread Raj
My question is in context of a social network schema design I am thinking of following schema for storing a user's data that is required as he logs in & is led to his homepage:- (I aimed at a schema design such that through a single row read query all the data that would be required to p

Re: SQL Server to Cassandra Schema Design - Ideas Anyone?

2010-06-24 Thread Jonathan Ellis
the main idea is "denormalize your data into multiple CFs at write time so that each CF lets you answer a query from a single row." http://www.rackspacecloud.com/blog/2010/05/12/cassandra-by-example/ is a good place to start. On Tue, Jun 22, 2010 at 3:43 PM, Craig Faulkner wrote: > I'm having a

SQL Server to Cassandra Schema Design - Ideas Anyone?

2010-06-22 Thread Craig Faulkner
I'm having a little block in converting an existing SQL Server schema that we have into Cassandra Keyspace(s). The whole key-value thing has just not clicked yet. Do any of you know of any good examples that are more complex than the example in the readme file? We are looking to report on web