Re: #3665 / new test target

2012-01-30 Thread Sylvain Lebresne
On Fri, Jan 27, 2012 at 9:08 PM, Eric Evans  wrote:
> Hey all,
>
> In #3665 (https://issues.apache.org/jira/browse/CASSANDRA-3665) I
> added a new test target to ant called test-clientutil-jar.  The
> purpose of this test is to exercise apache-cassandra-clientutil.jar
> using only the dependencies that we expect (commons-lang and guava);
> If a new dependency sneaks its way into that code, then the test
> should produce an error.
>
> I didn't include this in the main test target (actually, the test
> itself is probably included by virtue of where it is in-tree), because
> it is kind of a special-case target (a failure here might mean
> something different), and because it invokes the jar target; We can,
> but I wanted to see what others thought first (those targets are
> already a little unwieldy).
>
> Should we add this to run as part of "test", or leave it to be
> configured separately by CI systems?

I wouldn't mind keeping it out of 'ant test' but adding a new 'ant
test-all' target that would run test, test-compression, long-test and
this could be handy.

--
Sylvain


Re: #3665 / new test target

2012-01-30 Thread Jonathan Ellis
+1

On Mon, Jan 30, 2012 at 8:54 AM, Sylvain Lebresne  wrote:
> On Fri, Jan 27, 2012 at 9:08 PM, Eric Evans  wrote:
>> Hey all,
>>
>> In #3665 (https://issues.apache.org/jira/browse/CASSANDRA-3665) I
>> added a new test target to ant called test-clientutil-jar.  The
>> purpose of this test is to exercise apache-cassandra-clientutil.jar
>> using only the dependencies that we expect (commons-lang and guava);
>> If a new dependency sneaks its way into that code, then the test
>> should produce an error.
>>
>> I didn't include this in the main test target (actually, the test
>> itself is probably included by virtue of where it is in-tree), because
>> it is kind of a special-case target (a failure here might mean
>> something different), and because it invokes the jar target; We can,
>> but I wanted to see what others thought first (those targets are
>> already a little unwieldy).
>>
>> Should we add this to run as part of "test", or leave it to be
>> configured separately by CI systems?
>
> I wouldn't mind keeping it out of 'ant test' but adding a new 'ant
> test-all' target that would run test, test-compression, long-test and
> this could be handy.
>
> --
> Sylvain



-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of DataStax, the source for professional Cassandra support
http://www.datastax.com


extra diffs showing up in update column family

2012-01-30 Thread Dave Brosius
If a user specifies a Comparator in an update column family (as was from 
a irc user), as


update column family report_by_account_content with comparator=UTF8Type 
and column_metadata = [{ column_name:'meta:account-id', 
validation_class:UTF8Type,index_type:KEYS},{ 
column_name:'meta:filter-hash', validation_class:UTF8Type,index_type:KEYS}];



The comparator value is seen as different because the original 
comparator was the fully qualified name of the 
org.apache.cassandra.db.marshal.UTF8Type, and new one is what is passed 
in UTF8Type. So CFMetaData.diff sees this as a change and does extra 
work because of it.


I'm guessing there are other class name values where this holds true as 
well.


Is this a big enough concern to address?

thanks
dave




understanding cassandra internal

2012-01-30 Thread Thanh Do
hi all,

I would like to study the internal code
of cassandra. The website (wiki)
provides limited documentation.

Is there any way (documents, blogs)
that mention in details about how
cassandra internally works?
Is there a fast way beside
walking through the code
and reason about how it works.

many thanks,
Thanh


Re: extra diffs showing up in update column family

2012-01-30 Thread aaron morton
Can you raise a ticket at https://issues.apache.org/jira/browse/CASSANDRA with 
steps to reproduce. 

Thanks
p.s. the user list is the appropriate list for emails like this. 


-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 31/01/2012, at 9:31 AM, Dave Brosius wrote:

> If a user specifies a Comparator in an update column family (as was from a 
> irc user), as
> 
> update column family report_by_account_content with comparator=UTF8Type and 
> column_metadata = [{ column_name:'meta:account-id', 
> validation_class:UTF8Type,index_type:KEYS},{ column_name:'meta:filter-hash', 
> validation_class:UTF8Type,index_type:KEYS}];
> 
> 
> The comparator value is seen as different because the original comparator was 
> the fully qualified name of the org.apache.cassandra.db.marshal.UTF8Type, and 
> new one is what is passed in UTF8Type. So CFMetaData.diff sees this as a 
> change and does extra work because of it.
> 
> I'm guessing there are other class name values where this holds true as well.
> 
> Is this a big enough concern to address?
> 
> thanks
> dave
> 
> 



Re: understanding cassandra internal

2012-01-30 Thread aaron morton
The code is where it's at, and...

http://www.datastax.com/2011/08/video-cassandra-internals-presentation-from-cassandra-sf-2011

http://wiki.apache.org/cassandra
http://planetcassandra.org/
http://www.datastax.com/docs/1.0/index

Cheers

-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 31/01/2012, at 12:07 PM, Thanh Do wrote:

> hi all,
> 
> I would like to study the internal code
> of cassandra. The website (wiki)
> provides limited documentation.
> 
> Is there any way (documents, blogs)
> that mention in details about how
> cassandra internally works?
> Is there a fast way beside
> walking through the code
> and reason about how it works.
> 
> many thanks,
> Thanh



Re: extra diffs showing up in update column family

2012-01-30 Thread Brandon Williams
On Mon, Jan 30, 2012 at 6:36 PM, aaron morton  wrote:
> p.s. the user list is the appropriate list for emails like this.

I disagree, this was on-topic for dev@ imho.

-Brandon


Re: extra diffs showing up in update column family

2012-01-30 Thread aaron morton
Sorry, i thought bug reports went to the user list. 

A

-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com

On 31/01/2012, at 1:39 PM, Brandon Williams wrote:

> On Mon, Jan 30, 2012 at 6:36 PM, aaron morton  wrote:
>> p.s. the user list is the appropriate list for emails like this.
> 
> I disagree, this was on-topic for dev@ imho.
> 
> -Brandon



Re: extra diffs showing up in update column family

2012-01-30 Thread Brandon Williams
On Mon, Jan 30, 2012 at 6:42 PM, aaron morton  wrote:
> Sorry, i thought bug reports went to the user list.

Well, if it was just "I have this exception" then sure, but Dave has
done quite a few patches and dug into the problem pretty specifically
and was asking if it's a big enough concern to bother with, so I don't
see a problem with dev@

-Brandon


Cassandra and SQL DB Integration?

2012-01-30 Thread Krassimir Kostov

Hello All,
 
I posted these questions to the Users Mailing List, but perhaps they may be 
more suitable for the Developers Mailing List.
 
Background: I am working on a project, for which I have to evaluate and 
recommend the implementation of a new storage DBMS, which will interact heavily 
with the following 3 SQL DB based environments:
  
(1) A data mining application (IBM SPSS Modeler) that imports/exports data 
from/to an SQL DB
(2) A partner platform, based on an Oracle DB (CSV data import/export) 
(3) Various client SQL DBs, whose data elements will be replicated and uploaded 
in the recommended DBMS
 
I am planning to recommend the implementation of Cassandra, hosted on Amazon 
Elastic Compute Cloud (Amazon EC2), primarily due to its scalability and the 
"DB as a Service" provided by the host.  Probably, the biggest challenge in 
terms of development and data model architecture will come from 3 above, since 
we need to custom-build and replicate each client's SQL data schema before 
implementing it in Cassandra, thus changing the data model from SQL to NoSQL.
 
With the above 3 requirements in mind, from your experience: 
 
(1) Is it a good idea to use Cassandra as a storage solution for SQL data, 
converted to the NoSQL data model just to be stored on Cassandra?
(2) Do you know of any similar cases of using Cassandra as a storage, which 
supports SQL data applications, or perhaps data model architecture differences 
and high development costs make no sense for this?
(3) If using Cassandra as a storage, which supports SQL data applications, is 
not a good idea, could you recommend an alternative SQL cloud DB solution that 
has good scalability?
  
Thanks and regards, 
 
Krassimir Kostov  


Re: 1.1 freeze approaching

2012-01-30 Thread Jonathan Ellis
I've created a 1.1 branch in git.  The freeze is on!

On Tue, Jan 10, 2012 at 2:59 PM, Jonathan Ellis  wrote:
> I've tagged 7 tickets as "critical" [1] for 1.1.  All of them deal
> with CQL; I strongly believe that 1.1 needs to be where CQL goes from
> being "the future" to being "the present."  We've been promising this
> for almost a year now and it's time to deliver.
>
> All of these (with the exception of 3707, which is relatively quick)
> are in progress, and some (2475, 1391) are almost complete, but I
> don't think we're going to get all of them done by the 18th.
>
> Thus, I'd like to propopose postponing the freeze one more week, until the 
> 25th.
>
> [1] 
> https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&jqlQuery=project+%3D+CASSANDRA+AND+fixVersion+%3D+%221.1%22+AND+resolution+%3D+Unresolved+ORDER+BY+due+ASC%2C+priority+DESC%2C+created+ASC
>
> On Mon, Dec 19, 2011 at 11:56 AM, Jonathan Ellis  wrote:
>> Just a reminder that for us to meet our four-month major release
>> schedule (i.e., 1.1 = Feb 18), we need to code freeze on Jan 18, which
>> is just about a month away on the calendar but significantly closer in
>> terms of man-hours as people take holiday vacations.
>>
>> I've taken a first stab at moving issues to 1.2 that were tagged 1.1
>> but I'm pretty sure won't be done in time, with 34 issues remaining in
>> 1.1, which is fairly optimistic [1].  Please be aware of the timeline
>> if you are working on or reviewing one of these tickets, or if you are
>> planning to volunteer for one before the freeze.
>>
>> [1] 
>> https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&jqlQuery=project+%3D+CASSANDRA+AND+fixVersion+%3D+%221.1%22+AND+resolution+%3D+Unresolved+ORDER+BY+due+ASC%2C+priority+DESC%2C+created+ASC&mode=hide
>>
>> --
>> Jonathan Ellis
>> Project Chair, Apache Cassandra
>> co-founder of DataStax, the source for professional Cassandra support
>> http://www.datastax.com
>
>
>
> --
> Jonathan Ellis
> Project Chair, Apache Cassandra
> co-founder of DataStax, the source for professional Cassandra support
> http://www.datastax.com



-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of DataStax, the source for professional Cassandra support
http://www.datastax.com


Re: Cassandra and SQL DB Integration?

2012-01-30 Thread Himanshu Kumar Das
Hey Even I am also have same question can anybody answer to Krassimir Kostov

thnks

regards
Himanshu K Das

2012/1/31 Krassimir Kostov 

>
> Hello All,
>
> I posted these questions to the Users Mailing List, but perhaps they may
> be more suitable for the Developers Mailing List.
>
> Background: I am working on a project, for which I have to evaluate and
> recommend the implementation of a new storage DBMS, which will interact
> heavily with the following 3 SQL DB based environments:
>
> (1) A data mining application (IBM SPSS Modeler) that imports/exports data
> from/to an SQL DB
> (2) A partner platform, based on an Oracle DB (CSV data import/export)
> (3) Various client SQL DBs, whose data elements will be replicated and
> uploaded in the recommended DBMS
>
> I am planning to recommend the implementation of Cassandra, hosted on
> Amazon Elastic Compute Cloud (Amazon EC2), primarily due to its scalability
> and the "DB as a Service" provided by the host.  Probably, the biggest
> challenge in terms of development and data model architecture will come
> from 3 above, since we need to custom-build and replicate each client's SQL
> data schema before implementing it in Cassandra, thus changing the data
> model from SQL to NoSQL.
>
> With the above 3 requirements in mind, from your experience:
>
> (1) Is it a good idea to use Cassandra as a storage solution for SQL data,
> converted to the NoSQL data model just to be stored on Cassandra?
> (2) Do you know of any similar cases of using Cassandra as a storage,
> which supports SQL data applications, or perhaps data model architecture
> differences and high development costs make no sense for this?
> (3) If using Cassandra as a storage, which supports SQL data applications,
> is not a good idea, could you recommend an alternative SQL cloud DB
> solution that has good scalability?
>
> Thanks and regards,
>
> Krassimir Kostov
>