Re: [DISCUSS] Adding support for BETWEEN operator

2024-05-14 Thread Benjamin Lerer
Range restrictions (>, >=, =<, < and BETWEEN) do not work on UPDATEs. They
do work on DELETE because under the hood C* they get translated into range
tombstones.

Le mar. 14 mai 2024 à 02:44, David Capwell  a écrit :

> I would also include in UPDATE… but yeah, <3 BETWEEN and welcome this work.
>
> On May 13, 2024, at 7:40 AM, Patrick McFadin  wrote:
>
> This is a great feature addition to CQL! I get asked about it from time to
> time but then people figure out a workaround. It will be great to just have
> it available.
>
> And right on Simon! I think the only project I had as a high school senior
> was figuring out how many parties I could go to and still maintain a
> passing grade. Thanks for your work here.
>
> Patrick
>
> On Mon, May 13, 2024 at 1:35 AM Benjamin Lerer  wrote:
>
>> Hi everybody,
>>
>> Just raising awareness that Simon is working on adding support for the
>> BETWEEN operator in WHERE clauses (SELECT and DELETE) in CASSANDRA-19604.
>> We plan to add support for it in conditions in a separate patch.
>>
>> The patch is available.
>>
>> As a side note, Simon chose to do his highschool senior project
>> contributing to Apache Cassandra. This patch is his first contribution for
>> his senior project (his second feature contribution to Apache Cassandra).
>>
>>
>>
>


Re: [RESULT] [VOTE] Release Apache Cassandra 3.0.30

2024-05-14 Thread Christofer Dutz
Hi all,

As I have mentioned on many other projects mailing lists, that also had the 
habit of counting "implicit votes" ... at the ASF there is no such thing as an 
"implicit vote" on a release.

Feel free to add an "this email also counts as my +1" to the initial VOTE email 
or even better, cast an explicit vote. 

I personally have voted on many of my own RCs with -1 as I found things when 
executing the release checks after staging the RC.

Chris

On 2024/05/09 05:36:26 Justin Mclean wrote:
> Hi,
> 
> Correct as it didn't contain a +1 in it.
> 
> Justin
> 


Re: [DISCUSS] Adding support for BETWEEN operator

2024-05-14 Thread Jon Haddad
Is there a technical limitation that would prevent a range write that
functions the same way as a range tombstone, other than probably needing a
version bump of the storage format?


On Tue, May 14, 2024 at 12:03 AM Benjamin Lerer  wrote:

> Range restrictions (>, >=, =<, < and BETWEEN) do not work on UPDATEs. They
> do work on DELETE because under the hood C* they get translated into range
> tombstones.
>
> Le mar. 14 mai 2024 à 02:44, David Capwell  a écrit :
>
>> I would also include in UPDATE… but yeah, <3 BETWEEN and welcome this
>> work.
>>
>> On May 13, 2024, at 7:40 AM, Patrick McFadin  wrote:
>>
>> This is a great feature addition to CQL! I get asked about it from time
>> to time but then people figure out a workaround. It will be great to just
>> have it available.
>>
>> And right on Simon! I think the only project I had as a high school
>> senior was figuring out how many parties I could go to and still maintain a
>> passing grade. Thanks for your work here.
>>
>> Patrick
>>
>> On Mon, May 13, 2024 at 1:35 AM Benjamin Lerer  wrote:
>>
>>> Hi everybody,
>>>
>>> Just raising awareness that Simon is working on adding support for the
>>> BETWEEN operator in WHERE clauses (SELECT and DELETE) in CASSANDRA-19604.
>>> We plan to add support for it in conditions in a separate patch.
>>>
>>> The patch is available.
>>>
>>> As a side note, Simon chose to do his highschool senior project
>>> contributing to Apache Cassandra. This patch is his first contribution for
>>> his senior project (his second feature contribution to Apache Cassandra).
>>>
>>>
>>>
>>


Community over Code EU 2024: The countdown has started!

2024-05-14 Thread Ryan Skraba
[Note: You're receiving this email because you are subscribed to one
or more project dev@ mailing lists at the Apache Software Foundation.]

We are very close to Community Over Code EU -- check out the amazing
program and the special discounts that we have for you.

Special discounts

You still have the opportunity to secure your ticket for Community
Over Code EU. Explore the various options available, including the
regular pass, the committer and groups pass, and now introducing the
one-day pass tailored for locals in Bratislava.

We also have a special discount for you to attend both Community Over
Code and Berlin Buzzwords from June 9th to 11th. Visit our website to
find out more about this opportunity and contact te...@sg.com.mx to
get the discount code.

Take advantage of the discounts and register now!
https://eu.communityovercode.org/tickets/

Check out the full program!

This year Community Over Code Europe will bring to you three days of
keynotes and sessions that cover topics of interest for ASF projects
and the greater open source ecosystem including data engineering,
performance engineering, search, Internet of Things (IoT) as well as
sessions with tips and lessons learned on building a healthy open
source community.

Check out the program: https://eu.communityovercode.org/program/

Keynote speaker highlights for Community Over Code Europe include:

* Dirk-Willem Van Gulik, VP of Public Policy at the Apache Software
Foundation, will discuss the Cyber Resiliency Act and its impact on
open source (All your code belongs to Policy Makers, Politicians, and
the Law).

* Dr. Sherae Daniel will share the results of her study on the impact
of self-promotion for open source software developers (To Toot or not
to Toot, that is the question).

* Asim Hussain, Executive Director of the Green Software Foundation
will present a framework they have developed for quantifying the
environmental impact of software (Doing for Sustainability what Open
Source did for Software).

* Ruth Ikegah will  discuss the growth of the open source movement in
Africa (From Local Roots to Global Impact: Building an Inclusive Open
Source Community in Africa)

* A discussion panel on EU policies and regulations affecting
specialists working in Open Source Program Offices

Additional activities

* Poster sessions: We invite you to stop by our poster area and see if
the ideas presented ignite a conversation within your team.

* BOF time: Don't miss the opportunity to discuss in person with your
open source colleagues on your shared interests.

* Participants reception: At the end of the first day, we will have a
reception at the event venue. All participants are welcome to attend!

* Spontaneous talks: There is a dedicated room and social space for
having spontaneous talks and sessions. Get ready to share with your
peers.

* Lighting talks: At the end of the event we will have the awaited
Lighting talks, where every participant is welcome to share and
enlighten us.

Please remember:  If you haven't applied for the visa, we will provide
the necessary letter for the process. In the unfortunate case of a
visa rejection, your ticket will be reimbursed.

See you in Bratislava,

Community Over Code EU Team


Re: [DISCUSS] Adding support for BETWEEN operator

2024-05-14 Thread Benjamin Lerer
It should be like range tombstones ... in much worse ;-). A tombstone is a
simple marker (deleted). An update can be far more complex.

Le mar. 14 mai 2024 à 15:52, Jon Haddad  a écrit :

> Is there a technical limitation that would prevent a range write that
> functions the same way as a range tombstone, other than probably needing a
> version bump of the storage format?
>
>
> On Tue, May 14, 2024 at 12:03 AM Benjamin Lerer  wrote:
>
>> Range restrictions (>, >=, =<, < and BETWEEN) do not work on UPDATEs.
>> They do work on DELETE because under the hood C* they get translated into
>> range tombstones.
>>
>> Le mar. 14 mai 2024 à 02:44, David Capwell  a écrit :
>>
>>> I would also include in UPDATE… but yeah, <3 BETWEEN and welcome this
>>> work.
>>>
>>> On May 13, 2024, at 7:40 AM, Patrick McFadin  wrote:
>>>
>>> This is a great feature addition to CQL! I get asked about it from time
>>> to time but then people figure out a workaround. It will be great to just
>>> have it available.
>>>
>>> And right on Simon! I think the only project I had as a high school
>>> senior was figuring out how many parties I could go to and still maintain a
>>> passing grade. Thanks for your work here.
>>>
>>> Patrick
>>>
>>> On Mon, May 13, 2024 at 1:35 AM Benjamin Lerer 
>>> wrote:
>>>
 Hi everybody,

 Just raising awareness that Simon is working on adding support for the
 BETWEEN operator in WHERE clauses (SELECT and DELETE) in CASSANDRA-19604.
 We plan to add support for it in conditions in a separate patch.

 The patch is available.

 As a side note, Simon chose to do his highschool senior project
 contributing to Apache Cassandra. This patch is his first contribution for
 his senior project (his second feature contribution to Apache Cassandra).



>>>


Re: [DISCUSS] Adding support for BETWEEN operator

2024-05-14 Thread Bowen Song via dev

Ranged update sounds like a disaster for compaction and read performance.

Imagine compacting or reading some SSTables in which a large number of 
overlapping but non-identical ranges were updated with different values. 
It gives me a headache by just thinking about it.


Ranged delete is much simpler, because the "value" is the same tombstone 
marker, and it also is guaranteed to expire and disappear eventually, so 
the performance impact of dealing with them at read and compaction time 
doesn't suffer in the long term.



On 14/05/2024 16:59, Benjamin Lerer wrote:
It should be like range tombstones ... in much worse ;-). A tombstone 
is a simple marker (deleted). An update can be far more complex.


Le mar. 14 mai 2024 à 15:52, Jon Haddad  a écrit :

Is there a technical limitation that would prevent a range write
that functions the same way as a range tombstone, other than
probably needing a version bump of the storage format?


On Tue, May 14, 2024 at 12:03 AM Benjamin Lerer
 wrote:

Range restrictions (>, >=, =<, < and BETWEEN) do not work on
UPDATEs. They do work on DELETE because under the hood C* they
get translated into range tombstones.

Le mar. 14 mai 2024 à 02:44, David Capwell
 a écrit :

I would also include in UPDATE… but yeah, <3 BETWEEN and
welcome this work.


On May 13, 2024, at 7:40 AM, Patrick McFadin
 wrote:

This is a great feature addition to CQL! I get
asked about it from time to time but then people figure
out a workaround. It will be great to just have it
available.

And right on Simon! I think the only project I had as a
high school senior was figuring out how many parties I
could go to and still maintain a passing grade. Thanks
for your work here.

Patrick

On Mon, May 13, 2024 at 1:35 AM Benjamin Lerer
 wrote:

Hi everybody,

Just raising awareness that Simon is working on
adding support for the BETWEEN operator in WHERE
clauses (SELECT and DELETE) in CASSANDRA-19604. We
plan to add support for it in conditions in a
separate patch.

The patch is available.

As a side note, Simon chose to do his highschool
senior project contributing to Apache Cassandra. This
patch is his first contribution for his senior
project (his second feature contribution to Apache
Cassandra).




Re: Is there appetite to maintain the gocql driver (in the drivers subproject) ?

2024-05-14 Thread Mick Semb Wever
Ok, so we're got confidence now on how to approach this, confirmation from
the project's maintainers supporting it, and interest from a handful of
people interested in maintaining and contributing to the project.

The proposed plan forward is…

We will go through a round of collecting CLAs along with agreements to
donate work to ASF from all gocql authors, over email and LinkedIn searches
and messages.  We will also open a github issue on the gocql project
describing the steps involved and mentioning all the authors.  A response
on the GH issue from everyone agreeing to the donation is the best single
place to collect the responses from everyone, but we'll accept and work
with whatever/however we get them.   These authors will also need to sign
this ICLA and submit it to the ASF.

After a four week grace period we will move ahead with the IP donation to
ASF, and make a list of all work (files) that we don't have CLAs for.  Such
work may remain with headers honouring their past MIT license and authors.
When the work is accepted and brought into the Cassandra Driver subproject
we will be looking to add committers to the subproject.  These may or may
not be people who have expressed interest in further contributing to the
codebase, but rather people we trust regardless when/if they come back to
contribute.

Does anyone call out the need for a new CEP for bringing the gocql into the
Driver's subproject ?
My suggestion is that this falls under CEP-8, even if it is not DataStax
donating this particular codebase, the process is largely the same and it
is the Drivers subproject receiving it.



On Mon, 15 Apr 2024 at 12:31, Mick Semb Wever  wrote:

>
>
>
> We can open an issue with LEGAL to see what they say at least?
>>>
>>
>>
>> I will raise a LEGAL ticket.
>>
>> My question here is whether we have to go through the process of
>> best-efforts to get approval to donate (transfer copyright), or whether we
>> can honour the copyright on the prior work and move forward ( by
>> referencing it in our NOTICE.txt, as per
>> https://infra.apache.org/licensing-howto.html )
>>
>
>
> https://issues.apache.org/jira/browse/LEGAL-674
>


Re: [DISCUSS] Adding support for BETWEEN operator

2024-05-14 Thread Jon Haddad
Personally, I don't think that something being scary at first glance is a
good reason not to explore an idea.  The scenario you've described here is
tricky but I'm not expecting it to be any worse than say, SAI, which (the
last I checked) has O(N) complexity on returning result sets with regard to
rows returned.  We've also merged in Vector search which has O(N) overhead
with the number of SSTables.  We're still fundamentally looking at, in most
cases, a limited number of SSTables and some merging of values.

Write updates are essentially a timestamped mask, potentially overlapping,
and I suspect potentially resolvable during compaction by propagating the
values.  They could be eliminated or narrowed based on how they've
propagated by using the timestamp metadata on the SSTable.

It would be a lot more constructive to apply our brains towards solving an
interesting problem than pointing out all its potential flaws based on gut
feelings.  We haven't even moved this past an idea.

I think it would solve a massive problem for a lot of people and is 100%
worth considering.  Thanks Patrick and David for raising this.

Jon



On Tue, May 14, 2024 at 9:48 AM Bowen Song via dev 
wrote:

> Ranged update sounds like a disaster for compaction and read performance.
>
> Imagine compacting or reading some SSTables in which a large number of
> overlapping but non-identical ranges were updated with different values. It
> gives me a headache by just thinking about it.
>
> Ranged delete is much simpler, because the "value" is the same tombstone
> marker, and it also is guaranteed to expire and disappear eventually, so
> the performance impact of dealing with them at read and compaction time
> doesn't suffer in the long term.
>
> On 14/05/2024 16:59, Benjamin Lerer wrote:
>
> It should be like range tombstones ... in much worse ;-). A tombstone is a
> simple marker (deleted). An update can be far more complex.
>
> Le mar. 14 mai 2024 à 15:52, Jon Haddad  a écrit :
>
>> Is there a technical limitation that would prevent a range write that
>> functions the same way as a range tombstone, other than probably needing a
>> version bump of the storage format?
>>
>>
>> On Tue, May 14, 2024 at 12:03 AM Benjamin Lerer 
>> wrote:
>>
>>> Range restrictions (>, >=, =<, < and BETWEEN) do not work on UPDATEs.
>>> They do work on DELETE because under the hood C* they get translated into
>>> range tombstones.
>>>
>>> Le mar. 14 mai 2024 à 02:44, David Capwell  a
>>> écrit :
>>>
 I would also include in UPDATE… but yeah, <3 BETWEEN and welcome this
 work.

 On May 13, 2024, at 7:40 AM, Patrick McFadin 
 wrote:

 This is a great feature addition to CQL! I get asked about it from time
 to time but then people figure out a workaround. It will be great to just
 have it available.

 And right on Simon! I think the only project I had as a high school
 senior was figuring out how many parties I could go to and still maintain a
 passing grade. Thanks for your work here.

 Patrick

 On Mon, May 13, 2024 at 1:35 AM Benjamin Lerer 
 wrote:

> Hi everybody,
>
> Just raising awareness that Simon is working on adding support for the
> BETWEEN operator in WHERE clauses (SELECT and DELETE) in CASSANDRA-19604.
> We plan to add support for it in conditions in a separate patch.
>
> The patch is available.
>
> As a side note, Simon chose to do his highschool senior project
> contributing to Apache Cassandra. This patch is his first contribution for
> his senior project (his second feature contribution to Apache Cassandra).
>
>
>



Re: [DISCUSS] Adding support for BETWEEN operator

2024-05-14 Thread Benjamin Lerer
>
> It would be a lot more constructive to apply our brains towards solving an
> interesting problem than pointing out all its potential flaws based on gut
> feelings.


It is not simply a gut feeling, Jon. This change impacts read, write,
indexing, storage, compaction, repair... The risk and cost associated with
it are pretty significant and I am not convinced at this point of its
benefit.

Le mar. 14 mai 2024 à 19:05, Jon Haddad  a écrit :

> Personally, I don't think that something being scary at first glance is a
> good reason not to explore an idea.  The scenario you've described here is
> tricky but I'm not expecting it to be any worse than say, SAI, which (the
> last I checked) has O(N) complexity on returning result sets with regard to
> rows returned.  We've also merged in Vector search which has O(N) overhead
> with the number of SSTables.  We're still fundamentally looking at, in most
> cases, a limited number of SSTables and some merging of values.
>
> Write updates are essentially a timestamped mask, potentially overlapping,
> and I suspect potentially resolvable during compaction by propagating the
> values.  They could be eliminated or narrowed based on how they've
> propagated by using the timestamp metadata on the SSTable.
>
> It would be a lot more constructive to apply our brains towards solving an
> interesting problem than pointing out all its potential flaws based on gut
> feelings.  We haven't even moved this past an idea.
>
> I think it would solve a massive problem for a lot of people and is 100%
> worth considering.  Thanks Patrick and David for raising this.
>
> Jon
>
>
>
> On Tue, May 14, 2024 at 9:48 AM Bowen Song via dev <
> dev@cassandra.apache.org> wrote:
>
>> Ranged update sounds like a disaster for compaction and read performance.
>>
>> Imagine compacting or reading some SSTables in which a large number of
>> overlapping but non-identical ranges were updated with different values. It
>> gives me a headache by just thinking about it.
>>
>> Ranged delete is much simpler, because the "value" is the same tombstone
>> marker, and it also is guaranteed to expire and disappear eventually, so
>> the performance impact of dealing with them at read and compaction time
>> doesn't suffer in the long term.
>>
>> On 14/05/2024 16:59, Benjamin Lerer wrote:
>>
>> It should be like range tombstones ... in much worse ;-). A tombstone is
>> a simple marker (deleted). An update can be far more complex.
>>
>> Le mar. 14 mai 2024 à 15:52, Jon Haddad  a écrit :
>>
>>> Is there a technical limitation that would prevent a range write that
>>> functions the same way as a range tombstone, other than probably needing a
>>> version bump of the storage format?
>>>
>>>
>>> On Tue, May 14, 2024 at 12:03 AM Benjamin Lerer 
>>> wrote:
>>>
 Range restrictions (>, >=, =<, < and BETWEEN) do not work on UPDATEs.
 They do work on DELETE because under the hood C* they get translated into
 range tombstones.

 Le mar. 14 mai 2024 à 02:44, David Capwell  a
 écrit :

> I would also include in UPDATE… but yeah, <3 BETWEEN and welcome this
> work.
>
> On May 13, 2024, at 7:40 AM, Patrick McFadin 
> wrote:
>
> This is a great feature addition to CQL! I get asked about it from
> time to time but then people figure out a workaround. It will be great to
> just have it available.
>
> And right on Simon! I think the only project I had as a high school
> senior was figuring out how many parties I could go to and still maintain 
> a
> passing grade. Thanks for your work here.
>
> Patrick
>
> On Mon, May 13, 2024 at 1:35 AM Benjamin Lerer 
> wrote:
>
>> Hi everybody,
>>
>> Just raising awareness that Simon is working on adding support for
>> the BETWEEN operator in WHERE clauses (SELECT and DELETE) in
>> CASSANDRA-19604. We plan to add support for it in conditions in a 
>> separate
>> patch.
>>
>> The patch is available.
>>
>> As a side note, Simon chose to do his highschool senior project
>> contributing to Apache Cassandra. This patch is his first contribution 
>> for
>> his senior project (his second feature contribution to Apache Cassandra).
>>
>>
>>
>


Re: Is there appetite to maintain the gocql driver (in the drivers subproject) ?

2024-05-14 Thread Josh McKenzie
Does anyone call out the need for a new CEP for bringing the gocql into the 
Driver's subproject ? 
> My suggestion is that this falls under CEP-8, even if it is not DataStax 
> donating this particular codebase, the process is largely the same and it is 
> the Drivers subproject receiving it.
+1 to future driver donations falling under CEP-8

On Tue, May 14, 2024, at 1:03 PM, Mick Semb Wever wrote:
> 
> Ok, so we're got confidence now on how to approach this, confirmation from 
> the project's maintainers supporting it, and interest from a handful of 
> people interested in maintaining and contributing to the project.
> 
> The proposed plan forward is…
> 
> We will go through a round of collecting CLAs along with agreements to donate 
> work to ASF from all gocql authors, over email and LinkedIn searches and 
> messages.  We will also open a github issue on the gocql project describing 
> the steps involved and mentioning all the authors.  A response on the GH 
> issue from everyone agreeing to the donation is the best single place to 
> collect the responses from everyone, but we'll accept and work with 
> whatever/however we get them.   These authors will also need to sign this 
> ICLA and submit it to the ASF.
> 
> After a four week grace period we will move ahead with the IP donation to 
> ASF, and make a list of all work (files) that we don't have CLAs for.  Such 
> work may remain with headers honouring their past MIT license and authors.  
> When the work is accepted and brought into the Cassandra Driver subproject we 
> will be looking to add committers to the subproject.  These may or may not be 
> people who have expressed interest in further contributing to the codebase, 
> but rather people we trust regardless when/if they come back to contribute.
> 
> Does anyone call out the need for a new CEP for bringing the gocql into the 
> Driver's subproject ? 
> My suggestion is that this falls under CEP-8, even if it is not DataStax 
> donating this particular codebase, the process is largely the same and it is 
> the Drivers subproject receiving it.
> 
>  
> 
> On Mon, 15 Apr 2024 at 12:31, Mick Semb Wever  wrote:
>> 
>>
>> 
 We can open an issue with LEGAL to see what they say at least?
>>> 
>>> 
>>> I will raise a LEGAL ticket.
>>> 
>>> My question here is whether we have to go through the process of 
>>> best-efforts to get approval to donate (transfer copyright), or whether we 
>>> can honour the copyright on the prior work and move forward ( by 
>>> referencing it in our NOTICE.txt, as per 
>>> https://infra.apache.org/licensing-howto.html )
>> 
>> 
>> https://issues.apache.org/jira/browse/LEGAL-674  


Re: Is there appetite to maintain the gocql driver (in the drivers subproject) ?

2024-05-14 Thread Dinesh Joshi
On Tue, May 14, 2024 at 10:05 AM Mick Semb Wever  wrote:

>
> Ok, so we're got confidence now on how to approach this, confirmation from
> the project's maintainers supporting it, and interest from a handful of
> people interested in maintaining and contributing to the project.
>

Did you talk to the current maintainers off list or did I miss some thread
where the maintainers indicated their support in maintaining this project?