Re: Important news about Cassandra Summit

2023-02-03 Thread Wei Deng
Thank you for the update, Patrick. Appreciate all the work your team, the
community and the Linux Foundation Events team have done. Looking forward
to the virtual event and Cassandra Day series.

> If you have open positions, call them out in this email thread or
#cassandra in the ASF slack.

Speaking about open positions, our team at DataStax is looking for a senior
software engineer to work on the open source CQL drivers. The detailed job
description can be found here: https://smrtr.io/cFnJS

Folks can apply using the "I'm Interested" link inside the webpage
mentioned above. Feel free to connect with me on LinkedIn (
https://www.linkedin.com/in/weideng/) if this role looks interesting, and
you want to know more about it.

-Wei

On Fri, Feb 3, 2023 at 2:31 PM Patrick McFadin  wrote:

>
>
>
>
>
>
> *Hello Cassandra Community,We all see what’s happening in tech right now.
> Cuts are being made, and budgets are frozen. For Cassandra Summit, this has
> translated to low sponsorship and registrations. The program committee has
> been discussing options with the Linux Foundation events team, and the
> decision was made to move Cassandra Summit to December 12-13. You’ll see
> something official from the Linux Foundation soon. This isn’t what anyone
> wanted. It’s a challenging time for our community to gather, and that’s
> entirely the point of a Cassandra Summit. Hopefully, this provides enough
> space to have the Summit we want and need. Between now and December, the
> DataStax community team is ramping up a plan B to keep up the project
> momentum during this downturn and facilitate community information sharing.
> Cassandra 5.0 is coming, and it’s going to be game-changing. No way we are
> waiting until December to talk about it! The plan is to have a virtual
> event (online) on March 14 and a series of city-specific Cassandra Days in
> the coming months. It’s hard for our community to get out, so we’ll come to
> you. More information will follow in the next few days. I want to reassure
> you this isn’t specific to our community. I’ve been hearing from many that
> you were trying anything to get to San Jose in March, but budgets wouldn’t
> allow for any non-essential travel. When I started hearing the same thing
> from speakers, then sponsors, I knew this was a large-scale problem. We all
> know people impacted by layoffs, and I’m sure many are personally affected.
> Let’s come together as a community and help each other. If you have open
> positions, call them out in this email thread or #cassandra in the ASF
> slack.I want to thank the Linux Foundation Events team personally. They are
> exceptional professionals and worked quickly to get us back on track. There
> was a rush of events trying to postpone to later in the year, but they were
> able to get us a new date. They are as protective of conference uptime like
> you are about database uptime. More info to follow. ThanksPatrick*
>


Re: Welcome Alexandre Dutra, Andrew Tolbert, Bret McGuire, Olivier Michallat as Cassandra Committers

2024-04-17 Thread Wei Deng
Congrats folks! This is amazing news!

On Wed, Apr 17, 2024 at 2:00 PM Patrick McFadin  wrote:

> Congratulations, everyone. I am loving this new direction for the project!
>
> On Wed, Apr 17, 2024 at 11:16 AM Yifan Cai  wrote:
>
>> Congrats all
>> --
>> *From:* Josh McKenzie 
>> *Sent:* Wednesday, April 17, 2024 11:05:29 AM
>> *To:* dev 
>> *Subject:* Re: Welcome Alexandre Dutra, Andrew Tolbert, Bret McGuire,
>> Olivier Michallat as Cassandra Committers
>>
>> Congrats everyone and thanks for all the hard work to get things to this
>> point!
>>
>> On Wed, Apr 17, 2024, at 1:18 PM, Ekaterina Dimitrova wrote:
>>
>> Congrats and thank you for all your work on the drivers!
>>
>> On Wed, 17 Apr 2024 at 13:17, Francisco Guerrero 
>> wrote:
>>
>> Congratulations everyone!
>>
>> On 2024/04/17 17:14:34 Abe Ratnofsky wrote:
>> > Congrats everyone!
>> >
>> > > On Apr 17, 2024, at 1:10 PM, Benjamin Lerer 
>> wrote:
>> > >
>> > > The Apache Cassandra PMC is pleased to announce that Alexandre Dutra,
>> Andrew Tolbert, Bret McGuire and Olivier Michallat have accepted the
>> invitation to become committers on the java driver sub-project.
>> > >
>> > > Thanks for your contributions to the Java driver during all those
>> years!
>> > > Congratulations and welcome!
>> > >
>> > > The Apache Cassandra PMC members
>> >
>> >
>>
>>
>>


Re: Welcome Yifan Cai as Cassandra committer

2020-12-23 Thread Wei Deng
Congrats Yifan! So happy to see you taking a bigger role in the community!

-Wei

On Mon, Dec 21, 2020 at 10:10 AM Benjamin Lerer 
wrote:

>  The PMC's members are pleased to announce that Yifan Cai has accepted the
> invitation to become committer last Friday.
>
> Thanks a lot, Yifan,  for everything you have done!
>
> Congratulations and welcome
>
> The Apache Cassandra PMC members
>


MAX_COMPACTING_L0, is it still important to enforce?

2016-08-18 Thread Wei Deng
I was digging into LCS code lately, and found the following comments (note
the last paragraph "that would be ideal, but we can't"):

"// The problem is that L0 has a much higher score (almost 250)
than L1 (11), so what we'll
// do is compact a batch of MAX_COMPACTING_L0 sstables with all 117
L1 sstables, and put the
// result (say, 120 sstables) in L1. Then we'll compact the next
batch of MAX_COMPACTING_L0,
// and so forth.  So we spend most of our i/o rewriting the L1 data
with each batch.
//
// If we could just do *all* L0 a single time with L1, that would
be ideal.  But we can't
// -- see the javadoc for MAX_COMPACTING_L0."

And then when I read the MAX_COMPACTING_L0 javadoc referenced above:

"/**
 * limit the number of L0 sstables we do at once, because compaction
bloom filter creation
 * uses a pessimistic estimate of how many keys overlap (none), so we
risk wasting memory
 * or even OOMing when compacting highly overlapping sstables
 */"

I'm starting to wonder if this is still a concern post C* 2.1 given that
we've implemented CASSANDRA-5906. Here is an excerpt from Jonathan's blog
post (
http://www.datastax.com/dev/blog/improving-compaction-in-cassandra-with-cardinality-estimation)
on what motivated 5906 to be implemented:

"Because bloom filters are not re-sizeable, we need to pre-allocate them at
the start of the compaction, but at the start of the compaction, we don’t
know how much the sstables being compacted overlap. Since bloom filter
performance deteriorates dramatically when over-filled, we allocate our
bloom filters to be large enough even if the sstables do not overlap at
all. Which means that if they do overlap (which they should if compaction
is doing a good job picking candidates), then we waste space — up to 100%
per sstable compacted."

Since we have 5906 to address this very issue for a few years, does it make
sense now to revisit MAX_COMPACTING_L0 choice (hard coded to 32) since the
"bloom filter wasting memory" concern is no longer there? I would imagine
this could have the potential of improving backlogged LCS behavior when we
have thousands of L0 SSTables.

Thanks.

-Wei