Materialized Views during node replace

2021-06-04 Thread Shaurya Gupta
Hi

We recently replaced a down node containing 90 GB of data. The streaming
process completed within 1 hour as we could see a sufficient amount of data
being streamed on the data disk but the node remained in UJ state and it
took 7 more hours to move to UN state.

We could not figure out anything from system logs. Our DB schema contains 3
materialized views over the main table, could that be the reason for this?
If yes how can we be sure of it and if not then how do we figure out what
went wrong ?

Thanks!
-- 
Shaurya Gupta


[DISCUSSION] Should we mark DROP COMPACT STORAGE as experimental

2021-06-04 Thread Benjamin Lerer
Hi everybody,

There are a significant amount of issues with DROP COMPACT STORAGE that can
be pretty surprising for users.
To name a few:
* Some hidden columns will show up changing the resultset returned for
wildcard queries
* As COMPACT tables did not have primary key liveness there empty rows
inserted AFTER the ALTER will be returned whereas the one inserted before
the ALTER will not.
* Also due to the lack of primary key liveness the amount of SSTables being
read will increase resulting in slower queries
* After DROP COMPACT it becomes possible to ALTER the table in a way that
makes all the row disappears
* There is a loss of functionality around null clustering when dropping
compact storage (CASSANDRA-16069)

In my opinion DROP COMPACT STORAGE is not ready for production use unless
users fully understand what they are doing.
By consequence, I am wondering if we should not mark it as experimental as
we did for the Materialized Views (CASSANDRA-13959).

What is your opinion?


Re: Materialized Views during node replace

2021-06-04 Thread Erick Ramirez
I can't recall if view builds affect the bootstrap but index builds
definitely do. It's a bit late now but you could've run nodetool
compactionstats to see what tasks were running at the time and those need
to complete before the bootstrap is considered complete for the node to go
into UP/NORMAL status. Cheers!


Re: Obfuscation of passwords in audit loging, in or not in 4.0?

2021-06-04 Thread Sam Tunnicliffe


> On 4 Jun 2021, at 03:44, Jonathan Koppenhofer  wrote:
> 
> +1 to this being a serious bug. As a large user, if we used internal
> passwords, this would completely prevent me from using Cassandra native
> audit log capabilities. Disabling DCL is not a great option, as DCL is
> probably the most needed auditable event.
> 
> If this is on by default (not sure of default settings) I also assume this
> would be classified as an immediate CVE... Right?

+1, I would think so too. 

Shipping a brand new, non-experimental feature with a security hole like this 
feels
counter to our goal of releases being prod ready in .0, so I'm +1 on including 
it in 
an rc/ga


> I don't directly
> contribute, so I can't talk too much, but I can't see how 4.0.0 could go GA
> with this.
> 
> 
> On Thu, Jun 3, 2021, 7:24 PM Sumanth Pasupuleti <
> sumanth.pasupuleti...@gmail.com> wrote:
> 
>>> I am on the side of "this sounds like a really bad bug" for the audit
>> pieces, maybe less so than FQL. Anyone using audit for real probably has
>> meaningful audit requirements, which means they're in an industry where
>> they get audited for security, which means logging passwords is a big deal.
>> 
>> +1. Given we are shipping audit logging feature for the first time with
>> 4.0, it would be great if this rather low complex patch can be included in
>> the 4.0 RC and thereby ship a "complete feature".
>> 
>> On Thu, Jun 3, 2021 at 4:04 PM Vinay Chella 
>> wrote:
>> 
 I think it can be argued that this is a pretty serious bug for a newly
>>> introduced feature, and qualifies for inclusion in an RC, but I don’t
>>> personally have a strong opinion on if this should happen.
>>>   +1
>>> 
 One more point - if we keep the workaround, that should be documented
>>> with
 big red letters for the users.
>>> 
>>> Yes, heavy +1, if we are not merging it. Another idea, if we are not
>>> merging this in, is to put DCL(CREATE ROLE/USER, ALTER ROLE/USER etc.,)
>>> queries in the default configuration (cassandra.yaml) exclude list to
>> avoid
>>> oops for operators, since that is the only query type that log passwords
>> in
>>> plain text and all other places they are not.
>>> 
>>> 
>>> Thanks,
>>> Vinay
>>> 
>>> 
>>> On Thu, Jun 3, 2021 at 3:58 PM bened...@apache.org 
>>> wrote:
>>> 
 I think it can be argued that this is a pretty serious bug for a newly
 introduced feature, and qualifies for inclusion in an RC, but I don’t
 personally have a strong opinion on if this should happen.
 
 I can’t imagine how this would be an _exception_ for inclusion in 4.0.1
 though.
 
 From: Mick Semb Wever 
 Date: Thursday, 3 June 2021 at 22:45
 To: dev@cassandra.apache.org 
 Subject: Re: Obfuscation of passwords in audit loging, in or not in
>> 4.0?
 Thanks for raising this Stefan.
 
 
 
> While I humbly think this is 4.0-worthy, the process we have, as far
> as I know, is that there should be only critical fixes in 4.0 so I
> guess this will go to 4.0.1, right? Or does this qualify to go to 4.0
> still?
> 
 
 
 I believe the question here is whether this patch is worthy of an
>>> exception
 to go to 4.0.x. (i.e. 4.0.1)
 At this point in time all improvements would be by default slated for
>> 4.x
 (i.e. 4.1)
 
 It does not qualify as a RC critical bug for 4.0.0.
 
 Looking at the patch it is simple, and one could almost consider it a
 security fix on a new 4.0 feature, so I'd say it's a valid exception
>> for
 4.0.x.
 Keen to hear what others think. And how we should go about requesting
>>> such
 exceptions for non-bugs during each annual release cycle.
 
>>> 
>> 


-
To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
For additional commands, e-mail: dev-h...@cassandra.apache.org



Re: [DISCUSSION] Should we mark DROP COMPACT STORAGE as experimental

2021-06-04 Thread bened...@apache.org
This seems reasonable to me, but it raises a question of roadmap. My 
understanding is that we are deprecating compact storage, and will remove it in 
a future release (or have already partially removed it? I forget). Do these 
issues then constitute a blocking issue for GA, or do we modify our roadmap, or 
do we stipulate that users must upgrade to a future patch version of 4.0 before 
going to 4.next/5.0?


From: Benjamin Lerer 
Date: Friday, 4 June 2021 at 09:53
To: dev@cassandra.apache.org 
Subject: [DISCUSSION] Should we mark DROP COMPACT STORAGE as experimental
Hi everybody,

There are a significant amount of issues with DROP COMPACT STORAGE that can
be pretty surprising for users.
To name a few:
* Some hidden columns will show up changing the resultset returned for
wildcard queries
* As COMPACT tables did not have primary key liveness there empty rows
inserted AFTER the ALTER will be returned whereas the one inserted before
the ALTER will not.
* Also due to the lack of primary key liveness the amount of SSTables being
read will increase resulting in slower queries
* After DROP COMPACT it becomes possible to ALTER the table in a way that
makes all the row disappears
* There is a loss of functionality around null clustering when dropping
compact storage (CASSANDRA-16069)

In my opinion DROP COMPACT STORAGE is not ready for production use unless
users fully understand what they are doing.
By consequence, I am wondering if we should not mark it as experimental as
we did for the Materialized Views (CASSANDRA-13959).

What is your opinion?


Re: Materialized Views during node replace

2021-06-04 Thread Shaurya Gupta
Thanks!

Why are MVs being treated differently here ?
While node replace was ongoing sstables for MV also got streamed. Is
rebuilding the MV required to ensure consistency between main table and MVs
on the replaced node ?

On Fri, Jun 4, 2021 at 2:25 PM Erick Ramirez 
wrote:

> I can't recall if view builds affect the bootstrap but index builds
> definitely do. It's a bit late now but you could've run nodetool
> compactionstats to see what tasks were running at the time and those need
> to complete before the bootstrap is considered complete for the node to go
> into UP/NORMAL status. Cheers!
>


-- 
Shaurya Gupta


Re: Obfuscation of passwords in audit loging, in or not in 4.0?

2021-06-04 Thread Brandon Williams
On Fri, Jun 4, 2021 at 4:32 AM Sam Tunnicliffe  wrote:
> Shipping a brand new, non-experimental feature with a security hole like this 
> feels
> counter to our goal of releases being prod ready in .0, so I'm +1 on 
> including it in
> an rc/ga

I think I have to agree here.  We can ship a complete feature, we can
remove it and not ship it, but what is not acceptable is shipping it
in a broken and potentially dangerous state.

-
To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
For additional commands, e-mail: dev-h...@cassandra.apache.org



Re: Obfuscation of passwords in audit loging, in or not in 4.0?

2021-06-04 Thread Ekaterina Dimitrova
+1, please, reclassify it as a bug.
Thank you Stefan

On Fri, 4 Jun 2021 at 9:13, Brandon Williams  wrote:

> On Fri, Jun 4, 2021 at 4:32 AM Sam Tunnicliffe  wrote:
> > Shipping a brand new, non-experimental feature with a security hole like
> this feels
> > counter to our goal of releases being prod ready in .0, so I'm +1 on
> including it in
> > an rc/ga
>
> I think I have to agree here.  We can ship a complete feature, we can
> remove it and not ship it, but what is not acceptable is shipping it
> in a broken and potentially dangerous state.
>
> -
> To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
> For additional commands, e-mail: dev-h...@cassandra.apache.org
>
>


Re: [DISCUSSION] Should we mark DROP COMPACT STORAGE as experimental

2021-06-04 Thread Ekaterina Dimitrova
+1, thank you Benjamin! There is already a warning in the docs but marking
it as experimental sounds more reasonable at this point. Thank you for all
your investigations and work done


On Fri, 4 Jun 2021 at 5:43, bened...@apache.org  wrote:

> This seems reasonable to me, but it raises a question of roadmap. My
> understanding is that we are deprecating compact storage, and will remove
> it in a future release (or have already partially removed it? I forget). Do
> these issues then constitute a blocking issue for GA, or do we modify our
> roadmap, or do we stipulate that users must upgrade to a future patch
> version of 4.0 before going to 4.next/5.0?
>
>
> From: Benjamin Lerer 
> Date: Friday, 4 June 2021 at 09:53
> To: dev@cassandra.apache.org 
> Subject: [DISCUSSION] Should we mark DROP COMPACT STORAGE as experimental
> Hi everybody,
>
> There are a significant amount of issues with DROP COMPACT STORAGE that can
> be pretty surprising for users.
> To name a few:
> * Some hidden columns will show up changing the resultset returned for
> wildcard queries
> * As COMPACT tables did not have primary key liveness there empty rows
> inserted AFTER the ALTER will be returned whereas the one inserted before
> the ALTER will not.
> * Also due to the lack of primary key liveness the amount of SSTables being
> read will increase resulting in slower queries
> * After DROP COMPACT it becomes possible to ALTER the table in a way that
> makes all the row disappears
> * There is a loss of functionality around null clustering when dropping
> compact storage (CASSANDRA-16069)
>
> In my opinion DROP COMPACT STORAGE is not ready for production use unless
> users fully understand what they are doing.
> By consequence, I am wondering if we should not mark it as experimental as
> we did for the Materialized Views (CASSANDRA-13959).
>
> What is your opinion?
>


Re: [DISCUSSION] Should we mark DROP COMPACT STORAGE as experimental

2021-06-04 Thread Brandon Williams
+1

On Fri, Jun 4, 2021, 3:53 AM Benjamin Lerer  wrote:

> Hi everybody,
>
> There are a significant amount of issues with DROP COMPACT STORAGE that can
> be pretty surprising for users.
> To name a few:
> * Some hidden columns will show up changing the resultset returned for
> wildcard queries
> * As COMPACT tables did not have primary key liveness there empty rows
> inserted AFTER the ALTER will be returned whereas the one inserted before
> the ALTER will not.
> * Also due to the lack of primary key liveness the amount of SSTables being
> read will increase resulting in slower queries
> * After DROP COMPACT it becomes possible to ALTER the table in a way that
> makes all the row disappears
> * There is a loss of functionality around null clustering when dropping
> compact storage (CASSANDRA-16069)
>
> In my opinion DROP COMPACT STORAGE is not ready for production use unless
> users fully understand what they are doing.
> By consequence, I am wondering if we should not mark it as experimental as
> we did for the Materialized Views (CASSANDRA-13959).
>
> What is your opinion?
>


Re: [DISCUSSION] Should we mark DROP COMPACT STORAGE as experimental

2021-06-04 Thread Joshua McKenzie
>
> not ready for production use unless users fully understand what they are
> doing.

This statement stood out to me - in my opinion we should think carefully
about the surface area of the user interfaces on new features before we add
more cognitive burden to our users. We already have plenty of "foot-guns"
in the project and should only add more if absolutely necessary.

Further, marking this as experimental would be another feature we've
released and then retroactively marked as experimental; that's a habit we
should not get into.

On balance, my .02 is the benefits to our end users and operators of
getting 4.0 to GA outweigh the costs of flagging this as experimental now
so I'm a +1 to the flagging idea, but I think there's some valuable lessons
for us to learn in retrospect from not just this feature but others like it
in the past.

Curious to hear Alex' thoughts about this situation in particular as author
of C-10857. I recall that being a pretty painful slog so apologies in
advance for picking at this scab. :)



On Fri, Jun 4, 2021 at 9:44 AM Brandon Williams  wrote:

> +1
>
> On Fri, Jun 4, 2021, 3:53 AM Benjamin Lerer  wrote:
>
> > Hi everybody,
> >
> > There are a significant amount of issues with DROP COMPACT STORAGE that
> can
> > be pretty surprising for users.
> > To name a few:
> > * Some hidden columns will show up changing the resultset returned for
> > wildcard queries
> > * As COMPACT tables did not have primary key liveness there empty rows
> > inserted AFTER the ALTER will be returned whereas the one inserted before
> > the ALTER will not.
> > * Also due to the lack of primary key liveness the amount of SSTables
> being
> > read will increase resulting in slower queries
> > * After DROP COMPACT it becomes possible to ALTER the table in a way that
> > makes all the row disappears
> > * There is a loss of functionality around null clustering when dropping
> > compact storage (CASSANDRA-16069)
> >
> > In my opinion DROP COMPACT STORAGE is not ready for production use unless
> > users fully understand what they are doing.
> > By consequence, I am wondering if we should not mark it as experimental
> as
> > we did for the Materialized Views (CASSANDRA-13959).
> >
> > What is your opinion?
> >
>


Re: Obfuscation of passwords in audit loging, in or not in 4.0?

2021-06-04 Thread Stefan Miklosovic
Hi,

ok, so this will make it to 4.0 then.

I would re-iterate on FQL logging though. What is our decision? Should
these passwords be clearly visible or we should obfuscate them too?

I am trying to close all remaining questions, while I do get that
passwords in audit are for sure problematic, I do not think that I
have a clear agreement what we should do with FQL yet.

Thank you

On Fri, 4 Jun 2021 at 15:22, Ekaterina Dimitrova  wrote:
>
> +1, please, reclassify it as a bug.
> Thank you Stefan
>
> On Fri, 4 Jun 2021 at 9:13, Brandon Williams  wrote:
>
> > On Fri, Jun 4, 2021 at 4:32 AM Sam Tunnicliffe  wrote:
> > > Shipping a brand new, non-experimental feature with a security hole like
> > this feels
> > > counter to our goal of releases being prod ready in .0, so I'm +1 on
> > including it in
> > > an rc/ga
> >
> > I think I have to agree here.  We can ship a complete feature, we can
> > remove it and not ship it, but what is not acceptable is shipping it
> > in a broken and potentially dangerous state.
> >
> > -
> > To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
> > For additional commands, e-mail: dev-h...@cassandra.apache.org
> >
> >

-
To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
For additional commands, e-mail: dev-h...@cassandra.apache.org



Re: Welcome Dinesh Joshi as Cassandra PMC member

2021-06-04 Thread Dikang Gu
Congrats Dinesh!

On Thu, Jun 3, 2021 at 3:59 PM Patrick McFadin  wrote:

> This is great. Congratulations Dinesh!
>
> On Thu, Jun 3, 2021 at 11:51 AM Jordan West  wrote:
>
> > Congratulations Dinesh!
> >
> > Jordan
> >
> > On Thu, Jun 3, 2021 at 1:40 AM Mick Semb Wever  wrote:
> >
> > > Congrats Dinesh. Thanks for all the help given and offered whenever it
> is
> > > needed!
> > >
> > > On Wed, 2 Jun 2021 at 18:16, Benjamin Lerer  wrote:
> > >
> > > >  The PMC's members are pleased to announce that Dinesh Joshi has
> > accepted
> > > > the invitation to become a PMC member.
> > > >
> > > > Thanks a lot, Dinesh, for everything you have done for the project
> all
> > > > these years.
> > > >
> > > > Congratulations and welcome
> > > >
> > > > The Apache Cassandra PMC members
> > > >
> > >
> >
>


-- 
Dikang


Re: Obfuscation of passwords in audit loging, in or not in 4.0?

2021-06-04 Thread Brandon Williams
On Fri, Jun 4, 2021 at 10:32 AM Stefan Miklosovic
 wrote:
> I would re-iterate on FQL logging though. What is our decision? Should
> these passwords be clearly visible or we should obfuscate them too?

I don't think it ever makes sense to log a password in plaintext, so
my feeling is we should obfuscate there as well.

-
To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
For additional commands, e-mail: dev-h...@cassandra.apache.org