Orthogonality (WAS: Re: [DISCUSSION] CEP-38: CQL Management API)

Joel Shepherd Sat, 18 Oct 2025 06:38:42 -0700

Thanks for the clarifications on CEP-38, Maxim: I actually got someinsights from your comments below that had slipped by me while readingthe CEP.

I want to fork the thread a bit, so breaking this off from the CEP-38DISCUSS thread.

If I can back away a bit and squint ... It seems to me that there arethree initiatives floating around at the moment that could makeCassandra more awesome and manageable, or make it confusing and complex.

1) Patrick McFadin's proposal (as presented at CoC) to align CQLsyntax/semantics closely with PostgreSQL's. I haven't heard anyonestrongly object, but have heard several expressions of surprise. Maybesomething is already in the works, but I'd love to see and discuss aproposal for this, so there's consensus that it's a good idea and (ifneeded) guidelines on how to evolve CQL in that direction.

2) CQL management API (CEP-38): As mentioned in the CEP, it'll take sometime to implement all the functionality that could be in scope of thisCEP. I wonder if it'd be beneficial to have some kind of rubric orguidelines for deciding what kind of things make sense to manage viaCQL, and what don't. For example, skimming through the PostgreSQLmanagement commands, many of them look like they could be thin wrappersover SQL executed against "private" tables and views in the database. Idon't know that that is how they are implemented, but many of thecommands are ultimately just setting a value, or reading and returningvalues that could potentially be managed in tables/views of some sort.(E.g., like Cassandra virtual tables). That seems to fit pretty neatlywith preserving SQL as a declarative, data independent language for dataaccess, with limited side-effects. Is that a useful filter fordetermining what kinds of things can be managed via CQL management, andwhich should be handled elsewhere? E.g., is a filesystem operation likenodetool scrub a good candidate for CQL management or not? (I'd votenot: interested in what others think.)

3) Cassandra Sidecar: Like the CQL management API, I wonder if it'd bebeneficial to have a rubric for deciding what kinds of things make senseto go into Sidecar. The recent discussion about CEP-55 (generated rolenames) landed on implementing the functionality both as a CQL statementand as a Sidecar API. There's also activity around using SIdecar forrolling restarts, backup and restore, etc.: control plane activitiesthat are largely orthogonal to interacting with the data. Shouldoperations that are primarily generating or manipulating data beavailable via Sidecar to give folks the option of invoking them via CQLor HTTP/REST, or would Sidecar benefit from having a more narrowly scopecharter (e.g. data-agnostic control plane operations only)?

I think all of these tools -- CQL, CQL Management API and Sidecar --will be more robust, easier to use, and easier to maintain if we have aconsistent way of deciding where a given feature should live, and aminimal number of choices for accessing the feature. Orthogonalcontrols. Since Sidecar and CQL Management API are pretty new, it's agood time to clarify their charter to ensure they evolve well together.And to get consensus on the long-term direction for CQL.


Let me know if I can help -- Joel.


On 10/7/2025 12:22 PM, Maxim Muzafarov wrote:

Hello Folks,


First of all, thank you for your comments. Your feedback motivates me
to implement these changes and refine the final result to the highest
standard. To keep the vote thread clean, I'm addressing your questions
in the discussion thread.

The vote is here:
https://lists.apache.org/thread/zmgvo2ty5nqvlz1xccsls2kcrgnbjh5v


= The idea: =

First, let me focus on the general idea, and then I will answer your
questions in more detail.

The main focus is on introducing a new API (CQL) to invoke the same
node management commands. While this has an indirect effect on tooling
(cqlsh, nodetool), the tooling itself is not the main focus. The scope
(or Phase 1) of the initial changes is narrowed down only to the API
only, to ensure the PR remains reviewable.

This implies the following:
- the nodetool commands and the way they are implemented won't change
- the nodetool commands will be accessible via CQL, their
implementation will not change (and the execution locality)
- this change introduces ONLY a new way of how management commands
will be invoked
- this change is not about the tooling (cqlsh, nodetool), it will help
them evolve, however
- these changes are being introduced as an experimental API with a
feature flag, disabled by default


= The answers: =

how will the new CQL API behave if the user does not specify a hostname?

The changes only affect the API part; improvements to the tooling will
follow later. The command is executed on the node that the client is
connected to.
Note also that the port differs from 9042 (default) as a new
management port will be introduced. See examples here [1].

cqlsh 10.20.88.164 11211 -u myusername -p mypassword
nodetool -h 10.20.88.164 -p 8081 -u myusername -pw mypassword

If a host is not specified, the cli tool will attempt to connect to
localhost. I suppose.

My understanding is that commands like nodetool bootstrap typically run on a 
single node.

This is correct; however, as I don't control the implementation of the
command, it may actually involve communication with other nodes. This
is actually not part of this CEP. I'm only reusing the commands we
already have.

Will we continue requiring users to specify a hostname/port explicitly, or will 
the CQL API be responsible for orchestrating the command safely across the 
entire cluster or datacenter?

It seems that you are confusing the API with the tooling. The tooling
(cqlsh, nodetool) will continue to work as it does now. I am only
adding a new way in which commands can be invoked - CQL,
orchestration, however, is the subject of other projects. Cassandra
Sidecar?

It might, however, be worth verifying that the proposed CQL syntax aligns with 
PostgreSQL conventions, and adjusting it if needed for cross-compatibility.

It's a bit new info to me that we're targeting PostgreSQL as the main
reference and drifting towards the invoking management operations the
same way. I'm inclined to agree that the syntax should probably be
similar, more or less, however.

We are introducing a new CQL syntax in a minimal and isolated manner.
The CEP-38 defines a small set of management-oriented CQL statements
(EXECUTE COMMAND / DESCRIBE COMMAND) that can be used to match all
existing nodetool commands at once, introducing further aliases as an
option. This eliminates the need to introduce a new antlr grammar for
each management operation.

The command execution syntax is the main thing that users interact
with in this CEP, but I'm taking a more relaxed approach to it for the
following reasons:
- the tip of the iceberg, the unification of the JMX, CQL and possible
REST API for Cassandra is priority;
- the feature will be in experimental state in the major release, we
need collect the real feedback from users and their deployments;
- the aliasing will be used for some important commands like
compaction, bootstrap;

Taking all of the above into account, I still think it's important to
reach an agreement, or at least to avoid objections.
So, I've checked the PostgreSQL and SQL standards to identify areas of
alignment. The latter I think is relatively easy to support as
aliases.


The syntax proposed in the CEP:

EXECUTE COMMAND forcecompact WITH keyspace=distributed_test_keyspace
AND table=tbl AND keys=["k4", "k2", "k7"];

Other Cassandra-style options that I had previously considered:

1. EXECUTE COMMAND forcecompact (keyspace=distributed_test_keyspace,
table=tbl, keys=["k4", "k2", "k7"]);
2. EXECUTE COMMAND forcecompact WITH ARGS {"keyspace":
"distributed_test_keyspace", "table": "tbl", "keys":["k4", "k2",
"k7"]};

With the postgresql context [2] it could look like:

COMPACT (keys=["k4", "k2", "k7"]) distributed_test_keyspace.tbl;

The SQL-standard [3][4] procedural approach:

CALL system_mgmt.forcecompact(
   keyspace => 'distributed_test_keyspace',
   table    => 'tbl',
   keys     => ['k4','k2','k7'],
   options  => { "parallel": 2, "verbose": true }
);


Please let me know if you have any questions, or if you would like us
to arrange a call to discuss all the details.


[1]https://www.instaclustr.com/support/documentation/cassandra/using-cassandra/connect-to-cassandra-with-cqlsh/
[2]https://www.postgresql.org/docs/current/sql-vacuum.html
[3]https://en.wikipedia.org/wiki/Stored_procedure?utm_source=chatgpt.com#Implementation
[4]https://www.postgresql.org/docs/9.3/functions-admin.html

Orthogonality (WAS: Re: [DISCUSSION] CEP-38: CQL Management API)

Reply via email to