Reading through the two, the grouping approach seems like it's a lot more
friendly to newcomers as well as providing context specific cues for
relationships between params you're editing. Showing and not telling, if
you will.

Opening up a 1500+ line .yaml file is very daunting, even if most of it is
comments. Can't blame folks for being overwhelmed at the prospect of tuning
Cassandra w/that as our operator config API. :)

~Josh

On Thu, Sep 2, 2021 at 1:48 PM David Capwell <dcapw...@apple.com.invalid>
wrote:

> Thanks for bringing this back up; Caleb and I were talking about the lack
> of clarity with regard to CASSANDRA-16896, fleshing this out would make
> those configs nicer!
>
> >   To standardize naming - that we did by agreeing to the form noun_verb
>
> If we can document this, it would be great as stuff like “enabled” are
> inconsistent so not sure if I did it properly =D
>
> >
> >   Provision of values with units while maintaining backward
> compatibility.
>
> +1000000000000
>
> I really hate local_read_size_threshold_kb; I would love
> local_read_size_threshold: 10kb.  Once we have the infrastructure in place
> (believe your patch before had these tools) I would love to switch!
>
>
> > Another proposal is done by Benedict; grouping the config parameters.
>
> Yep, this is what triggered Caleb and I to talk about this thread!  To
> group or not to group; that is the question
>
> Personally I like grouping from an organization point of view so am in
> favor of that; though I will agree that it can be hard for some tools (such
> as bash templating), but feel we can always find a common ground
>
>
> > On Sep 2, 2021, at 8:44 AM, bened...@apache.org wrote:
> >
> > Thanks for bringing this to the list Ekaterina!
> >
> > It’s worth noting that the two don’t have to be in conflict: we could
> offer two template yaml with the parameters grouped differently, for users
> to decide for themselves.
> >
> > The proposals primarily define parameter names differently, with my
> proposal going by kind->place, and the other proposal maintaining (mostly)
> the existing name form (which is a bit more like place->kind). While the
> example yaml groups by kind, you can convert nested definitions into a
> ‘dot’ form (e.g. limits.concurrency.reads) for use in a different grouping.
> >
> > One advantage of grouping parameters together is that it aids
> maintaining coherency of naming between systems, and also potentially
> permits a more succinct config file and better discovery. But it’s far from
> a silver bullet, as value judgements have to be made about where the
> grouping lines are. I’m sure anything we settle on will be a huge
> improvement over the status quo, however.
> >
> >
> >
> >
> > From: Ekaterina Dimitrova <e.dimitr...@gmail.com>
> > Date: Thursday, 2 September 2021 at 16:32
> > To: dev@cassandra.apache.org <dev@cassandra.apache.org>
> > Subject: [DISCUSS] CASSANDRA-15234
> > Hi team,
> >
> > I would like to bring to the attention of the community CASSANDRA-15234,
> > standardise config and JVM parameters.
> >
> > This is work we discussed back in Summer 2020 just before our first 4.0
> > Beta release. During the discussion we figured out that there is more
> than
> > one option to do the job and not enough time to get user feedback and
> > finish it so this was delayed post-4.0 And here I am, bringing it back to
> > the table.
> >
> > This work’s goal is:
> >
> >   -
> >
> >   To standardize naming - that we did by agreeing to the form noun_verb
> >   -
> >
> >   Provision of values with units while maintaining backward
> compatibility.
> >
> >
> > Those two parts are more or less already done.
> >
> > More interesting is the third part - reorganizing the cassandra.yaml
> file.
> >
> > My personal approach was to split it into sections, done here
> > <
> https://github.com/ekaterinadimitrova2/cassandra/blob/b4eebe080835da79d032f9314262c268b71172a8/conf/cassandra.yaml
> >
> > .
> >
> > Another proposal is done by Benedict; grouping the config parameters.
> >
> > To make it clearer, he created a yaml
> > <
> https://github.com/belliottsmith/cassandra/blob/5f80d1c0d38873b7a27dc137656d8b81f8e6bbd7/conf/cassandra_nocomment.yaml
> >
> > with comments mostly stripped.
> >
> > In his version, there are basic settings for network, disk etc all
> grouped
> > together, followed by operator tuneables mostly under limits within which
> > we now have throughput, concurrency, capacity. This leads to settings for
> > some features being kept separate (most notably for caching), but helps
> the
> > operator understand what they have to play with for controlling resource
> > consumption.
> >
> > I am interested to hear what people think about the two options or if
> > anyone has another idea to share, open discussion.
> >
> > Thank you,
> >
> > Ekaterina
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
> For additional commands, e-mail: dev-h...@cassandra.apache.org
>
>

Reply via email to