Re: [DISCUSS] auto-installing golang in `ant gen-doc` (CASSANDRA-19915)

2025-04-30 Thread Chris Lohfink
Cassandra's rube goldberg build system is so incredibly painful to integrate inside corporate CI environments already... maybe docker containers so you dont actually install random tools on the host computer it might not have privileges to do? On Wed, Apr 30, 2025 at 6:13 AM Josh McKenzie wrote:

Re: Welcome Jaydeepkumar Chovatia as Cassandra committer

2025-04-30 Thread Chris Lohfink
congratulations! On Wed, Apr 30, 2025 at 8:24 AM Joseph Lynch wrote: > Congratulations Jaydeep! > > On Wed, Apr 30, 2025 at 9:19 AM Bernardo Botella < > conta...@bernardobotella.com> wrote: > >> More great news! >> >> Congratulations Jaydeep! >> >> On Apr 30, 2025, at 6:11 AM, Jon Haddad wrote:

Re: Merging compaction improvements to 5.0

2025-04-15 Thread Chris Lohfink
+1 On Sun, Apr 13, 2025 at 12:32 PM Jordan West wrote: > Hi Folks, > > A bit delayed but I have the backport for 20092 ready. The branch can be > found here: > https://github.com/apache/cassandra/compare/cassandra-5.0...jrwest:cassandra:jwest/20092-5.0-backport. > I've run tests and all looked g

Re: Dropwizard/Codahale metrics deprecation in Cassandra server

2025-03-11 Thread Chris Lohfink
Just something to be mindful about what we had *before* codahale in Cassandra and avoid that again. Pre 1.1 it was pretty much impossible to collect metrics without looking at code (there were efficient custom made things, but each metric was reported differently) and that stuck through until 2.2 d

Re: [DISCUSS] synchronisation of properties between Config.java and cassandra.yaml

2025-01-27 Thread Chris lohfink
Might be a bit of a balance between exposing what people actually are likely to need to modify vs having a super intimidating config file. It's already nearly 2000 lines. Personally I'd rather see some auto-documentation or something that's in the docs

Re: [DISCUSS] CEP-45: Mutation Tracking

2025-01-09 Thread Chris Lohfink
Is this something we can disable? I can see scenarios where this would be strictly and severely worse then existing scenarios where we don't need repairs. ie short time window data, millions of writes a second that get thrown out after a few hours. If that data is small partitions we are nearly dou

Re: [VOTE] Release Apache Cassandra Java Driver 3.12.0 (2nd attempt)

2025-01-08 Thread Chris Lohfink
+1 On Wed, Jan 8, 2025 at 4:14 PM Bret McGuire wrote: > Greetings all! > >I’m proposing the Cassandra Java Driver 3.12.0 for release. This > represents a second attempt at releasing this version; an earlier attempt > failed due to a staging issue. > > sha1: 1a96d27130ea43ed5762c4f7b7cc182eb

Re: [DISCUSS] Replacement of SSTable's partition cardinality implementation from stream-lib to Apache Datasketches

2025-01-02 Thread Chris Lohfink
hould really convert >> from one HLL to another. >> >> The proposal to reduce allocations appears to be orthogonal to this >> library, so let’s separate out that discussion? If there’s evidence this >> library alone improves the memory profile let’s discuss that. >&g

Re: Default sorting in nodetool status CASSANDRA-20104

2024-12-13 Thread Chris Lohfink
In the case of not using vnodes, sorting by token would be ideal. Chris On Fri, Dec 13, 2024 at 6:52 AM Štefan Miklošovič wrote: > I want to add that I slightly prefer ordering by ID because for cases when > a cluster is deployed in a cloud environment, IP addresses assigned to > nodes are basi

Re: Re-evaluate compaction defaults in 5.1/trunk

2024-12-09 Thread Chris lohfink
ly in areas I know little of since we are not on that version yet), I do feel the need to bring it up in the scenario when talking about moving to the default. Chris On Sat, Dec 7, 2024 at 6:06 AM Štefan Miklošovič wrote: > > > On Sat, Dec 7, 2024 at 4:42 AM Chris Lohfink wrote: > &g

Re: Re-evaluate compaction defaults in 5.1/trunk

2024-12-06 Thread Chris Lohfink
While I am actually +1 on LCS being default as it handles more use cases well compared to STCS. I am -1 on UCS being default anywhere currently, the UX is horrible, documentation is unreadable and it's only available on a release barely anyone uses yet (not adequately tested in production). Seems l

Re: [DISCUSS] CEP-31 negotiated authentication

2024-12-05 Thread Chris Lohfink
Can also accomplish this on the client instead of server. Make client side supports multiple authentication mechanisms and does whatever START says. That's how we have upgraded to different auth mechanisms without downtime or interruption. For java in example, for AuthProvider of driver override p

Re: [VOTE] CEP-37: Repair scheduling inside C*

2024-11-06 Thread Chris Lohfink
+1 On Wed, Nov 6, 2024 at 11:10 AM Francisco Guerrero wrote: > +1 (nb) > > On 2024/11/06 14:07:47 "Tolbert, Andy" wrote: > > +1 (nb) > > > > On Tue, Nov 5, 2024 at 9:51 PM Josh McKenzie > wrote: > > > > > +1 > > > > > > On Tue, Nov 5, 2024, at 4:28 PM, Jaydeep Chovatia wrote: > > > > > > Hi Eve

Re: [Discuss] Repair inside C*

2024-10-21 Thread Chris Lohfink
;> chovatia.jayd...@gmail.com> wrote: >> >>> Sorry, there is a typo in the CEP-37 link; here is the correct link >>> <https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-37+Apache+Cassandra+Unified+Repair+Solution> >>> >>> >>> On T

Re: [DISCUSS] CASSANDRA-13704 Safer handling of out of range tokens

2024-09-12 Thread Chris Lohfink
While the code touches quite a few places the change itself is pretty innocuous but is massively impactful in bad scenarios. I am in favor of this patch myself as this protects the database from data loss that occurs in many different ways. An example I have seen recently (in 4.1) is when using GPF

Re: [DISCUSS] CEP-39: Cost Based Optimizer

2023-12-15 Thread Chris Lohfink
ing behind those plans. Secondary indexes are being > worked on. We hope that the optimizer will also help with some index > queries. > > It seems to me that this proposal is going toward the direction that you > want without introducing new problems for scalability. > > > L

Re: [DISCUSS] CEP-39: Cost Based Optimizer

2023-12-14 Thread Chris Lohfink
I don't wanna be a blocker for this CEP or anything but did want to put my 2 cents in. This CEP is horrifying to me. I have seen thousands of clusters across multiple companies and helped them get working successfully. A vast majority of that involved blocking the use of MVs, GROUP BY, secondary i

Re: CASSANDRA-18941 produce size bounded SSTables from CQLSSTableWriter

2023-10-24 Thread Chris Lohfink
+1 On Tue, Oct 24, 2023 at 11:24 AM Brandon Williams wrote: > +1 > > Kind Regards, > Brandon > > On Mon, Oct 23, 2023 at 6:22 PM Yifan Cai wrote: > > > > Hi, > > > > I want to propose merging the patch in CASSANDRA-18941 to 4.0 and up to > trunk and hope we are all OK with it. > > > > In CASSAN

Re: [VOTE] Accept java-driver

2023-10-03 Thread Chris Lohfink
+1 On Tue, Oct 3, 2023 at 10:30 AM Jeff Jirsa wrote: > +1 > > > On Mon, Oct 2, 2023 at 9:53 PM Mick Semb Wever wrote: > >> The donation of the java-driver is ready for its IP Clearance vote. >> https://incubator.apache.org/ip-clearance/cassandra-java-driver.html >> >> The SGA has been sent to t

Re: [DISCUSS] Backport CASSANDRA-18816 to 5.0? Add support for repair coordinator to retry messages that timeout

2023-09-19 Thread Chris Lohfink
I absolutely love the idea of this being in 5.0, I am +1 for what it is worth On Tue, Sep 19, 2023 at 4:04 PM David Capwell wrote: > To try to get repair more stable, I added optional retry logic (patch is > still in review) to a handful of critical repair verbs. This patch is > disabled by def

Re: [Discuss] Repair inside C*

2023-07-25 Thread Chris Lohfink
I think a CEP is the next step. Considering the number of companies involved, this might necessitate several drafts and rounds of discussions. I appreciate your initiative in starting this process, and I'm eager to contribute to the ensuing discussions. Maybe in a google docs or something initially

Re: [VOTE] CEP-8 Datastax Drivers Donation

2023-06-15 Thread Chris Lohfink
+1 On Wed, Jun 14, 2023 at 9:05 PM Jon Haddad wrote: > +1 > > On 2023/06/13 14:14:35 Jeremy Hanna wrote: > > Calling for a vote on CEP-8 [1]. > > > > To clarify the intent, as Benjamin said in the discussion thread [2], > the goal of this vote is simply to ensure that the community is in favor o

Re: [VOTE] CEP-21 Transactional Cluster Metadata

2023-02-08 Thread Chris Lohfink
+1 On Wed, Feb 8, 2023 at 10:36 AM Blake Eggleston wrote: > +1 > > On Feb 6, 2023, at 8:15 AM, Sam Tunnicliffe wrote: > > Hi everyone, > > I would like to start a vote on this CEP. > > Proposal: > > https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-21%3A+Transactional+Cluster+Metadata >

Re: Implicitly enabling ALLOW FILTERING on virtual tables

2023-02-03 Thread Chris Lohfink
e big ones. That could be specified in the doc for each > table, and it could also be included in the table properties, so it's > displayed by DESCRIBE TABLE queries. > > On Fri, 3 Feb 2023 at 20:56, Chris Lohfink wrote: > >> Just to 2nd what Scott days. While everything is in

Re: Implicitly enabling ALLOW FILTERING on virtual tables

2023-02-03 Thread Chris Lohfink
Just to 2nd what Scott days. While everything is in memory now, it may not be in the future, and if we add it implicitly, we are tying ourselves to be in memory only. However, I wouldn't -1 the idea. Another option may be a cqlsh option (ie like expand on/off) to always include a flag so it doesnt

Re: [DISCUSS] Diagnostic events in virtual tables

2021-09-13 Thread Chris Lohfink
Perhaps re-add the settings virtual table mutability. That way the same place can be used to update settings at runtime for multiple things instead of creating a new virtual table per service we want to make hot props for. Might be kinda nice to allow REGISTER and EVENT CQL events to be created wi

Re: [VOTE] CEP-13: Denylisting partitions

2021-09-08 Thread Chris Lohfink
+1 On Wed, Sep 8, 2021 at 11:58 AM bened...@apache.org wrote: > +1 > > From: Brandon Williams > Date: Wednesday, 8 September 2021 at 17:57 > To: dev@cassandra.apache.org > Subject: Re: [VOTE] CEP-13: Denylisting partitions > +1 > > On Wed, Sep 8, 2021 at 11:31 AM Sumanth Pasupuleti > wrote: >

Re: [DISCUSS] Virtual Tables and the future of NodeTool/JMX

2021-07-19 Thread Chris Lohfink
> a) Allow VirtualTables to be settable - to support changing parameters (ie. > nodetool setcompactionthroughput 32). Can be accomplished by updating or inserting on the settings virtual table via UPDATE system_views.settings SET value = 32 WHERE name = 'compaction_throughput'; INSERT INTO syste

Re: [VOTE] Release dtest-api 0.0.5

2020-09-25 Thread Chris Lohfink
+1 On Fri, Sep 25, 2020 at 10:11 AM Caleb Rackliffe wrote: > +1 > > On Fri, Sep 25, 2020 at 10:08 AM Brandon Williams > wrote: > > > +1 > > > > On Fri, Sep 25, 2020, 9:45 AM Oleksandr Petrov < > oleksandr.pet...@gmail.com > > > > > wrote: > > > > > Proposing the test build of in-jvm dtest API 0

Re: purpose of queryStates map in ServerConnection

2020-05-28 Thread Chris Lohfink
I cant see your profiler image so I may be missing something. I remember looking at this in past too. At least with 3.x version of java client the stream ids are reused as released so low volume apps wont have much overhead and highly concurrent ones it prevents allocating 20-100k objs/sec. I thin

Re: Calling for release managers (Committers and PMC)

2020-05-08 Thread Chris Lohfink
I'd like to get involved in this as well. On Thu, May 7, 2020 at 2:06 PM Jon Meredith wrote: > Sign me up. > > On Thu, May 7, 2020 at 12:36 PM Robert Stupp wrote: > > > > I can help > > > > -- > > Robert Stupp > > @snazy > > > > > Am 07.05.2020 um 20:29 schrieb Mick Semb Wever : > > > > > > Th

Re: server side describe

2020-04-09 Thread Chris Lohfink
I'd be in favor of going with the newer DESCRIBE option. The original patch was mostly focused on just getting the CQL correct and used virtual tables because its what the initial feedback was to do. Robert added a lot of functionality on top of what was there which is what people were starting to

Re: Virtual tables in Cassandra 4.0

2019-09-19 Thread Chris Lohfink
It's a new feature in a version thats just in alpha so how they will look may change in the end. There will hopefully be documentation around 4.0 GA release. The feature at a high level though is an interface to handle queries with an implementing class instead of the storage engine. A usecase tha

Re: [DISCUSS] Server side CQL schema

2019-09-05 Thread Chris Lohfink
> > > On Sep 5, 2019, at 2:57 PM, Chris Lohfink wrote: > > > > In https://issues.apache.org/jira/browse/CASSANDRA-14825 its been > discussed > > on how a server side DDL can be provided to clients. > > > > The original thought was to use a virtual tab

[DISCUSS] Server side CQL schema

2019-09-05 Thread Chris Lohfink
In https://issues.apache.org/jira/browse/CASSANDRA-14825 its been discussed on how a server side DDL can be provided to clients. The original thought was to use a virtual table to make it less invasive of a change while providing access to the TableCQLHelper output (and fixing it). The other is t

Re: "4.0: TBD" -> "4.0: Est. Q4 2019"?

2019-05-28 Thread Chris Lohfink
I think that was intent but ran out of time. There were 2 approaches to this, one being dumping the Metric's into a virtual table, and another to have specialized views (like nodetool). The later is started in CASSANDRA-14670 and still waitin

Re: Audit logging to tables.

2019-04-09 Thread Chris Lohfink
> >> > > > > >> On Mon, Mar 25, 2019 at 11:29 AM Sagar < > sagarmeansoc...@gmail.com> > > > > wrote: > > > > >> > > > > >> > Thanks Chris. I got caught up with a few things and couldn't > >

Re: Audit logging to tables.

2019-03-06 Thread Chris Lohfink
cing thing yet. As Chris pointed out, there's > other > >> work that would need to be done to work on virtual tables for large > >> datasets. > >> > >> Jon > >> > >> On Mon, Mar 4, 2019 at 6:42 AM Chris Lohfink > >> wrote: &

Re: Audit logging to tables.

2019-03-04 Thread Chris Lohfink
While you probably could put a virtual table wrapper over the binlogs, you would want to wait for something like https://issues.apache.org/jira/browse/CASSANDRA-14629 to get in so you would not OOM. The current virtual table implementation requires you have the entire result set to be returned at o

Re: Which approach should we use for exposing metrics through Virtual tables?

2018-06-22 Thread Chris Lohfink
lutions and there limitations and > advantages to find out which is the solution that people prefer or to see > if somebody can come up with another solution. > > In CASSANDRA-7622, Chris Lohfink proposed to expose the table metric using > the following schema: > > VIRTUAL TA

Re: Repair scheduling tools

2018-04-05 Thread Chris Lohfink
> I do have a hard time buying that an opt-in repair *scheduling* is going to > cause heap problems or impact the daemon significantly; the scheduler > literally reads a few bytes out of a Cassandra table and makes a function > call or two, and then sleeps for 2 minutes. I wouldn't trivialize it,

Re: Expensive metrics?

2018-02-28 Thread Chris Lohfink
> our histograms when built with a Meter use a ExponentiallyDecayingReservoir > but our histograms built directly use DecayingEstimatedHistogramReservoir > algorithm Meters dont use a decaying reservoir, they use EMWA

Re: Cassandra Needs to Grow Up by Version Five!

2018-02-21 Thread Chris Lohfink
Instead of saying "Make X better" you can quantify "Here's how we can make X better" in a jira and the conversation will continue with interested parties (opening jiras are free!). Being combative and insulting project on mailing list may help vent some frustrations but it is counter productive

Re: Coordinator Write Metrics per CF

2018-02-12 Thread Chris Lohfink
It would be good to have it. Its not that its not there because its difficult or anything. I think its more that the read latency metric was needed for speculative retry so it was added but the write side wasn't needed for that feature so wasn't added at same time. It would be very useful in det

Re: Do not use Cassandra 3.11.0+ or Cassandra 3.0.12+

2017-09-12 Thread Chris Lohfink
Last Ive seen of it OpsCenter does not collect this metric. I don't think any monitoring tools do. Chris > On Sep 11, 2017, at 4:06 PM, CPC wrote: > > Hi, > > Is this bug fixed in dse 5.1.3? As I understand calling jmx getTombStoneRatio > trigers that bug. We are using opscenter as well and d

Re: High CPU after read timeout

2017-07-13 Thread Chris Lohfink
There is no mechanism for reads to timeout once they have started. The messaging service will drop the request when its received on the ReadStage or RequestResponseStage. This is how its always operated so not unique to 3.9. If a client disconnects from a coordinator there is also no way for the re

Re: Weekly Cassandra Wrap-up

2017-04-03 Thread Chris Lohfink
https://github.com/tolbertam/sstable-tools https://github.com/instaclustr/cassandra-sstable-tools https://github.com/spotify/cassandra-opstools

Re: Truncate operation not available in Mutation Object

2017-02-23 Thread Chris Lohfink
The truncates are written to the truncated_at field in system.local and should be honored by the commit log replayer ( https://github.com/apache/cassandra/blob/af3fe39dcabd9ef77a00309ce6741268423206df/src/java/org/apache/cassandra/db/commitlog/CommitLogReplayer.java#L102 ). Chris On Wed, Feb 22,

Re: Cassandra Mutation object decoding

2016-11-22 Thread Chris Lohfink
There are different kinds of tombstones, a partition tombstone is held in the MutableDeletionInfo of the PartitionUpdate that you can get from deletionInfo() method which returns the private deletionInfo field from the holder. There are also row and cell deletions so you have to handle each of thos

Re: Re[3]: Histogram error "Unable to compute ceiling for max when histogram overflowed"

2016-10-20 Thread Chris Lohfink
i think this is already fixed in https://issues.apache.org/jira/browse/CASSANDRA-7 On Thu, Oct 20, 2016 at 3:56 PM, Nate McCall wrote: > Open a new issue and link to CASSANDRA-11063. Including a test case > addressing your issue that fails after the 11063 change would be ideal > as well. > >

Re: Contribution

2016-03-28 Thread Chris Lohfink
I would second the suggestion of going over https://academy.datastax.com/ then can check out http://www.datastax.com/dev/blog/deep-into-cassandra-internals Chris On Mon, Mar 28, 2016 at 8:04 AM, Pedro Gordo wrote: > Hi! > > I think that the best place to start is to see the DataStax videos. The