Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-12-14 Thread Claude Warren
Is there still interest in this? Can we get some points down on electrons so that we all understand the issues? While it is fairly simple to redirect the read/write to something other than the local system for a single node this will not solve the problem for tiered storage. Tiered storage w

Implementing a secondary index

2021-11-17 Thread Claude Warren
/blooming_cassandra -- [image: Instaclustr logo] *Claude Warren* Principal Software Engineer Instaclustr

Can I create a secondary index with multiple SSTables

2021-12-09 Thread Claude Warren
construct 2 different names for the call to TableMetadata.builder(). Is there anything special I need to do for Cassandra to manage the SSTable as per a normal index. -- [image: Instaclustr logo] *Claude Warren* Principal Software Engineer Instaclustr

Re: [DISCUSS] Periodic snapshot publishing with minor version bumps

2021-12-31 Thread Claude Warren
I am late to this party but wanted to add my 2-cents. I do not think that the minor revisions should be used to denote snapshot, nightly build, or any other not-fully-supported code. My reasoning is that semantic versioning defines under which conditions the version numbers are to change. By loo

Re: [DISCUSS] Periodic snapshot publishing with minor version bumps

2022-01-03 Thread Claude Warren
Claude On Fri, 31 Dec 2021 at 08:42, Claude Warren wrote: > I am late to this party but wanted to add my 2-cents. > > I do not think that the minor revisions should be used to denote snapshot, > nightly build, or any other not-fully-supported code. My reasoning is that > semantic

CQL Tuples & CQL Grammar

2022-03-09 Thread Claude Warren
? Claude -- [image: Instaclustr logo] *Claude Warren* Principal Software Engineer Instaclustr

Re: CQL Tuples & CQL Grammar

2022-03-10 Thread Claude Warren
n CQL. > > Le jeu. 10 mars 2022 à 06:50, Claude Warren > a écrit : > >> I have been looking at CqlParser.g4 file for cql3 and have a question >> about assignment tuples. The assignment tuple is defined as : >> >> assignmentTuple >>: syntaxBracketLr

Re: [DISCUSS] CEP-20: Dynamic Data Masking

2022-08-31 Thread Claude Warren via dev
Is there enough support here for VIEWS to be the implementation strategy for displaying masking functions? It seems to me the view would have to store the query and apply a where clause to it, so the same PK would be in play. It has data leaking properties. It has more use cases as it can be

Re: [DISCUSS] LWT UPDATE semantics with + and - when null

2022-08-31 Thread Claude Warren via dev
I like this approach.  However, in light of some of the discussions on view and the like perhaps the function is  (column value as returned by select ) + 42 So a null counter column becomes 0 before the update calculation is applied. Then any null can be considered null unless addressed by IfN

[DISCUSS] CEP-23: Enhancement for Sparse Data Serialization

2022-09-05 Thread Claude Warren via dev
I have just posted a CEP  covering an Enhancement for Sparse Data Serialzation.  This is in response to CASSANDRA-8959 I look forward to responses.

Re: [DISCUSS] CEP-23: Enhancement for Sparse Data Serialization

2022-09-05 Thread Claude Warren via dev
efficiency improvement. The CEP guidelines say a CEP is encouraged “for significant user-facing or changes that cut across multiple subsystems”. Any reason why a Jira isn’t sufficient? Abe On Sep 5, 2022, at 1:57 AM, Claude Warren via dev wrote: I have just posted a CEP covering an Enhancement

Re: [DISCUSS] CEP-20: Dynamic Data Masking

2022-09-07 Thread Claude Warren via dev
but no one impedes us to also have 3) if we get to have projected views. However, I think that projected views is a new general-purpose feature with its own complexities, so it would deserve its own CEP, if someone is willing to work on the implementation. On Wed, 31 Aug 2

Re: [DISCUSS] CEP-23: Enhancement for Sparse Data Serialization

2022-09-07 Thread Claude Warren via dev
engage in discussion it’s unclear to me what advantage this encoding scheme brings. It might be worth outlining what algorithmic advantage you foresee for what data distributions in which collection types. On 6 Sep 2022, at 07:16, Claude Warren via dev wrote: I am just learning the ropes

Committer needed for Deprecate Throwables.propagate usage

2022-09-20 Thread Claude Warren via dev
I made the necessary fixes to remove the deprecated Throwables.propagate calls.  However, I need a committer to review. https://issues.apache.org/jira/browse/CASSANDRA-14218 Thank you, Claude

Weird results

2022-12-15 Thread Claude Warren, Jr via dev
I am working on a StandaloneDowngrader.java based on StandaloneUpgrader.java While working on the tests I had a problem with 2 test (testFlagArgs and testDefaultCall) that failed with: ERROR [main] 2022-12-14 10:35:20,051 SSTableReader.java:496 - Cannot open /home/claude/apache/cassandra/build/te

upgrade sstable selection

2023-01-10 Thread Claude Warren, Jr via dev
Greetings, I am working on the downgradesstables code and seem to have a problem with ordering of the downgrade or perhaps the Directories.SSTableLister I lifted the code from upgradesstables to select the files to downgrade. The only difference in the code that selects the files to downgrade is

Re: upgrade sstable selection

2023-01-10 Thread Claude Warren, Jr via dev
on > > On Tue, Jan 10, 2023 at 4:44 AM Claude Warren, Jr via dev > wrote: > > > > Greetings, > > > > I am working on the downgradesstables code and seem to have a problem > with ordering of the downgrade or perhaps the Directories.SSTableLister > > > >

Re: [DISCUSS] Clear rules about sstable versioning and downgrade support

2023-01-16 Thread Claude Warren, Jr via dev
What does this mean for the Trie sstable format? Would it perhaps make sense to version the sstable upgrader (and future downgrader) based on the highest version they understand? for example sstableupgrader version N will handle the n? versions so it can upgrade from m? while sstabledowngrader ve

Upgrading sstables and default partitioner.

2023-01-26 Thread Claude Warren, Jr via dev
Greetings, I am working on porting a fix for table upgrade order into V3.0 and have come across the following issue: ERROR 10:23:31 Cannot open /home/claude/apache/cassandra/build/test/cassandra/data/system_schema/keyspaces-abac5682dea631c5b535b3d6cffd0fb6/me-89-big; partitioner org.apache.cassan

Re: [DISCUSSION] Cassandra's code style and source code analysis

2023-01-27 Thread Claude Warren, Jr via dev
Turn it on at warning (or lower) level now, so people have some idea of the size of change to their current code. On Wed, Jan 25, 2023 at 12:05 PM Miklosovic, Stefan < stefan.mikloso...@netapp.com> wrote: > Thank you Maxim for doing this. > > It is nice to see this effort materialized in a PR. >

Re: [DISCUSSION] Framework for Internal Collection Exposure and Monitoring API Alignment

2023-01-30 Thread Claude Warren, Jr via dev
Actually, Maxim's proposal does not depend on JMX being present or not. What the proposal does is make it easier to create/sync multiple presentations of the same internal data: Virtual Tables, JMX, Metrics, next year's greatest data presentation strategy. Removing JMX from the mix just reduces

Re: Cassandra CI Status 2023-01-07

2023-02-10 Thread Claude Warren, Jr via dev
New Failures from Build Lead Week 5 *** CASSANDRA-18198 - "AttributeError: module 'py' has no attribute 'io'" reported in multiple tests - reported in 4.1, 3.11, and 3.0 - identified as a possible class loader issue associated with CASSANDRA-18150 *** CASSANDRA-18191 - Native Transport SSL tests

downgrade sstables

2023-02-20 Thread Claude Warren, Jr via dev
I have been working on downgrading sstables for awhile now. I have the downgrader mostly working. The only issue is when downgrading system tables. Specifically during the 3.1 -> 4.0 changes a column broadcast_port was added to system/local. This means that 3.1 system can not read the table as

Re: Downgradability

2023-02-21 Thread Claude Warren, Jr via dev
My goal in implementing CASSANDRA-8928 was to be able to take the current version 4.x and write it as the earliest 3.x version possible. The reasoning being that if that was possible then whatever 3.x version was executed

Removing columns from sstables

2023-02-22 Thread Claude Warren, Jr via dev
Greetings, I have been looking through the code and I can't find any place where columns are removed from an sstable. I have found that rows can be deleted. Columns can be marked as deleted. But I have found no place where the deleted cell is removed from the row. Is there the concept of comp

Re: Removing columns from sstables

2023-02-22 Thread Claude Warren, Jr via dev
Close. It is still in the table so the v3.x code that reads system.local will detect it and fail on an unknown column as that code appears to be looking at the actual on-disk format. It sounds like the short answer is that there is no way to physically remove the column from the on-disk format on

Re: Downgradability

2023-02-23 Thread Claude Warren, Jr via dev
Broken downgrading can be fixed (I think) by modifying the SearializationHeader.toHeader() method where it currently throws an UnknownColumnException. If we can, instead of throwing the exception, create a dropped column for the unexpected column then I think the code will work. I realise that to

[DISCUSS] Single boilerplate script

2023-02-23 Thread Claude Warren, Jr via dev
Pull request https://github.com/apache/cassandra/pull/1950/files is an attempt to move the boilerplate coding from the script files into a single maintainable file. This change does 4 things: 1. Moves the standard boiler plate from the standard scripts into a single maintainable script to b

Re: Downgradability

2023-02-23 Thread Claude Warren, Jr via dev
y the price for it. >> I simply prefer that we take the time to do things right. >> >> Thanks to Scott and you, downgradability got a much better visibility so >> no matter what approach we pick, I am convinced that we will get there. >> >> Le jeu. 23 févr.

[DISCUSS] Moving standard boiler plate script blocks.

2023-03-22 Thread Claude Warren, Jr via dev
I would like to get some more eyes on https://github.com/apache/cassandra/pull/1950/files wich arises from CASSANDRA-17773 The basic idea is to: - Move the boiler plate script code to a single sourced file. - Add code to make debugging scripts easier, this in response to CASSANDRA-17773

[DISCUSS] Initial implementation of cassandra-conf with nodetool example

2023-04-17 Thread Claude Warren, Jr via dev
The pull request [1] is a proposed fix for CASSANDRA-17773. I am looking for comments and a decision as to whether to move forward or not with this change. The goal is to remove much of the boiler-plate code from scripts without changing their functionality or arguments and to add the ability to

[COMPRESSION PARAMETERS] Question

2023-04-19 Thread Claude Warren, Jr via dev
Currently the compression parameters has an option called enable. When enable=false all the other options have to be removed. But it seems to me that we should support enabled=false without removing all the other parameters so that users can disable the compression for testing or problem resoluti

[DISCUSS] Standalone downgrader

2023-05-15 Thread Claude Warren, Jr via dev
I have an open pull request [1] to merge in a standalone downgrader. The problem has been that between v3 and v4 there was a breaking change in the system local table where the columns "broadcast_port", "listen_port", and "rpc_port" were added. The code (in the current pull request) provides fun

Re: [DISCUSS] Bring cassandra-harry in tree as a submodule

2023-05-25 Thread Claude Warren, Jr via dev
Since the talk was not accepted for Cassandra Summit, would it be possible to record it as a simple youtube video and publish it so that the detailed information about how to use Harry is not lost? On Thu, May 25, 2023 at 7:36 AM Alex Petrov wrote: > While we are at it, we may also want to pull

Bloom filter calculation

2023-07-10 Thread Claude Warren, Jr via dev
Can someone explain to me how the Bloom filter table in BloomFilterCalculations was derived and how it is supposed to work? As I read the table it seems to indicate that with 14 hashes and 20 bits you get a fp of 6.71e-05. But if you plug those numbers into the Bloom filter calculator [1], that

Re: Bloom filter calculation

2023-07-11 Thread Claude Warren, Jr via dev
positive rate of a bloom > filter of 20 bits with 10 entries, which means only 2 bits per entry? > > On 11 Jul 2023, at 07:14, Claude Warren, Jr via dev < > dev@cassandra.apache.org> wrote: > >  > Can someone explain to me how the Bloom filter table in > BloomFilter

[DISCUSS] Tiered Storage

2023-07-24 Thread Claude Warren, Jr via dev
I have been thinking about tiered storage wherein infrequently used data can be moved off to slow (cold) storage (like S3). I think that CEP-17 in conjunction with CEP-21 provides an opportunity for an interesting approach. As I understand it CEP-17 clarified the SSTables interface(s) so that alt

Re: [DISCUSSION] Shall we remove ant javadoc task?

2023-08-03 Thread Claude Warren, Jr via dev
I think that we can get more developers interested if there are available javadocs. While many of the core classes are not going to be touched by someone just starting, being able to understand what the external touch points are and how they interact with other bits of the system can be invaluable

Fixes for UDF NPE during restart.

2023-08-15 Thread Claude Warren, Jr via dev
CASSANDRA-18739 describes a reproducible NPE on restart with some UDFs. The solution outlined in that ticket was not used and a much simpler solution provided by Stefan Miklosovic was implemented. There are 2 pull requests open for Cassandra 4.0 and 4.1 that have the fairly simple fix as well as a

[DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-09-24 Thread Claude Warren, Jr via dev
I have just filed CEP-36 [1] to allow for keyspace/table storage outside of the standard storage space. There are two desires driving this change: 1. The ability to temporarily move some keyspaces/tables to storage outside the normal directory tree to other disk so that compaction can o

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-09-25 Thread Claude Warren, Jr via dev
de, SSD, HDD, and data placement based > on requests. After briefly browsing the proposals, it seems that there are > some differences. Can you help to do some explain ? Thanks 。 > > > Claude Warren, Jr via dev 于2023年9月25日周一 > 14:49写道: > >> I have just filed CEP-36 [1]

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-09-25 Thread Claude Warren, Jr via dev
e discussing the whole final state, and not an implementation >> detail of one part of the final state? >> >> >> >> >> >> >> On Sun, Sep 24, 2023 at 11:49 PM Claude Warren, Jr via dev < >> dev@cassandra.apache.org> wrote: >> >>

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-09-26 Thread Claude Warren, Jr via dev
which feels a bit customized for a certain user > and is not universal enough.Am I right ? > > Claude Warren, Jr 于2023年9月26日周二 14:36写道: > >> My intention is to develop an S3 storage system using >> https://github.com/carlspring/s3fs-nio >> >> There are several issues

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-09-26 Thread Claude Warren, Jr via dev
backend > filesystems and their individual configuration that can be used, as well as > configuration and support for a "backend file router" for file creation > (and opening) that can be used to route files to the backend most > appropriate. > > > > Regards,

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-09-28 Thread Claude Warren, Jr via dev
ific behaviors that > show up above the `ChannelProxy` layer that will depend on the backend. > > > > Ideally there would be some config to specify several backend > filesystems and their individual configuration that can be used, as well as > configuration and support fo

multiple ParameterizedClass objects?

2023-10-03 Thread Claude Warren, Jr via dev
I have a case where I would like to be able to specify a collection of ParameterizedClass objects in the configuration file. Is there a standard way to do this? If not, does anyone have a suggestion for a clean way to implement it? Claude

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-10-10 Thread Claude Warren, Jr via dev
i, Sep 29, 2023 at 8:07 AM Claude Warren, Jr wrote: > Sorry I was out sick and did not respond yesterday. > > Henrik, How does your system work? What is the design strategy? Also is > your code available somewhere? > > After looking at the code some more I think that the best solut

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-10-18 Thread Claude Warren, Jr via dev
:28 AM Claude Warren, Jr wrote: > I have been exploring adding a second Path to the Cassandra File object. > The original path being the path within the standard Cassandra directory > tree and the second being a translated path when there is what was called a > ChannelProxy in place. &

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-10-18 Thread Claude Warren, Jr via dev
other FileSystem implementations. I look forward to hearing from you, Claude On Wed, Oct 18, 2023 at 9:00 AM Claude Warren, Jr wrote: > After a bit more analysis and some testing I have a new branch that I > think solves the problem. [1] I have also created a pull request internal > to

Re: [DISCUSS] CommitLog default disk access mode

2023-10-18 Thread Claude Warren, Jr via dev
I think introducing the feature is a good idea. I also think that it should _NOT_ be enabled by default for all the reasons stated above. Finding a cohort of users who are interested in turning it on would provide a nice testbed to shake out any issues without affecting everyone. On Tue, Oct 17,

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-10-19 Thread Claude Warren, Jr via dev
8, 2023 at 10:06 AM guo Maxwell wrote: > > If it is ok for Henrik to rebase the Astra implementation of this > functionality (FileSystemProvider) onto Cassandra trunk. > > Then we can create a jira to move this forward for a small step. > > Claude Warren, Jr 于2023年10月18日周三 15:05写

CASSANDRA-18775 (Cassandra supported OSs)

2023-10-20 Thread Claude Warren, Jr via dev
I am looking at https://issues.apache.org/jira/browse/CASSANDRA-18775 and want to ensure that I do not remove too many libraries. I think that preserving any sigar library where the file name contains the word "linux" or "macosx" should be acceptable. This will preserve: libsigar-amd64-linux.so l

CASSANDRA-16565

2023-10-24 Thread Claude Warren, Jr via dev
I am working on https://issues.apache.org/jira/browse/CASSANDRA-16565 and have a small testing program that executes the sigar and equivalent OSHI methods to verify that they are the same. I would like to have this run on various platforms. I have tgz with all the libraries and code as well as a

Re: CASSANDRA-16565

2023-10-25 Thread Claude Warren, Jr via dev
I ended up posting the code at https://github.com/Aiven-Labs/compare_oshi_sigar if anyone wants to take a look and see if they get differing results on various systems. On Tue, Oct 24, 2023 at 4:59 PM Brandon Williams wrote: > On Tue, Oct 24, 2023 at 7:48 AM Claude Warren, Jr via dev >

Development Dependencies documentation.

2023-10-25 Thread Claude Warren, Jr via dev
I just had to change dependencies in Cassandra for the first time and I think the documentation [1] is out of date. First I think most of the file edits are in the ".build" directory. Adding jars to the "lib" directory works until calling "ant realclean", so perhaps the instructions should inclu

Re: CASSANDRA-18775 (Cassandra supported OSs)

2023-10-25 Thread Claude Warren, Jr via dev
I closed 18775 as it did not seem reasonable after discussions here. I have been working on 16565 and have a pull request [1] and an experimental suite to show the differences. [2] [1] https://github.com/apache/cassandra/pull/2842 [2] https://github.com/Aiven-Labs/compare_oshi_sigar On Wed, Oc

Immediately Deprecated Code

2023-10-31 Thread Claude Warren, Jr via dev
I was thinking about code that is used to migrate from one version to another. For example the code that rewrote the order of the hash values used for Bloom filters. That code was necessary for the version it was coded in. But the next version does not need that code because the next version is

Re: Immediately Deprecated Code

2023-10-31 Thread Claude Warren, Jr via dev
sandra/blob/trunk/test/distributed/org/apache/cassandra/distributed/upgrade/UpgradeTestBase.java#L97-L108 > > ____ > From: Claude Warren, Jr via dev > Sent: Tuesday, October 31, 2023 10:57 > To: dev > Cc: Claude Warren, Jr > Subject: Immediatel

Re: [DISCUSS] CEP-36: A Configurable ChannelProxy to alias external storage locations

2023-10-31 Thread Claude Warren, Jr via dev
t to 20TB / node, easy scale up / > down, etc, and always-ZCS for non-object store deployments. > > Jon > > On 2023/09/25 06:48:06 "Claude Warren, Jr via dev" wrote: > > I have just filed CEP-36 [1] to allow for keyspace/table storage outside > of > > the sta

Re: Immediately Deprecated Code

2023-10-31 Thread Claude Warren, Jr via dev
you upgrade >> logic was triggered. >> >> (1) >> https://github.com/apache/cassandra/blob/trunk/test/distributed/org/apache/cassandra/distributed/upgrade/UpgradeTestBase.java#L97-L108 >> >> >> From: Claude Warren, Jr

Re: Immediately Deprecated Code

2023-11-01 Thread Claude Warren, Jr via dev
> For offline upgrades, we are aiming to maintain all compatibility. > > Take care when removing code, there are various (serdes) classes that look > like they are for other components but are also used in the storage engine. > > > > On Tue, 31 Oct 2023 at 18:42, Claude Warr

Re: [DISCUSS] CASSANDRA-19104: Standardize tablestats formatting and data units

2023-12-04 Thread Claude Warren, Jr via dev
Why not change the option so that -H will operate as it does now while -Hn (where n is a digit) will limit the number of decimal places to n. On Mon, Dec 4, 2023 at 5:11 PM Brad wrote: > Thanks, Jacek. Using three significant digits for disk space is a good > suggestion. > > On Mon, Dec 4, 2023

Re: Custom FSError and CommitLog Error Handling

2023-12-12 Thread Claude Warren, Jr via dev
I can see this as a strong improvement in Cassandra management and support it. +1 non binding On Mon, Dec 11, 2023 at 8:28 PM Raymond Huffman wrote: > Hello All, > > On our fork of Cassandra, we've implemented some custom behavior for > handling CommitLog and SSTable Corruption errors. Specific

[DISCUSS] Replace Sigar with OSHI (CASSANDRA-16565)

2023-12-14 Thread Claude Warren, Jr via dev
Greetings, I have submitted a pull request[1] that replaces the unsupported Sigar library with the maintained OSHI library. OSHI is an MIT licensed library that provides information about the underlying OS much like Sigar did. The change adds a dependency on oshi-core at the following coordinate

Re: [DISCUSS] Replace Sigar with OSHI (CASSANDRA-16565)

2023-12-17 Thread Claude Warren, Jr via dev
Can I get an another review/approval for the pull request? https://github.com/apache/cassandra/pull/2842/files On Fri, Dec 15, 2023 at 4:04 AM guo Maxwell wrote: > +1 too > > Mick Semb Wever 于2023年12月15日周五 10:01写道: > >> >> >> >>> >>> Are there objections to making this switch and adding a new d

Re: [DISCUSS] Replace Sigar with OSHI (CASSANDRA-16565)

2023-12-18 Thread Claude Warren, Jr via dev
The pull request is : https://github.com/apache/cassandra/pull/2842 On Mon, Dec 18, 2023 at 10:26 AM Mick Semb Wever wrote: > > > Can I get an another review/approval for the pull request? >> https://github.com/apache/cassandra/pull/2842/files >> > > > It is not clear on the ticket what is being

Re: Call for Presentations closing soon: Community over Code EU 2024

2024-01-09 Thread Claude Warren, Jr via dev
Additionally, if you have a talk about some underlying technology that could be applicable across multiple projects submit it or a poster based on it. We are looking for good cross-project presentations. Claude Chair, Community over Code, EU 2024. On Mon, Jan 8, 2024 at 8:24 PM Paulo Motta wrot

[DISCUSS] Update cassandra-stress to use Apache Commons CLI (CASSANDRA-18661)

2024-03-08 Thread Claude Warren, Jr via dev
I have been working on CASSANDRA-18661 to see if it is possible to migrate to the Apache commons-cli as noted in the ticket. It is possible to do so, and after several pull requests to commons-cli, I have managed to migrate the settings of the stress tool. We will have to wait for commons-cli 1.7

Patently invalid Compression parameters in CompressedSequentialWriterTest

2024-03-15 Thread Claude Warren, Jr via dev
I have been working at cleaning up the Yaml configuration for default table compression settings and found that the CompressedSequentialWriterTest uses some parameters that are outside the acceptable limits (like bufferLength not a power of 2, or maxCompressedLength > bufferLength). I can understa

Default table compression defined in yaml.

2024-03-18 Thread Claude Warren, Jr via dev
After much work by several people, I have pulled together the changes to define the default compression in the cassandra.yaml file and have created a pull request [1]. If you are interested this in topic, please take a look at the changes and give at least a cursory review. [1] https://github.co

Re: Default table compression defined in yaml.

2024-03-19 Thread Claude Warren, Jr via dev
precated format only to be > removed at a later version? > > > On 18/03/2024 14:39, Claude Warren, Jr via dev wrote: > > After much work by several people, I have pulled together the changes to > define the default compression in the cassandra.yaml file and have created > a pull

Re: Default table compression defined in yaml.

2024-03-19 Thread Claude Warren, Jr via dev
t;> deprecated, and the new format is `foobar: 123KiB`. Is there a need to >> introduce new settings entries with the deprecated format only to be >> removed at a later version? >> >> >> On 18/03/2024 14:39, Claude Warren, Jr via dev wrote: >> >> After m

Re: Default table compression defined in yaml.

2024-03-21 Thread Claude Warren, Jr via dev
123` format in the cassandra.yaml file is >>> deprecated, and the new format is `foobar: 123KiB`. Is there a need to >>> introduce new settings entries with the deprecated format only to be >>> removed at a later version? >>> >>> >>> On 18/03/202

Re: Default table compression defined in yaml.

2024-03-21 Thread Claude Warren, Jr via dev
t;"sstable" key sounds good to me. > > - - -- --- - - > Jacek Lewandowski > > > czw., 21 mar 2024 o 08:32 Claude Warren, Jr via dev < > dev@cassandra.apache.org> napisał(a): > >> Jacek, >> >> I am a bit confused here.

Re: [DISCUSS] CEP-40: Data Transfer Using Cassandra Sidecar for Live Migrating Instances

2024-04-18 Thread Claude Warren, Jr via dev
I think this solution would solve one of the problems that Aiven has with node replacement currently. Though TCM will probably help as well. On Mon, Apr 15, 2024 at 11:47 PM German Eichberger via dev < dev@cassandra.apache.org> wrote: > Thanks for the proposal. I second Jordan that we need more

Re: discuss: add to_human_size function

2024-04-19 Thread Claude Warren, Jr via dev
I like the idea. Is the intention to have the of the function be parsable by the config parsers like DataRateSpec, DataStorageSpec, or DurationSpec? Claude On Thu, Apr 18, 2024 at 9:47 PM Ariel Weisberg wrote: > Hi, > > I think it’s a good quality of life improvement, but I am someone who > b

Re: discuss: add to_human_size function

2024-04-25 Thread Claude Warren, Jr via dev
27;) > > I think this is more flexible and we should funnel this via > DataStorageSpec and similar as you mentioned. > > In the future, we might also add to_human_duration which would be > implemented against DurationSpec so similar conversions are possible. > > On Fri, Apr 19

Re: [DISCUSS] CEP-40: Data Transfer Using Cassandra Sidecar for Live Migrating Instances

2024-05-01 Thread Claude Warren, Jr via dev
Alex, you write: > We can implement CEP-40 using a similar approach: we can leave the source > node as both a read and write target, and allow the new node to be a target > for (pending) writes. Unfortunately, this does not help with availability > (in fact, it decreases write availability, sinc

Re: [DISCUSS] CEP-42: Constraints Framework

2024-06-12 Thread Claude Warren, Jr via dev
> > 2) > Is part of an enum is somehow suplying the lack of enum types. Constraint > could be something like CONSTRAINT belongsToEnum([list of valid values], > field): > CREATE TABLE keyspace.table ( > field text CONSTRAINT belongsToEnum(['foo', 'foo2'], field), > ... > ); > 3) > Similarly, we

Re: [DISCUSS] Stream Pipelines on hot paths

2024-06-13 Thread Claude Warren, Jr via dev
I brought this topic to commons-collections because we use some streaming in the Bloom filter implementation where we are very sensitive to processing time. I received this answer over there and thought I would bring the information here: You need to test it with some realistic data for a benchma

Re: [VOTE] Release Apache Cassandra 5.0-rc1

2024-07-01 Thread Claude Warren, Jr via dev
Perhaps we should consider a Milestone release. At least in some projects this is a way to provide a test bed with known issues that will be corrected before an RC. On Sun, Jun 30, 2024 at 9:50 PM Jon Haddad wrote: > This came in after our vote, but we might also have a problem with > performin

Re: [VOTE] Release Apache Cassandra 5.0-rc1

2024-07-03 Thread Claude Warren, Jr via dev
issues that will be > corrected before an RC. > > How does that differ from beta in our lifecycle? API stable but a test bed > to suss out issues like this. > > > On Mon, Jul 1, 2024, at 9:30 AM, Claude Warren, Jr via dev wrote: > > Perhaps we should consider a Mil

Re: [DISCUSS] Replace airlift/airline library with Picocli

2024-07-16 Thread Claude Warren, Jr via dev
There are several reasons to consider updating, foremost in my mind is the changes coming as part of CRA in Europe. IANAL, but I don't think that non-maintained code will meet the CRA requirements, nor will code maintained by a single individual. Our best approach may be to try to get picocli me

Re: [DISCUSS] Replace airlift/airline library with Picocli

2024-07-17 Thread Claude Warren, Jr via dev
My CRA arguments basically revolve around the "Open Source Steward" from the CRA. As far as I recall, for open source software to be used in commercial projects it must be maintained by a steward. The definition of steward is being discussed but foundations generally meet the requirement, project

review request for pull 1741

2022-08-02 Thread Claude Warren, Jr via dev
Greetings, Can I get a review of https://github.com/apache/cassandra/pull/1741 other than the obvious issue with CHANGES.txt does anyone see anyting that needs to be fixed?

CASSANDRA-14940 and flaky tests

2022-08-04 Thread Claude Warren, Jr via dev
I started looking at the backlog of critical errors in Jira. It contains a fully working example of the issue. While it was reported under version 3.11.3 it appears to be present under 4.0.5. I don't know the "go" language but my reading of the script is that, in a single cassandra configuratio

Re: key apt by apache cassandra seems deprecated and cqlsh is broken

2022-08-09 Thread Claude Warren, Jr via dev
Could this be related to the deprecation of apt-key on your system? You don't specify what version of which distribution you are using. However, there is a good example of how to solve the issue at https://www.linuxuprising.com/2021/01/apt-key-is-deprecated-how-to-add.html On Tue, Aug 9, 2022

[DISCUSS] Remove Dead Pull Requests

2022-08-10 Thread Claude Warren, Jr via dev
At the moment we have 222 open pull requests. Some dating back 4 years. For some the repository from which they were pulled from has been deleted. For many there are branch conflicts. Now, I am new here so please excuse any misstatements and attribute to ignorance not malice any offence. I would

Re: Cassandra project status update 2022-08-03

2022-08-10 Thread Claude Warren, Jr via dev
Perhaps flaky tests need to be handled differently. Is there a way to build a statistical model of the current flakiness of the test that we can then use during testing to accept the failures? So if an acceptable level of flakiness is developed then if the test fails, it needs to be run again or

Re: [DISCUSS] Remove Dead Pull Requests

2022-08-11 Thread Claude Warren, Jr via dev
e quite a lot of tickets with patches attached that the project > has either not been able to act on at the time; or which the original > contributor started but was unable to complete. We’ve picked up many of > these after a couple years and carried them to completion. Byte-comparable > ty

Re: [DISCUSS] Remove Dead Pull Requests

2022-08-11 Thread Claude Warren, Jr via dev
écrit : > >> Those all seem like good suggestions to me >> >> On 11 Aug 2022, at 08:44, Claude Warren, Jr via dev < >> dev@cassandra.apache.org> wrote: >> >>  >> My original goal was to reduce the number of pull requests in the backlog >> as

[Proposal] add pull request template

2022-08-15 Thread Claude Warren, Jr via dev
Github provides the ability to add a pull request template [1]. I think that such a template could assist in making the pull requests better. Something like the text below, along with verifying that CASSANDRA-### will link to Jira [2], should provide the information needed and remind submitters of

Re: [Proposal] add pull request template

2022-08-15 Thread Claude Warren, Jr via dev
to merge it will take extra > time and care when it comes to these formalities anyway. The reason > why a PR should be merged should be the priority. > > On Mon, 15 Aug 2022 at 10:41, Claude Warren, Jr via dev > wrote: > > > > Github provides the ability to add a

Re: [Proposal] add pull request template

2022-08-16 Thread Claude Warren, Jr via dev
I am all for simplification. How about - start of text Issue resolved: CASSANDRA- - [ ] Jira ticket contains a description of: what is fixed, why it is needed, and what branches to apply it to. - [ ] Commits have been squashed to remove intermediate development commit message

Re: [Proposal] add pull request template

2022-08-18 Thread Claude Warren, Jr via dev
Since there seems to be agreement, I opened a ticket (CASSANDRA-17837) and a pull request (https://github.com/apache/cassandra/pull/1799) in so that the final text can be hashed out and accepted. I also used the proposed pull request in the text of the pull so that it can be seen in all its glory

Is this an MV bug?

2022-08-19 Thread Claude Warren, Jr via dev
# Table definitions Table [ Primary key ] other data base [ A B C ] D E MV[ D C ] A B E # Initial data base -> MV [ a b c ] d e -> [d c] a b e [ a' b c ] d e -> [d c] a' b e ## Mutations -> expected outcome M1: base [ a b c ] d e' -> MV [ d c ] a b e' M2: base [ a b c ] d' e

Re: Is this an MV bug?

2022-08-19 Thread Claude Warren, Jr via dev
SP.mutate > > > On 19 Aug 2022, at 10:05, Claude Warren, Jr via dev < > dev@cassandra.apache.org> wrote: > > > >  > > > > # Table definitions > > > > Table [ Primary key ] other data > > base [ A B C ] D E > > MV[ D C ] A B E >

Re: Is this an MV bug?

2022-08-19 Thread Claude Warren, Jr via dev
Perhaps my diagram was not clear. I am starting with mutations on the base table. I assume they are not bundled together so from separate CQL statements. On Fri, Aug 19, 2022 at 11:11 AM Claude Warren, Jr wrote: > If each mutation comes from a separate CQL they would be separate, no? >

Re: [DISCUSS] CEP-20: Dynamic Data Masking

2022-08-21 Thread Claude Warren, Jr via dev
I am more interested in the motivation where it is stated: Many users have the need of masking sensitive data, such as contact info, > age, gender, credit card numbers, etc. Dynamic data masking (DDM) allows to > obscure sensitive information while still allowing access to the masked > columns, an

Re: [DISCUSS] CEP-20: Dynamic Data Masking

2022-08-24 Thread Claude Warren, Jr via dev
This seems to me to be a client display filter, applied at the last moment as data are streaming back to the client. It has no impact on any keys, queries or secondary internal index or materialized view. It simply prevents the display from showing the complete value. It does not preclude determ

  1   2   >