[
https://issues.apache.org/jira/browse/CASSANDRA-20829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18014093#comment-18014093
]
Stefan Miklosovic commented on CASSANDRA-20829:
-----------------------------------------------
[CASSANDRA-20829-4.1|https://github.com/instaclustr/cassandra/tree/CASSANDRA-20829-4.1]
{noformat}
java11_pre-commit_tests
✓ j11_build 1m 58s
✓ j11_cqlsh_dtests_py311 5m 44s
✓ j11_cqlsh_dtests_py311_vnode 6m 42s
✓ j11_cqlshlib_cython_tests 11m 8s
✓ j11_cqlshlib_tests 7m 25s
✓ j11_dtests_vnode 41m 39s
✓ j11_jvm_dtests 18m 38s
✓ j11_jvm_dtests_vnode 12m 10s
✓ j11_unit_tests 10m 31s
✓ j11_unit_tests_repeat 32m 8s
✕ j11_cqlsh_dtests_py3 5m 42s
cql_tracing_test.TestCqlTracing test_tracing_simple
cql_tracing_test.TestCqlTracing test_tracing_unknown_impl
✕ j11_cqlsh_dtests_py38 5m 51s
cql_tracing_test.TestCqlTracing test_tracing_default_impl
✕ j11_cqlsh_dtests_py38_vnode 6m 1s
cql_tracing_test.TestCqlTracing test_tracing_unknown_impl
✕ j11_cqlsh_dtests_py3_vnode 6m 1s
cql_tracing_test.TestCqlTracing test_tracing_default_impl
✕ j11_dtests 55m 31s
refresh_test.TestRefresh test_refresh_deadlock_startup
{noformat}
I am not sure what's up with TestCqlTracing, that is hardly related to what we
do here. It does not fail in Jenkins.
[java11_pre-commit_tests|https://app.circleci.com/pipelines/github/instaclustr/cassandra/5946/workflows/599c24f0-d47b-49d8-b76d-c9bdfb00fda1]
> Secondary index implementations do not integrate with IndexGCTransaction when
> compaction contains fully expired SSTables
> ------------------------------------------------------------------------------------------------------------------------
>
> Key: CASSANDRA-20829
> URL: https://issues.apache.org/jira/browse/CASSANDRA-20829
> Project: Apache Cassandra
> Issue Type: Bug
> Components: Feature/2i Index, Local/Compaction, Local/Compaction/TWCS
> Reporter: Stefan Miklosovic
> Assignee: Stefan Miklosovic
> Priority: Normal
> Fix For: 4.0.x, 4.1.x
>
> Time Spent: 4.5h
> Remaining Estimate: 0h
>
> There is a test (1) which ensures that when data are TTLed and compacted,
> IndexGCTransaction is aware of that and it will invoke Indexer.removeRow()
> method eventually.
> However, this is not working properly when we have fully expired SSTables,
> e.g. as the result of a table being on TWCS and having TTL on that.
> The reason is that in CompactionTask, we are filtering out fully expired ones
> (2). These then do not go to the compaction process and then they are not
> reacted on in listener() (3) which contains this logic (4). Eventually,
> onRowMerge in IndexGCTransaction will make the diff and in its commit
> indexer.removeRow(row); will notify 2i about its removal.
>
> This integration is missing and it is quite a big problem because if there
> are custom secondary index implementations the fact that SSTables were fully
> expired is not propagated to them which means that data are never removed
> from whatever backend they use.
> The solution is to go to the compaction with fully expired SSTables as well
> _but only if we detected that respective column family has some indexes_
>
> (1)
> [https://github.com/apache/cassandra/blob/cassandra-4.1/test/unit/org/apache/cassandra/index/CustomIndexTest.java#L583-L607]
> (2)
> [https://github.com/apache/cassandra/blob/cassandra-4.1/src/java/org/apache/cassandra/db/compaction/CompactionTask.java#L174]
> (3)
> [https://github.com/apache/cassandra/blob/cassandra-4.1/src/java/org/apache/cassandra/db/compaction/CompactionIterator.java#L130]
> (4)
> [https://github.com/apache/cassandra/blob/cassandra-4.1/src/java/org/apache/cassandra/db/compaction/CompactionIterator.java#L235-L252]
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]