Hi,

On Fri, Sep 06, 2024 at 10:47:04PM +0200, XXX XXX wrote:
> On Fri, 6 Sep 2024 22:04:27 +0200
> Salvatore Bonaccorso <car...@debian.org> wrote:
> 
> > Control: tags -1 + moreinfo
> > 
> > Hi,
> > 
> > On Mon, Sep 02, 2024 at 11:09:42PM +0200, XXX XXX wrote:
> > > Hi,
> > > this bug seems to be fixed in linux kernel 6.1.107,
> > > I suspect the commit that fixed it is:
> > > 
> > > commit 6dcc8ba8a6074bb79040f502dc66ad23a58a1c86
> > > Author: Florian Westphal <f...@strlen.de>
> > > Date:   Wed Aug 7 21:28:41 2024 +0200
> > > 
> > >     netfilter: nf_queue: drop packets with cloned unconfirmed conntracks
> > >     
> > >     [ Upstream commit 7d8dc1c7be8d3509e8f5164dd5df64c8e34d7eeb ]
> > >     
> > >     Conntrack assumes an unconfirmed entry (not yet committed to global 
> > > hash
> > >     table) has a refcount of 1 and is not visible to other cores.
> > >     
> > >     With multicast forwarding this assumption breaks down because such
> > >     skbs get cloned after being picked up, i.e.  ct->use refcount is > 1.
> > >     
> > >     Likewise, bridge netfilter will clone broad/mutlicast frames and
> > >     all frames in case they need to be flood-forwarded during learning
> > >     phase.
> > >     
> > >     For ip multicast forwarding or plain bridge flood-forward this will
> > >     "work" because packets don't leave softirq and are implicitly
> > >     serialized.
> > >     
> > >     With nfqueue this no longer holds true, the packets get queued
> > >     and can be reinjected in arbitrary ways.
> > >     
> > >     Disable this feature, I see no other solution.
> > >     
> > >     After this patch, nfqueue cannot queue packets except the last
> > >     multicast/broadcast packet.
> > >     
> > >     Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
> > >     Signed-off-by: Florian Westphal <f...@strlen.de>
> > >     Signed-off-by: Pablo Neira Ayuso <pa...@netfilter.org>
> > >     Signed-off-by: Sasha Levin <sas...@kernel.org>
> > 
> > Would you be able to confirm this? In case this is true, then this
> > would imply that the issue should be visible as well current testing
> > until <= 6.10.7-1.
> > 
> > Regards,
> > Salvatore
> 
> Hi,
> 
> for sure it was visible in linux-image-6.10.6+bpo-amd64 that I tried from 
> stable backports after the trace popped up again after upgrading to 
> linux-image-6.1.0-25-amd64.
> So  by checking the changelog for the source file and line shown in the 
> traces on kernel.org 
> I've spotted this patch that was interesting because  I use suricata in 
> nfqueue
> mode and because the trace happened always at boot (during the learning 
> phase).
> So I first erroneously I tried 6.1.106 and the trace was still there
> and then 6.1.107 and it was gone.
> Hope this helps.

Yes thanks. One option to get a final confirmation and proper closure
tracking, would be if you can cherry-pick the commit on top of the
6.1.106-3 version and see if it resolved the issue.

You could proceed as described in

https://kernel-team.pages.debian.net/kernel-handbook/ch-common-tasks.html#id-1.6.6.4

Regards,
Salvatore

Reply via email to