On Thu, May 28, 2015 at 10:41 PM, Andrey Korolyov <and...@xdel.ru> wrote:
> Hi,
>
> I am currently playing with the SYNPROXY target to optimize SYN
> filtering performance and happened to find that TCP SYN packets with
> port 0 can cause a soft lockup with conntrack alone enabled, given a
> high packet rate (I've reached 450 kpps so far with 60-byte packets
> on a /32<->/32 flood, with flow control enabled at the media level and
> a mid-range E3 Xeon on the receiver side). The same flood with a
> nonzero port runs fine, reaching the same peak rate without any
> visible lockups in the kernel log. I've tested this on a broad range
> of 3.x kernels and all of them appear to be affected. A quick and
> dirty grep turned up special handling of port 0 only in the
> protocol-specific helpers, but none of those are loaded.
>
> Please find both the sample captures and the traceback below.

Attached is a trace taken without GSO, since its presence may have made
the previous sample somewhat confusing. The testbed uses
net.nf_conntrack_max = 2000000; I forgot to mention that previously.

[52671.706307] BUG: soft lockup - CPU#0 stuck for 24s! [rcuos/0:18]
[52671.706331] Modules linked in: ixgbe mdio xt_CT iptable_raw ipt_SYNPROXY 
nf_synproxy_core nf_conntrack_ipv4 nf_defrag_ipv4 xt_tcpudp xt_conntrack 
nf_conntrack iptable_filter ip_tables x_tables tun openvswitch nfsd auth_rpcgss 
oid_registry nfs_acl nfs lockd dns_resolver fscache sunrpc bridge stp llc 
w83627ehf hwmon_vid loop fuse dm_crypt dm_mod coretemp kvm_intel snd_pcm kvm 
snd_page_alloc snd_timer crc32c_intel ghash_clmulni_intel aesni_intel 
aes_x86_64 ablk_helper snd cryptd lrw iTCO_wdt soundcore iTCO_vendor_support 
gf128mul glue_helper joydev evdev pcspkr video processor button i2c_i801 
lpc_ich mfd_core shpchp ext4 crc16 jbd2 mbcache microcode sg sd_mod crc_t10dif 
hid_generic usbhid hid ahci libahci libata mpt2sas igb raid_class i2c_algo_bit 
dca scsi_transport_sas ptp pps_core i2c_core ehci_pci
[52671.706364]  scsi_mod ehci_hcd xhci_hcd usbcore usb_common thermal fan 
thermal_sys
[52671.706369] CPU: 0 PID: 18 Comm: rcuos/0 Not tainted 3.10-0.bpo.3-amd64 #1 Debian 3.10.11-1~bpo70+19
[52671.706370] Hardware name: Supermicro X10SL7-F/X10SL7-F, BIOS 2.00 04/24/2014
[52671.706372] task: ffff88040d528790 ti: ffff88040d532000 task.ti: ffff88040d532000
[52671.706373] RIP: 0010:[<ffffffff812bf927>]  [<ffffffff812bf927>] sock_wfree+0x42/0x5b
[52671.706377] RSP: 0018:ffff88041fc03b18  EFLAGS: 00000202
[52671.706378] RAX: ffffffff812bf801 RBX: ffff88041fc10110 RCX: 0000000000000024
[52671.706379] RDX: ffff8802e0751610 RSI: 00000000ffffffff RDI: ffff88041fc0fe40
[52671.706380] RBP: 0000000000000300 R08: 0000000000000000 R09: 0000000000000000
[52671.706381] R10: ffff88041fc0ff44 R11: 0000000000000001 R12: ffff88041fc03a88
[52671.706382] R13: ffffffff8139705d R14: 0000000000000300 R15: ffff88041fc0fe40
[52671.706383] FS:  0000000000000000(0000) GS:ffff88041fc00000(0000) knlGS:0000000000000000
[52671.706384] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[52671.706385] CR2: 00007fb866cc2140 CR3: 000000000160c000 CR4: 00000000001407f0
[52671.706386] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[52671.706387] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[52671.706388] Stack:
[52671.706389]  ffff88040ac7b000 ffff88040ac7aa00 ffff88040ac7b000 ffffffff812f8747
[52671.706391]  ffff88041fc0fe40 ffffffff812fb0d8 ffff88041fc03b80 ffff88041fc03b78
[52671.706392]  ffffffff00000040 0000000000000286 ffffffff817dda00 ffffffff0214140a
[52671.706394] Call Trace:
[52671.706395]  <IRQ>
[52671.706399]  [<ffffffff812f8747>] ? skb_orphan+0x12/0x27
[52671.706402]  [<ffffffff812fb0d8>] ? ip_send_unicast_reply+0x243/0x297
[52671.706406]  [<ffffffff810477bf>] ? mod_timer+0x7b/0x89
[52671.706409]  [<ffffffff813120c2>] ? tcp_v4_send_reset+0x2db/0x324
[52671.706411]  [<ffffffff81312e56>] ? tcp_v4_rcv+0x387/0x559
[52671.706413]  [<ffffffff812f6017>] ? __xfrm_policy_check2.constprop.9+0x50/0x50
[52671.706415]  [<ffffffff812f6117>] ? ip_local_deliver_finish+0x100/0x176
[52671.706418]  [<ffffffff812cd640>] ? __netif_receive_skb_core+0x447/0x4bf
[52671.706420]  [<ffffffff812cd893>] ? netif_receive_skb+0x4c/0x7d
[52671.706422]  [<ffffffff812ce013>] ? napi_gro_receive+0x35/0x76
[52671.706427]  [<ffffffffa04ab24c>] ? ixgbe_poll+0xbc9/0xe0a [ixgbe]
[52671.706429]  [<ffffffff812cddaa>] ? net_rx_action+0xa7/0x1e1
[52671.706431]  [<ffffffff8106442c>] ? account_system_time+0x113/0x12c
[52671.706433]  [<ffffffff81041683>] ? __do_softirq+0xf1/0x216
[52671.706436]  [<ffffffff8139781c>] ? call_softirq+0x1c/0x30
[52671.706436]  <EOI>
[52671.706439]  [<ffffffff8100eade>] ? do_softirq+0x3a/0x78
[52671.706440]  [<ffffffff8104142e>] ? _local_bh_enable_ip.isra.11+0x6a/0x88
[52671.706443]  [<ffffffff810a2413>] ? rcu_nocb_kthread+0x25e/0x298
[52671.706445]  [<ffffffff810573e3>] ? abort_exclusive_wait+0x79/0x79
[52671.706447]  [<ffffffff810a21b5>] ? force_qs_rnp+0x120/0x120
[52671.706448]  [<ffffffff810a21b5>] ? force_qs_rnp+0x120/0x120
[52671.706450]  [<ffffffff81056a54>] ? kthread+0x81/0x89
[52671.706452]  [<ffffffff810126f9>] ? paravirt_read_tsc+0x5/0x8
[52671.706454]  [<ffffffff810569d3>] ? __kthread_parkme+0x5d/0x5d
[52671.706457]  [<ffffffff813963bc>] ? ret_from_fork+0x7c/0xb0
[52671.706458]  [<ffffffff810569d3>] ? __kthread_parkme+0x5d/0x5d
[52671.706459] Code: e6 ff ff 84 c0 75 1d 8d 7d ff bd 01 00 00 00 48 8d b3 04 01 00 00 e8 7d e3 ff ff 48 89 df ff 93 70 02 00 00 f0 29 ab 04 01 00 00 <40> 0f 94 c5 40 84 ed 74 0b 48 89 df 5b 5b 5d e9 94 fe ff ff 41
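
Since the testbed raises net.nf_conntrack_max to 2000000, it may help to
watch how full the conntrack table gets while the flood is running. What
follows is only a minimal sketch and was not part of the original test:
the helper name conntrack_usage is made up for illustration, and it
assumes the usual procfs entries nf_conntrack_count and nf_conntrack_max
under /proc/sys/net/netfilter, which are present only when nf_conntrack
is loaded.

```python
from pathlib import Path

# Assumed location of the conntrack sysctl files; they exist only while
# the nf_conntrack module is loaded.
NETFILTER_DIR = Path("/proc/sys/net/netfilter")


def conntrack_usage(base_dir=NETFILTER_DIR):
    """Return (count, limit, fill_ratio) read from the conntrack sysctls.

    conntrack_usage is a hypothetical helper, not an existing tool.
    base_dir is a parameter so the function can be pointed at a
    directory of fixture files instead of the live procfs.
    """
    base = Path(base_dir)
    # Each file holds a single decimal integer followed by a newline.
    count = int((base / "nf_conntrack_count").read_text().strip())
    limit = int((base / "nf_conntrack_max").read_text().strip())
    # Guard against a zero limit to avoid a division error.
    ratio = count / limit if limit else 0.0
    return count, limit, ratio
```

Calling conntrack_usage() with no arguments on the receiver, for example
once a second during the flood, would show whether the table approaches
the configured limit before the lockup is reported.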
