*** This bug is a duplicate of bug 1926808 *** https://bugs.launchpad.net/bugs/1926808
** This bug has been marked a duplicate of bug 1926808 Bionic update: upstream stable patchset 2021-04-30 -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-signed-hwe in Ubuntu. https://bugs.launchpad.net/bugs/1922466 Title: Writeback not flushing to disk in 4.15.0-137-generic and above Status in linux-signed-hwe package in Ubuntu: Confirmed Bug description: Hi! We've come across some interesting behaviour in kernel 4.15.0-137.141~16.04.1 and above. After booting a fresh Ubuntu 16.04 instance on AWS, we replace the AWS kernel with "linux-image-4.15.0-140-generic" (4.15.0-140.144~16.04.1) and reboot. Then we generate some I/O by running fio for a while: fio --name=random-write --ioengine=posixaio --rw=randwrite --bs=64k --size=256m --numjobs=16 --iodepth=16 --runtime=3600 --time_based --end_fsync=1 It does't matter whether fio is run against the boot disk or an attached secondary disk. After stopping fio we notice that some pages are stuck in "writeback" and are apparently not flushing to disk: # lsb_release -rd Description: Ubuntu 16.04.7 LTS Release: 16.04 # cat /proc/vmstat | grep "nr_writeback " nr_writeback 80 # cat /proc/meminfo | grep Writeback: Writeback: 320 kB This doesn't clear, not even days later. Running more fio only increases the amount of writeback pages. Downgrading the kernel to 4.15.0-136.140~16.04.1 resolves the issue, no writeback pages getting stuck. Going over the kernel changelog, I can see that between -136 and -137 the following patchset was applied, but I'm not sure whether it is related to the issue: https://www.spinics.net/lists/stable/msg435893.html Kernels 4.15.0-137-generic and above took down our Ceph cluster, because it seems that when the amount of "writeback" reaches the buffer ceiling of "dirty_bytes", all subsequent writes to the disk are incredibly slow. This is from an idle production system (not on AWS) running 16.04 with kernel 4.15.0-139-generic: # lsb_release -rd Description: Ubuntu 16.04.4 LTS Release: 16.04 # cat /proc/sys/vm/dirty_bytes 629145600 # cat /proc/sys/vm/dirty_background_bytes 314572800 # cat /proc/meminfo | grep Writeback: Writeback: 572108 kB # dd if=/dev/zero of=/test bs=1M count=10; rm /test 10+0 records in 10+0 records out 10485760 bytes (10 MB, 10 MiB) copied, 126.529 s, 82.9 kB/s Could there be a bug in kernel 4.15.0-137-generic and above? Thank you! Kind regards, Christoph Dwertmann ProblemType: Bug DistroRelease: Ubuntu 16.04 Package: linux-image-4.15.0-140-generic 4.15.0-140.144~16.04.1 ProcVersionSignature: User Name 4.15.0-140.144~16.04.1-generic 4.15.18 Uname: Linux 4.15.0-140-generic x86_64 ApportVersion: 2.20.1-0ubuntu2.30 Architecture: amd64 Date: Sun Apr 4 03:39:25 2021 Ec2AMI: ami-041e1cc8f4c429789 Ec2AMIManifest: (unknown) Ec2AvailabilityZone: ap-southeast-2c Ec2InstanceType: c5ad.xlarge Ec2Kernel: unavailable Ec2Ramdisk: unavailable ProcEnviron: TERM=xterm-256color PATH=(custom, no user) XDG_RUNTIME_DIR=<set> LANG=en_US.UTF-8 SHELL=/bin/bash SourcePackage: linux-signed-hwe UpgradeStatus: No upgrade log present (probably fresh install) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux-signed-hwe/+bug/1922466/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp