Hi,
On 12/26/24 18:33, Julien Plissonneau Duquène wrote:
This should not make any difference in the number of write operations
necessary, and only affect ordering. The data, metadata journal and
metadata update still have to be written.
I would expect that some reordering makes it possible for fewer actual
physical write operations to happen, i.e. writes to the same or neighbouring
blocks get merged or grouped (possibly by the hardware if not by the kernel),
which would make a difference both for spinning-device performance (fewer
seeks) and for solid-state device longevity (as these have larger physical
blocks), but I don't know if that's actually how it works in this case.
On SSDs, it does not matter, both because modern media lasts longer than
the rest of the computer now, and because the drive's wear-leveling will
largely ignore the logical block addresses when deciding where to put data
on the physical medium anyway.
On hard disks, it absolutely makes a noticeable difference, but so does
journaling.
It would be surprising, though, for the dpkg man pages (among other
places) to talk about performance degradation if it were not real.
ext4's delayed allocation mainly means that the window where the inode
is zero-sized is larger (it can last a few seconds after dpkg exits with
--force-unsafe-io), so the problem is more observable, while on other
file systems you more often get lucky and your files are filled with
the desired data instead of garbage.
Delayed allocation, on the other hand, allows the file system to
merge the entire allocation for the file instead of gradually extending
it (but that can easily be fixed by using fallocate(2)).
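To illustrate, a minimal sketch of that preallocation approach (a hypothetical
helper, not dpkg's actual code; Linux-specific since it calls fallocate(2)
directly):

/* Sketch: reserve the full file size up front so the allocator can grab
 * one contiguous extent instead of growing the file piecewise. */
#define _GNU_SOURCE
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

int write_preallocated(const char *path, const char *buf, off_t len)
{
    int fd = open(path, O_WRONLY | O_CREAT | O_TRUNC, 0644);
    if (fd < 0)
        return -1;

    if (fallocate(fd, 0, 0, len) < 0)
        perror("fallocate");    /* not fatal, fall back to plain writes */

    for (off_t done = 0; done < len; ) {
        ssize_t n = write(fd, buf + done, len - done);
        if (n < 0) {
            close(fd);
            return -1;
        }
        done += n;
    }
    return close(fd);
}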
[filesystem level transactions]
That sounds interesting. But do we have filesystems on Linux that can
do that already, or is this still a wishlist item? Also worth noting: at
least one well-known implementation in another OS was deprecated [1],
citing complexity and lack of popularity as the reasons for that
decision, and the feature is missing from their next-gen FS. So maybe
it's not that great after all?
It is complex to the extent that it requires the entire file system to
be designed around it, including the file system API -- suddenly you get
things like isolation levels and transaction conflicts that programs
need to be at least vaguely aware of.
It would be easier to do in Linux than in Windows, certainly, because on
Windows, file contents bypass the file system drivers entirely, and
there are additional APIs like transfer offload that would interact
badly with a transactional interface, and that would be sorely missed by
people using a SAN as storage backend.
Anyway, besides --force-unsafe-io, the current toolbox also has:
- volume or FS snapshots, for similar or better safety but not the
automatic performance gains; probably not (yet?) available on most systems
Snapshots only work if there is a way to merge them back afterwards.
What the systemd people are doing with immutable images basically goes
in the direction of snapshots -- you'd unpack the files using "unsafe"
I/O, then finally create an image, fsync() that, and then update the OS
metadata which image to load at boot.
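A sketch of that ordering (unsafe writes first, fsync() the finished image,
then atomically flip the pointer to it). The symlink-based boot selection
here is only an assumption for illustration; real image-based systems
(systemd-sysupdate, OSTree, ...) differ in the details:

#define _GNU_SOURCE
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

/* Assumes boot_link lives in the current directory, purely for brevity. */
int publish_image(const char *image_path, const char *boot_link)
{
    int fd = open(image_path, O_RDONLY);
    if (fd < 0)
        return -1;
    if (fsync(fd) < 0) {                /* 1. image contents become durable */
        close(fd);
        return -1;
    }
    close(fd);

    char tmp_link[4096];
    snprintf(tmp_link, sizeof(tmp_link), "%s.new", boot_link);
    unlink(tmp_link);
    if (symlink(image_path, tmp_link) < 0 ||
        rename(tmp_link, boot_link) < 0)    /* 2. switch the OS metadata */
        return -1;

    int dirfd = open(".", O_RDONLY | O_DIRECTORY);
    if (dirfd < 0)
        return -1;
    if (fsync(dirfd) < 0) {             /* 3. make the rename itself durable */
        close(dirfd);
        return -1;
    }
    close(dirfd);
    return 0;
}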
- the auto_da_alloc ext4 mount option, which AIUI should do The Right
Thing in dpkg's use case even without the fsync; actual reliability and
performance impact unknown; appears to be set by default on trixie
Yes, that inserts the missing fsync(). :>
I'd expect it to perform a little better than the explicit fsync(),
though, because it does not impose an ordering between files. The
downside is that it also does not force an ordering between the file
system updates and the rewrite of the dpkg status file.
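For reference, the replace-via-rename pattern being discussed looks roughly
like this (illustrative names, trimmed error handling):

/* Write a temporary file, make it durable, then atomically put it in
 * place.  With auto_da_alloc, ext4 forces the delayed-allocation data
 * out around the rename even if the explicit fsync() below is skipped
 * (as with --force-unsafe-io); without either, a crash can leave a
 * zero-length file behind. */
#include <fcntl.h>
#include <unistd.h>

int replace_via_rename(const char *final_path, const char *tmp_path,
                       const void *data, size_t len)
{
    int fd = open(tmp_path, O_WRONLY | O_CREAT | O_TRUNC, 0644);
    if (fd < 0)
        return -1;
    if (write(fd, data, len) != (ssize_t)len || fsync(fd) < 0) {
        close(fd);
        return -1;
    }
    close(fd);
    return rename(tmp_path, final_path);
}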
What I could see working in dpkg would be delaying the fsync() call
until right before the rename(), which is in a separate "cleanup" round
of operations anyway for the cases that matter. The difficulty there is
that we'd have to keep the file descriptors open until then, which would
need careful management or a horrible hack so we don't run into the
per-user or system-wide limit on open file descriptors, and recover if
we do.
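A rough sketch of that deferred-fsync idea (hypothetical names; dpkg's real
extraction code is far more involved, and this ignores the fd-limit problem
just mentioned):

#include <fcntl.h>
#include <unistd.h>

struct pending {
    int fd;                   /* kept open since extraction */
    const char *tmp_path;     /* e.g. "usr/bin/foo.dpkg-new" */
    const char *final_path;   /* e.g. "usr/bin/foo" */
};

/* Extraction round: write the data, but defer the fsync(). */
int extract_one(struct pending *p, const void *data, size_t len)
{
    p->fd = open(p->tmp_path, O_WRONLY | O_CREAT | O_TRUNC, 0644);
    if (p->fd < 0)
        return -1;
    return write(p->fd, data, len) == (ssize_t)len ? 0 : -1;
}

/* Cleanup round: flush everything right before the renames. */
int cleanup_round(struct pending *files, size_t count)
{
    for (size_t i = 0; i < count; i++) {
        if (fsync(files[i].fd) < 0)
            return -1;
        close(files[i].fd);
        if (rename(files[i].tmp_path, files[i].final_path) < 0)
            return -1;
    }
    return 0;
}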
- eatmydata
That just neuters fsync().
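For anyone unfamiliar, the trick is an LD_PRELOAD shim along these lines (a
toy reimplementation, not eatmydata's actual source, which also covers
sync(), msync(), O_SYNC handling and more):

/* nosync.c: shadow the sync calls with no-ops. */
int fsync(int fd)     { (void)fd; return 0; }
int fdatasync(int fd) { (void)fd; return 0; }

/* Illustrative build/use:
 *   gcc -shared -fPIC -o libnosync.so nosync.c
 *   LD_PRELOAD=$PWD/libnosync.so dpkg -i something.deb
 */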
- io_uring, which allows asynchronous file operations; implementation
would require significant changes to dpkg; potential performance gains in
dpkg's use case have not been evaluated yet AFAIK, but it looks like the
right solution for that use case.
That would be Linux specific, though.
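For the record, queueing the flushes through liburing would look roughly
like this (a sketch only, with error handling trimmed; not a proposal for
dpkg's actual design):

#include <liburing.h>

int async_fsync_example(int fd)
{
    struct io_uring ring;
    struct io_uring_sqe *sqe;
    struct io_uring_cqe *cqe;
    int res;

    if (io_uring_queue_init(8, &ring, 0) < 0)
        return -1;

    sqe = io_uring_get_sqe(&ring);
    io_uring_prep_fsync(sqe, fd, 0);    /* queue the flush */
    io_uring_submit(&ring);             /* kernel works on it; we could keep
                                           extracting other files here
                                           instead of blocking */

    io_uring_wait_cqe(&ring, &cqe);     /* reap the completion */
    res = cqe->res;                     /* 0 on success, -errno otherwise */
    io_uring_cqe_seen(&ring, cqe);

    io_uring_queue_exit(&ring);
    return res;
}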
Nowadays, most machines are unlikely to be subject to power failures at
the worst time:
Yes, but we have more people running nVidia's kernel drivers now, so it
all evens out.
The decision when it is safe to skip fsync() is mostly dependent on
factors that are not visible to the dpkg process, like "will the result
of this operation be packed together into an image afterwards?", so I
doubt there is a good heuristic.
My feeling is that this is becoming less and less relevant though,
because it does not matter with SSDs.
Simon