On Jul 16, 2023, at 18:17, Kevin Oberman <rkober...@gmail.com> wrote:
> On Sun, Jul 16, 2023 at 1:57 PM Mark Millard <mark...@yahoo.com> wrote: >> I used a sequence that looked like: >> >> mount -onoatime -tmsdosfs /dev/gpt/CA72optM2efi /CA72optM2efi-media/ \ >> && rsync -x --delete -aAUHhh --info=progress2 /boot/efi/ /CA72optM2efi-media/ >> >> that got: >> >> file has vanished: "/CA72optM2efi-media/BCM271~5.DTB" >> file has vanished: "/CA72optM2efi-media/BCM271~6.DTB" >> 73.77K 0% 1.63MB/s 0:00:00 (xfr#7, to-chk=0/493) rsync >> warning: some files vanished before they could be transferred (code 24) at >> main.c(1357) [sender=3.2.7] >> >> After that, activity reported the likes of: >> >> rsync: [generator] delete_file: unlink(overlays/VC4-KM~8.DTB) failed: >> Read-only file system (30) >> and: >> rsync: [receiver] mkstemp "/CA72optM2efi-media/.fixup4.dat.2Wonu9" failed: >> Read-only file system (30) >> >> More than rsync was odd at that point: >> >> # ls -Tld /CA72optM2efi-media/*.DTB >> ls: /CA72optM2efi-media/BCM271~5.DTB: No such file or directory >> ls: /CA72optM2efi-media/BCM271~6.DTB: No such file or directory >> >> # rm /CA72optM2efi-media/*/*.DTB >> override rwxr-xr-x root/wheel uarch for >> /CA72optM2efi-media/overlays/SDHOST~1.DTB? y >> rm: /CA72optM2efi-media/overlays/SDHOST~1.DTB: Read-only file system >> . . . >> >> But: >> >> # mount | grep media >> /dev/gpt/CA72optM2efi on /CA72optM2efi-media (msdosfs, local, noatime) >> >> So the mount itself was not the source of the read-only status so far. >> >> I then tried: >> >> # umount /CA72optM2efi-media >> # newfs_msdos /dev/da0p1 >> # mount -onoatime -tmsdosfs /dev/gpt/CA72optM2efi /mnt >> # cp -aRx /boot/efi/ /mnt/ >> cp: utimensat: /mnt: Invalid argument >> >> (which is normal). >> >> # umount /mnt >> >> No more oddities , so far after that. >> >> >> For reference: >> >> # uname -apKU >> FreeBSD CA72-16Gp-ZFS 14.0-CURRENT FreeBSD 14.0-CURRENT #99 >> main-n264171-2a0c0aea4209-dirty: Fri Jul 14 21:00:44 PDT 2023 >> root@CA72-16Gp-ZFS:/usr/obj/BUILDs/main-CA72-nodbg-clang/usr/main-src/arm64.aarch64/sys/GENERIC-NODBG-CA72 >> arm64 aarch64 1400093 1400093 >> >> # pkg info rsync >> rsync-3.2.7 >> Name : rsync >> Version : 3.2.7 >> Installed on : Sat Jul 15 14:53:48 2023 PDT >> Origin : net/rsync >> Architecture : FreeBSD:14:aarch64 >> . . . >> Annotations : >> FreeBSD_version: 1400092 >> build_timestamp: 2023-07-02T06:57:44+0000 >> built_by : poudriere-git-3.3.99.20220831 >> cpe : cpe:2.3:a:samba:rsync:3.2.7:::::freebsd14:aarch64 >> port_checkout_unclean: no >> port_git_hash : f45cd5bd9d4b >> ports_top_checkout_unclean: yes >> ports_top_git_hash: 880f72e54deb > > > === > Mark Millard > marklmi at yahoo.co > > This looks a bit like an issue I was hitting on a 4 CPU, 4 thread Alder Lake > processor and 500GB SSD running 13.2-RELEASE. > > I saw several very strange corruptions, at least one "rsync warning: some > files vanished before they could be transferred". In one (the last) case, the > system crashed. The 'disc' was corrupted badly enough that fsck failed and I > could not boot up the system. The disk was UFS2 GPT format and EFI boot. The > interface is SATA, not nVME. In this case, I was installing on a new system > and copying the majority of the file system from my old system. > > The 'fix' is strange and probably not one many other can use. I installed a > spinning rust drive of 500GB and installed FreeBSD and used rsync again and > it worked. I can't say whether it was a fluke that it worked, but it really > smells like some sort of race condition. Could be rsync , VFS, or device > driver. Since then I have seen one crash while backing up the system disk > using rsync. No corruption and doing another rsync after reboot worked fine, > but it was a much smaller run as the first attempt was nearly complete when > the system crashed. Maybe unrelated. I do have the core file from the crash. > Stil, something weird has been going on. Same issue on two identical systems, > so not likely hardware. Adding background from my example, helping identify the variety of contexts that sometimes the message: The target drive was a 894 GiByte USB3 drive, of all things a Optane U2 used via a USB3 adapter. GPT. Per the mount that I showed: msdosfs partition, the one that has FreeBSD's UEFI loader copy that would be used in booting. (In my context, it also gets RPi* related boot materials to allow booting RPi4B's, a RPi3B, and a RPi2B v1.2. There are directories used as holding areas for alternate RPi* related materials (versions), the holding area for the ready-to-use materials being empty at the time: the mterials are out where they would be used.) The booted world/kernel media was a PCIe Optane. Its msdosfs also has the RPi* materials, despite the PCIe Optane not being bootable on an RPi* configuration that I can form. These materials where what rsync was copying from. So the rsync was between 2 msdosfs, not across file system types. aarch64 with 16 Cortex-A72 's and 64 GiBytes of RAM. main (so: 14). So a fair number of things are not in common with your examples, which might at least help limit the range of potential answers to: "what all is common to all the failures?". === Mark Millard marklmi at yahoo.com