On Tue, 02 Mar 2010 07:50 -0800, "J.C. Roberts" <[email protected]> wrote:
> And I thought I was expected to be inconsistent. ;)
>
> Anyhow, I was upgrading from the Feb 2, to the most recent snapshot, and
> fsck is coming up with a problem on one of my partitions. I can probably
> get it working ("fix" is such a strong word) with `fsck -fy` but my real
> concern is if the drive is failing?
>
> atactl tells me everything is just fine?
>
> I have a nearly identical system, with the same type of disk, which
> reports similar atactl attributes... but then again, I don't really trust
> SATA/PATA drives very much or their supposedly "smart" monitoring.
>
> The data on the system is not only backed up, but it's also easily
> replaced since the machine is only used for src and ports builds. I think
> I might lose a total of a few newly downloaded distfiles since the last
> backup.
>
> What I really want to do here is understand *why* some portion of the
> disk has become unreadable?

cd /bad_partition && dd if=/dev/zero of=big_file.zero bs=512 conv=sync,noerror

Let it run until it finishes. That won't explain why the sectors are
bad, but it may give a good indication of the problem area and answer
the failing drive question. If dd reports IO issues, you may want to
replace the drive.

Brad

> All of the below were done in single user mode over serial.
> (sorry about the width)
>
> ------------------------------------------------------------------------
> # atactl wd0 smartenable
> # atactl wd0 readattr
> Attributes table revision: 16
>  ID  Attribute name                    Threshold  Value  Raw
>   3  Spin Up Time                             63    180  0x0000000046f2
>   4  Start/Stop Count                          0    253  0x0000000000d2
>   5  Reallocated Sector Count                 63    253  0x000000000007
>   6  Read Channel Margin                     100    253  0x000000000000
>   7  Seek Error Rate                           0    253  0x000000000000
>   8  Seek Time Performance                   187    253  0x000000009edb
>   9  Power-On Hours Count                      0    235  0x00000000ee5c
>  10  Spin Retry Count                        157    253  0x000000000000
>  11  Calibration Retry Count                 223    253  0x000000000000
>  12  Device Power Cycle Count                  0    253  0x0000000000f0
> 192  Power-Off Retract Count                   0    253  0x000000000000
> 193  Load Cycle Count                          0    253  0x000000000000
> 194  Temperature                               0    253  0x00000000000f
> 195  Hardware ECC Recovered                    0    253  0x00000000170d
> 196  Reallocation Event Count                  0    253  0x000000000000
> 197  Current Pending Sector Count              0    253  0x000000000001
> 198  Off-Line Scan Uncorrectable Sect          0    253  0x000000000000
> 199  Ultra DMA CRC Error Count                 0    199  0x000000000000
> 200  Write Error Rate                          0    253  0x000000000000
> 201  Soft Read Error Rate                      0    253  0x000000000000
> 202  Data Address Mark Errors                  0    253  0x000000000000
> 203  Run Out Cancel                          180    253  0x000000000001
> 204  Soft ECC Correction                       0    253  0x000000000000
> 205  Thermal Asperity Check                    0    253  0x000000000000
> 207  Spin High Current                         0    253  0x000000000000
> 208  Spin Buzz                                 0    253  0x000000000000
> 209  Offline Seek Performance                  0    253  0x000000000000
>  99  Unknown                                   0    253  0x000000000000
> 100  Unknown                                   0    253  0x000000000000
> 101  Unknown                                   0    253  0x000000000000
> #
>
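
The Raw column in the readattr output above is hexadecimal. As a minimal sketch, the two attributes most closely tied to bad sectors (IDs 5 and 197) can be converted to decimal with plain shell arithmetic. The variable names are made up for illustration, and reading the raw field as a simple count is an assumption, since vendors may encode raw values differently.

```sh
# Raw values copied from the readattr output above.
# Assumption: for these two attributes the raw field is a plain count;
# vendors are free to encode raw values differently.
realloc_raw=0x000000000007   # ID 5, Reallocated Sector Count
pending_raw=0x000000000001   # ID 197, Current Pending Sector Count

# Shell arithmetic accepts the 0x prefix and yields decimal.
realloc_count=$(($realloc_raw))
pending_count=$(($pending_raw))

echo "reallocated sectors: $realloc_count"   # prints: reallocated sectors: 7
echo "pending sectors: $pending_count"       # prints: pending sectors: 1
```

If that reading is right, a non-zero pending count would be consistent with the unreadable block fsck reports, even though the threshold for attribute 197 is 0 and so can never be exceeded.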
> ------------------------------------------------------------------------
> # atactl wd0 smartstatus
> No SMART threshold exceeded
> #
>
> ------------------------------------------------------------------------
> # atactl wd0 identify
> Model: 6Y250L6, Rev: YAR41BW0, Serial #:
> Device type: ATA, fixed
> Cylinders: 16383, heads: 16, sec/track: 63, total sectors: 490234752
> Device capabilities:
>         ATA standby timer values
>         IORDY operation
>         IORDY disabling
> Device supports the following standards:
> ATA-1 ATA-2 ATA-3 ATA-4 ATA-5 ATA-6 ATA-7
> Device supports the following command sets:
>         NOP command
>         READ BUFFER command
>         WRITE BUFFER command
>         Host Protected Area feature set
>         Read look-ahead
>         Write cache
>         Power Management feature set
>         SMART feature set
>         Flush Cache Ext command
>         Flush Cache command
>         Device Configuration Overlay feature set
>         48bit address feature set
>         Automatic Acoustic Management feature set
>         Set Max security extension commands
>         Advanced Power Management feature set
>         DOWNLOAD MICROCODE command
>         SMART self-test
>         SMART error logging
> Device has enabled the following command sets/features:
>         NOP command
>         READ BUFFER command
>         WRITE BUFFER command
>         Host Protected Area feature set
>         Read look-ahead
>         Write cache
>         Power Management feature set
>         SMART feature set
>         Flush Cache Ext command
>         Flush Cache command
>         Device Configuration Overlay feature set
>         48bit address feature set
>         DOWNLOAD MICROCODE command
> #
>
>
> ------------------------------------------------------------------------
> # disklabel wd0
> # /dev/rwd0c:
> type: ESDI
> disk: ESDI/IDE disk
> label: 6Y250L6
> flags:
> bytes/sector: 512
> sectors/track: 63
> tracks/cylinder: 255
> sectors/cylinder: 16065
> cylinders: 30515
> total sectors: 490234752
> rpm: 3600
> interleave: 1
> boundstart: 63
> boundend: 490223475
> drivedata: 0
>
> 16 partitions:
> #          size     offset  fstype [fsize bsize  cpg]
>   a:    2097152         63  4.2BSD   2048 16384    1 # /
>   b:    4699192    2097215    swap
>   c:  490234752          0  unused
>   d:    8388608    6796407  4.2BSD   2048 16384    1 # /tmp
>   e:   16738416   15185015  4.2BSD   2048 16384    1 # /var
>   f:    4194304   31923431  4.2BSD   2048 16384    1 # /usr
>   g:    2097152   36117735  4.2BSD   2048 16384    1 # /usr/X11R6
>   h:   12582912   38214887  4.2BSD   2048 16384    1 # /usr/local
>   i:    4194304   50797799  4.2BSD   2048 16384    1 # /usr/src
>   j:    4194304   54992103  4.2BSD   2048 16384    1 # /usr/obj
>   k:   62923658   59186407  4.2BSD   2048 16384    1 # /home
>   l:   10490445  122110065  4.2BSD   2048 16384    1 # /usr/xobj
>   m:    4209030  132600510  4.2BSD   2048 16384    1 # /usr/ports
>   n:   62926605  136809540  4.2BSD   2048 16384    1 # /usr/ports/pobj
>   o:  290487330  199736145  4.2BSD   2048 16384    1 # /arc
> #
>
> ------------------------------------------------------------------------
> # fsck -fp
> /dev/rwd0a: 2524 files, 22205 used, 492234 free (226 frags, 61501 blocks, 0.0% fragmentation)
> /dev/rwd0o: 2240 files, 8672169 used, 62800009 free (329 frags, 7849960 blocks, 0.0% fragmentation)
> /dev/rwd0k: 7893 files, 11044647 used, 4436977 free (409 frags, 554571 blocks, 0.0% fragmentation)
> /dev/rwd0d: 7 files, 5 used, 2062698 free (42 frags, 257832 blocks, 0.0% fragmentation)
> wd0f: uncorrectable data error reading fsbn 1658848 of 1658848-1658975 (wd0 bn 33582279; cn 2090 tn 102 sn 3), retrying
> wd0f: uncorrectable data error reading fsbn 1658848 of 1658848-1658975 (wd0 bn 33582279; cn 2090 tn 102 sn 3), retrying
> wd0f: uncorrectable data error reading fsbn 1658848 of 1658848-1658975 (wd0 bn 33582279; cn 2090 tn 102 sn 3), retrying
> wd0: transfer error, downgrading to Ultra-DMA mode 4
> wd0(pciide1:0:0): using PIO mode 4, Ultra-DMA mode 4
> wd0f: uncorrectable data error reading fsbn 1658848 of 1658848-1658975 (wd0 bn 33582279; cn 2090 tn 102 sn 3), retrying
> wd0f: uncorrectable data error reading fsbn 1658944 of 1658848-1658975 (wd0 bn 33582375; cn 2090 tn 103 sn 36), retrying
> wd0f: uncorrectable data error reading fsbn 1658944 of 1658848-1658975 (wd0 bn 33582375; cn 2090 tn 103 sn 36)
> CANNOT READ: BLK 1658848
> /dev/rwd0f: UNEXPECTED INCONSISTENCY; RUN fsck_ffs MANUALLY.
> /dev/rwd0g: 9101 files, 85995 used, 428444 free (7212 frags, 52654 blocks, 1.4% fragmentation)
> /dev/rwd0h: 74624 files, 782097 used, 2312782 free (4254 frags, 288566 blocks, 0.1% fragmentation)
> /dev/rwd0j: 1 files, 1 used, 1030526 free (14 frags, 128814 blocks, 0.0% fragmentation)
> /dev/rwd0m: 92637 files, 128800 used, 905408 free (7368 frags, 112255 blocks, 0.7% fragmentation)
> /dev/rwd0n: 570852 files, 5602927 used, 9879434 free (15154 frags, 1233035 blocks, 0.1% fragmentation)
> /dev/rwd0i: 89556 files, 421361 used, 609166 free (2854 frags, 75789 blocks, 0.3% fragmentation)
> /dev/rwd0l: 1 files, 1 used, 2579961 free (17 frags, 322493 blocks, 0.0% fragmentation)
> /dev/rwd0e: 1863 files, 13191 used, 4104164 free (388 frags, 512972 blocks, 0.0% fragmentation)
> THE FOLLOWING FILE SYSTEM HAD AN UNEXPECTED INCONSISTENCY:
>         ffs: /dev/rwd0f (/usr)
>
>
> ------------------------------------------------------------------------
> # dmesg
> OpenBSD 4.7-beta (GENERIC) #521: Tue Feb 2 17:34:24 MST 2010
>     [email protected]:/usr/src/sys/arch/i386/compile/GENERIC
> cpu0: Intel(R) Pentium(R) 4 CPU 2.40GHz ("GenuineIntel" 686-class) 2.40 GHz
> cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,CNXT-ID,xTPR
> real mem = 2137550848 (2038MB)
> avail mem = 2062446592 (1966MB)
> mainbus0 at root
> bios0 at mainbus0: AT/286+ BIOS, date 04/17/03, BIOS32 rev. 0 @ 0xfd844, SMBIOS rev. 2.31 @ 0xf0240 (51 entries)
> bios0: vendor IBM version "24KT41AUS" date 04/17/2003
> bios0: IBM 8310XXS
> acpi0 at bios0: rev 0
> acpi0: tables DSDT FACP TCPA APIC BOOT
> acpi0: wakeup devices USB1(S3) USB2(S3) USB3(S3) USBE(S3) SLOT(S5) KBC_(S3) COMA(S5) COMB(S5)
> acpitimer0 at acpi0: 3579545 Hz, 24 bits
> acpimadt0 at acpi0 addr 0xfee00000: PC-AT compat
> cpu0 at mainbus0: apid 0 (boot processor)
> cpu0: apic clock running at 132MHz
> ioapic0 at mainbus0: apid 1 pa 0xfec00000, version 20, 24 pins
> acpiprt0 at acpi0: bus 0 (PCI0)
> acpiprt1 at acpi0: bus 2 (SLOT)
> acpiprt2 at acpi0: bus -1 (AGP_)
> acpicpu0 at acpi0
> acpitz0 at acpi0: critical temperature 105 degC
> acpibtn0 at acpi0: PWRB
> bios0: ROM list: 0xc0000/0xb400 0xe0000/0x10000!
> pci0 at mainbus0 bus 0: configuration mode 1 (bios)
> pchb0 at pci0 dev 0 function 0 "Intel 82845G Host" rev 0x01
> vga1 at pci0 dev 2 function 0 "Intel 82845G Video" rev 0x01
> wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
> wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
> intagp0 at vga1
> agp0 at intagp0: aperture at 0x88000000, size 0x8000000
> inteldrm0 at vga1: apic 1 int 16 (irq 11)
> drm0 at inteldrm0
> uhci0 at pci0 dev 29 function 0 "Intel 82801DB USB" rev 0x01: apic 1 int 16 (irq 11)
> uhci1 at pci0 dev 29 function 1 "Intel 82801DB USB" rev 0x01: apic 1 int 19 (irq 10)
> uhci2 at pci0 dev 29 function 2 "Intel 82801DB USB" rev 0x01: apic 1 int 18 (irq 5)
> ehci0 at pci0 dev 29 function 7 "Intel 82801DB USB" rev 0x01: apic 1 int 23 (irq 9)
> usb0 at ehci0: USB revision 2.0
> uhub0 at usb0 "Intel EHCI root hub" rev 2.00/1.00 addr 1
> ppb0 at pci0 dev 30 function 0 "Intel 82801BA Hub-to-PCI" rev 0x81
> pci1 at ppb0 bus 2
> fxp0 at pci1 dev 8 function 0 "Intel PRO/100 VE" rev 0x81, i82562: apic 1 int 20 (irq 11), address 00:09:6b:f3:41:ad
> inphy0 at fxp0 phy 1: i82562EM 10/100 PHY, rev. 0
> pciide0 at pci1 dev 9 function 0 "CMD Technology SiI3512 SATA" rev 0x01: DMA
> pciide0: using apic 1 int 21 (irq 11) for native-PCI interrupt
> cbb0 at pci1 dev 10 function 0 "Ricoh 5C475 CardBus" rev 0x81: apic 1 int 22 (irq 9)
> cardslot0 at cbb0 slot 0 flags 0
> cardbus0 at cardslot0: bus 3 device 0 cacheline 0x0, lattimer 0x20
> pcmcia0 at cardslot0
> ichpcib0 at pci0 dev 31 function 0 "Intel 82801DB LPC" rev 0x01
> pciide1 at pci0 dev 31 function 1 "Intel 82801DB IDE" rev 0x01: DMA, channel 0 configured to compatibility, channel 1 configured to compatibility
> wd0 at pciide1 channel 0 drive 0: < 6Y250L6>
> wd0: 16-sector PIO, LBA48, 239372MB, 490234752 sectors
> wd0(pciide1:0:0): using PIO mode 4, Ultra-DMA mode 5
> atapiscsi0 at pciide1 channel 1 drive 0
> scsibus0 at atapiscsi0: 2 targets
> cd0 at scsibus0 targ 0 lun 0: <LITE-ON, LTR-48246S, SUS5> ATAPI 5/cdrom removable
> cd0(pciide1:1:0): using PIO mode 4, Ultra-DMA mode 2
> ichiic0 at pci0 dev 31 function 3 "Intel 82801DB SMBus" rev 0x01: apic 1 int 17 (irq 9)
> iic0 at ichiic0
> admtemp0 at iic0 addr 0x4c: adm1032
> spdmem0 at iic0 addr 0x50: 1GB DDR SDRAM non-parity PC2700CL2.5
> spdmem1 at iic0 addr 0x51: 1GB DDR SDRAM non-parity PC2700CL2.5
> auich0 at pci0 dev 31 function 5 "Intel 82801DB AC97" rev 0x01: apic 1 int 17 (irq 9), ICH4 AC97
> ac97: codec id 0x41445374 (Analog Devices AD1981B)
> ac97: codec features headphone, 20 bit DAC, No 3D Stereo
> audio0 at auich0
> usb1 at uhci0: USB revision 1.0
> uhub1 at usb1 "Intel UHCI root hub" rev 1.00/1.00 addr 1
> usb2 at uhci1: USB revision 1.0
> uhub2 at usb2 "Intel UHCI root hub" rev 1.00/1.00 addr 1
> usb3 at uhci2: USB revision 1.0
> uhub3 at usb3 "Intel UHCI root hub" rev 1.00/1.00 addr 1
> isa0 at ichpcib0
> isadma0 at isa0
> com0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
> com0: console
> com1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo
> pckbc0 at isa0 port 0x60/5
> pckbd0 at pckbc0 (kbd slot)
> pckbc0: using irq 1 for kbd slot
> wskbd0 at pckbd0: console keyboard, using wsdisplay0
> pms0 at pckbc0 (aux slot)
> pckbc0: using irq 12 for aux slot
> wsmouse0 at pms0 mux 0
> pcppi0 at isa0 port 0x61
> midi0 at pcppi0: <PC speaker>
> spkr0 at pcppi0
> lpt0 at isa0 port 0x378/4 irq 7
> npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16
> fdc0 at isa0 port 0x3f0/6 irq 6 drq 2
> fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec
> mtrr: Pentium Pro MTRR support
> vscsi0 at root
> scsibus1 at vscsi0: 256 targets
> softraid0 at root
> root on wd0a swap on wd0b dump on wd0b
> wd0f: uncorrectable data error reading fsbn 1658848 of 1658848-1658975 (wd0 bn 33582279; cn 2090 tn 102 sn 3), retrying
> wd0f: uncorrectable data error reading fsbn 1658848 of 1658848-1658975 (wd0 bn 33582279; cn 2090 tn 102 sn 3), retrying
> wd0f: uncorrectable data error reading fsbn 1658848 of 1658848-1658975 (wd0 bn 33582279; cn 2090 tn 102 sn 3), retrying
> wd0: transfer error, downgrading to Ultra-DMA mode 4
> wd0(pciide1:0:0): using PIO mode 4, Ultra-DMA mode 4
> wd0f: uncorrectable data error reading fsbn 1658848 of 1658848-1658975 (wd0 bn 33582279; cn 2090 tn 102 sn 3), retrying
> wd0f: uncorrectable data error reading fsbn 1658944 of 1658848-1658975 (wd0 bn 33582375; cn 2090 tn 103 sn 36), retrying
> wd0f: uncorrectable data error reading fsbn 1658944 of 1658848-1658975 (wd0 bn 33582375; cn 2090 tn 103 sn 36)
> #
>
> --
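
The numbers in the kernel error lines can be cross-checked against the disklabel output, which helps narrow down where on the disk the unreadable area sits. The following is a rough sketch using only values quoted in this message. The variable names are made up for illustration, and the zero-based sector numbering is inferred from the fact that the arithmetic reproduces the kernel's own figures.

```sh
# Values copied from the disklabel and kernel messages quoted above.
f_offset=31923431        # offset of partition f (/usr) in sectors
fsbn=1658848             # first failing block, relative to wd0f
last_fsbn=1658975        # last block of the failing transfer
spt=63                   # sectors/track
tpc=255                  # tracks/cylinder

bn=$((f_offset + fsbn))              # absolute sector on wd0
count=$((last_fsbn - fsbn + 1))      # sectors in the transfer
cn=$((bn / (spt * tpc)))             # cylinder
tn=$((bn % (spt * tpc) / spt))       # track within the cylinder
sn=$((bn % spt))                     # sector within the track

echo "wd0 bn $bn; cn $cn tn $tn sn $sn"
# prints: wd0 bn 33582279; cn 2090 tn 102 sn 3
echo "sectors in transfer: $count"
# prints: sectors in transfer: 128
```

The result matches the first error line, so the failing transfer starts about 1658848 sectors (roughly 810 MB at 512 bytes per sector) into wd0f and covers 128 sectors.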

