On Tue, 02 Mar 2010 07:50 -0800, "J.C. Roberts"
<[email protected]> wrote:
> And I thought I was expected to be inconsistent. ;)
> 
> Anyhow, I was upgrading from the Feb 2, to the most recent snapshot, and
> fsck is coming up with a problem on one of my partitions. I can probably
> get it working ("fix" is such a strong word) with `fsck -fy` but my real
> concern is if the drive is failing?
> 
> atactl tells me everything is just fine?
> 
> I have a nearly identical system, with the same type of disk, which
> reports similar atactl attributes... but then again, I don't really trust
> SATA/PATA drives very much or their supposedly "smart" monitoring.
> 
> The data on the system is not only backed up, but it's also easily
> replaced since the machine is only used for src and ports builds. I think
> I might lose a total of a few newly downloaded distfiles since the last
> backup.
> 
> What I really want to do here is understand *why* some portion of the
> disk has become unreadable?


cd /bad_partition && dd if=/dev/zero of=big_file.zero bs=512
conv=sync,noerror

Let it run until it finishes. That won't explain why the sectors are
bad, but it may give a good indication of the problem area and answer
the failing drive question. If dd reports IO issues, you may want to
replace the drive.

Brad

 
> All of the below were done in single user mode over serial.
> (sorry about the width)
> 
> ------------------------------------------------------------------------
> # atactl wd0 smartenable
> # atactl wd0 readattr
> Attributes table revision: 16
> ID   Attribute name                  Threshold  Value  Raw
>   3  Spin Up Time                      63       180    0x0000000046f2
>   4  Start/Stop Count                   0       253    0x0000000000d2
>   5  Reallocated Sector Count          63       253    0x000000000007
>   6  Read Channel Margin              100       253    0x000000000000
>   7  Seek Error Rate                    0       253    0x000000000000
>   8  Seek Time Performance            187       253    0x000000009edb
>   9  Power-On Hours Count               0       235    0x00000000ee5c
>  10  Spin Retry Count                 157       253    0x000000000000
>  11  Calibration Retry Count          223       253    0x000000000000
>  12  Device Power Cycle Count           0       253    0x0000000000f0
> 192  Power-Off Retract Count            0       253    0x000000000000
> 193  Load Cycle Count                   0       253    0x000000000000
> 194  Temperature                        0       253    0x00000000000f
> 195  Hardware ECC Recovered             0       253    0x00000000170d
> 196  Reallocation Event Count           0       253    0x000000000000
> 197  Current Pending Sector Count       0       253    0x000000000001
> 198  Off-Line Scan Uncorrectable Sect   0       253    0x000000000000
> 199  Ultra DMA CRC Error Count          0       199    0x000000000000
> 200  Write Error Rate                   0       253    0x000000000000
> 201  Soft Read Error Rate               0       253    0x000000000000
> 202  Data Address Mark Errors           0       253    0x000000000000
> 203  Run Out Cancel                   180       253    0x000000000001
> 204  Soft ECC Correction                0       253    0x000000000000
> 205  Thermal Asperity Check             0       253    0x000000000000
> 207  Spin High Current                  0       253    0x000000000000
> 208  Spin Buzz                          0       253    0x000000000000
> 209  Offline Seek Performance           0       253    0x000000000000
>  99  Unknown                            0       253    0x000000000000
> 100  Unknown                            0       253    0x000000000000
> 101  Unknown                            0       253    0x000000000000
> #
> 
> ------------------------------------------------------------------------
> # atactl wd0 smartstatus
> No SMART threshold exceeded
> # 
> 
> ------------------------------------------------------------------------
> # atactl wd0 identify
> Model:        6Y250L6, Rev: YAR41BW0, Serial #: 
> Device type: ATA, fixed
> Cylinders: 16383, heads: 16, sec/track: 63, total sectors: 490234752
> Device capabilities:
>         ATA standby timer values
>         IORDY operation
>         IORDY disabling
> Device supports the following standards:
> ATA-1 ATA-2 ATA-3 ATA-4 ATA-5 ATA-6 ATA-7 
> Device supports the following command sets:
>         NOP command
>         READ BUFFER command
>         WRITE BUFFER command
>         Host Protected Area feature set
>         Read look-ahead
>         Write cache
>         Power Management feature set
>         SMART feature set
>         Flush Cache Ext command
>         Flush Cache command
>         Device Configuration Overlay feature set
>         48bit address feature set
>         Automatic Acoustic Management feature set
>         Set Max security extension commands
>         Advanced Power Management feature set
>         DOWNLOAD MICROCODE command
>         SMART self-test
>         SMART error logging
> Device has enabled the following command sets/features:
>         NOP command
>         READ BUFFER command
>         WRITE BUFFER command
>         Host Protected Area feature set
>         Read look-ahead
>         Write cache
>         Power Management feature set
>         SMART feature set
>         Flush Cache Ext command
>         Flush Cache command
>         Device Configuration Overlay feature set
>         48bit address feature set
>         DOWNLOAD MICROCODE command
> # 
> 
> 
> ------------------------------------------------------------------------
> # disklabel wd0
> # /dev/rwd0c:
> type: ESDI
> disk: ESDI/IDE disk
> label:        6Y250L6  
> flags:
> bytes/sector: 512
> sectors/track: 63
> tracks/cylinder: 255
> sectors/cylinder: 16065
> cylinders: 30515
> total sectors: 490234752
> rpm: 3600
> interleave: 1
> boundstart: 63
> boundend: 490223475
> drivedata: 0 
> 
> 16 partitions:
> #                size           offset  fstype [fsize bsize  cpg]
>   a:          2097152               63  4.2BSD   2048 16384    1 # /
>   b:          4699192          2097215    swap                   
>   c:        490234752                0  unused                   
>   d:          8388608          6796407  4.2BSD   2048 16384    1 # /tmp
>   e:         16738416         15185015  4.2BSD   2048 16384    1 # /var
>   f:          4194304         31923431  4.2BSD   2048 16384    1 # /usr
>   g:          2097152         36117735  4.2BSD   2048 16384    1 #
>   /usr/X11R6
>   h:         12582912         38214887  4.2BSD   2048 16384    1 #
>   /usr/local
>   i:          4194304         50797799  4.2BSD   2048 16384    1 #
>   /usr/src
>   j:          4194304         54992103  4.2BSD   2048 16384    1 #
>   /usr/obj
>   k:         62923658         59186407  4.2BSD   2048 16384    1 # /home
>   l:         10490445        122110065  4.2BSD   2048 16384    1 #
>   /usr/xobj
>   m:          4209030        132600510  4.2BSD   2048 16384    1 #
>   /usr/ports
>   n:         62926605        136809540  4.2BSD   2048 16384    1 #
>   /usr/ports/pobj
>   o:        290487330        199736145  4.2BSD   2048 16384    1 # /arc
> # 
> 
> ------------------------------------------------------------------------
> # fsck -fp
> /dev/rwd0a: 2524 files, 22205 used, 492234 free (226 frags, 61501 blocks,
> 0.0% fragmentation)
> /dev/rwd0o: 2240 files, 8672169 used, 62800009 free (329 frags, 7849960
> blocks, 0.0% fragmentation)
> /dev/rwd0k: 7893 files, 11044647 used, 4436977 free (409 frags, 554571
> blocks, 0.0% fragmentation)
> /dev/rwd0d: 7 files, 5 used, 2062698 free (42 frags, 257832 blocks, 0.0%
> fragmentation)
> wd0f: uncorrectable data error reading fsbn 1658848 of 1658848-1658975
> (wd0 bn 33582279; cn 2090 tn 102 sn 3), retrying
> wd0f: uncorrectable data error reading fsbn 1658848 of 1658848-1658975
> (wd0 bn 33582279; cn 2090 tn 102 sn 3), retrying
> wd0f: uncorrectable data error reading fsbn 1658848 of 1658848-1658975
> (wd0 bn 33582279; cn 2090 tn 102 sn 3), retrying
> wd0: transfer error, downgrading to Ultra-DMA mode 4
> wd0(pciide1:0:0): using PIO mode 4, Ultra-DMA mode 4
> wd0f: uncorrectable data error reading fsbn 1658848 of 1658848-1658975
> (wd0 bn 33582279; cn 2090 tn 102 sn 3), retrying
> wd0f: uncorrectable data error reading fsbn 1658944 of 1658848-1658975
> (wd0 bn 33582375; cn 2090 tn 103 sn 36), retrying
> wd0f: uncorrectable data error reading fsbn 1658944 of 1658848-1658975
> (wd0 bn 33582375; cn 2090 tn 103 sn 36)
> CANNOT READ: BLK 1658848
> /dev/rwd0f: UNEXPECTED INCONSISTENCY; RUN fsck_ffs MANUALLY.
> /dev/rwd0g: 9101 files, 85995 used, 428444 free (7212 frags, 52654
> blocks, 1.4% fragmentation)
> /dev/rwd0h: 74624 files, 782097 used, 2312782 free (4254 frags, 288566
> blocks, 0.1% fragmentation)
> /dev/rwd0j: 1 files, 1 used, 1030526 free (14 frags, 128814 blocks, 0.0%
> fragmentation)
> /dev/rwd0m: 92637 files, 128800 used, 905408 free (7368 frags, 112255
> blocks, 0.7% fragmentation)
> /dev/rwd0n: 570852 files, 5602927 used, 9879434 free (15154 frags,
> 1233035 blocks, 0.1% fragmentation)
> /dev/rwd0i: 89556 files, 421361 used, 609166 free (2854 frags, 75789
> blocks, 0.3% fragmentation)
> /dev/rwd0l: 1 files, 1 used, 2579961 free (17 frags, 322493 blocks, 0.0%
> fragmentation)
> /dev/rwd0e: 1863 files, 13191 used, 4104164 free (388 frags, 512972
> blocks, 0.0% fragmentation)
> THE FOLLOWING FILE SYSTEM HAD AN UNEXPECTED INCONSISTENCY:
>         ffs: /dev/rwd0f (/usr)
> 
> 
> ------------------------------------------------------------------------
> # dmesg
> OpenBSD 4.7-beta (GENERIC) #521: Tue Feb  2 17:34:24 MST 2010
>     [email protected]:/usr/src/sys/arch/i386/compile/GENERIC
> cpu0: Intel(R) Pentium(R) 4 CPU 2.40GHz ("GenuineIntel" 686-class) 2.40
> GHz cpu0:
> FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,CNXT-ID,xTPR
> real mem  = 2137550848 (2038MB) avail mem = 2062446592 (1966MB)
> mainbus0 at root
> bios0 at mainbus0: AT/286+ BIOS, date 04/17/03, BIOS32 rev. 0 @
> 0xfd844, SMBIOS rev. 2.31 @ 0xf0240 (51 entries) bios0: vendor IBM
> version "24KT41AUS" date 04/17/2003 bios0: IBM 8310XXS
> acpi0 at bios0: rev 0
> acpi0: tables DSDT FACP TCPA APIC BOOT
> acpi0: wakeup devices USB1(S3) USB2(S3) USB3(S3) USBE(S3) SLOT(S5) KBC_
> (S3) COMA(S5) COMB(S5) acpitimer0 at acpi0: 3579545 Hz, 24 bits
> acpimadt0 at acpi0 addr 0xfee00000: PC-AT compat
> cpu0 at mainbus0: apid 0 (boot processor)
> cpu0: apic clock running at 132MHz
> ioapic0 at mainbus0: apid 1 pa 0xfec00000, version 20, 24 pins
> acpiprt0 at acpi0: bus 0 (PCI0)
> acpiprt1 at acpi0: bus 2 (SLOT)
> acpiprt2 at acpi0: bus -1 (AGP_)
> acpicpu0 at acpi0
> acpitz0 at acpi0: critical temperature 105 degC
> acpibtn0 at acpi0: PWRB
> bios0: ROM list: 0xc0000/0xb400 0xe0000/0x10000!
> pci0 at mainbus0 bus 0: configuration mode 1 (bios)
> pchb0 at pci0 dev 0 function 0 "Intel 82845G Host" rev 0x01
> vga1 at pci0 dev 2 function 0 "Intel 82845G Video" rev 0x01
> wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
> wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
> intagp0 at vga1
> agp0 at intagp0: aperture at 0x88000000, size 0x8000000
> inteldrm0 at vga1: apic 1 int 16 (irq 11)
> drm0 at inteldrm0
> uhci0 at pci0 dev 29 function 0 "Intel 82801DB USB" rev 0x01: apic 1
> int 16 (irq 11) uhci1 at pci0 dev 29 function 1 "Intel 82801DB USB" rev
> 0x01: apic 1 int 19 (irq 10) uhci2 at pci0 dev 29 function 2 "Intel
> 82801DB USB" rev 0x01: apic 1 int 18 (irq 5) ehci0 at pci0 dev 29
> function 7 "Intel 82801DB USB" rev 0x01: apic 1 int 23 (irq 9) usb0 at
> ehci0: USB revision 2.0 uhub0 at usb0 "Intel EHCI root hub" rev
> 2.00/1.00 addr 1 ppb0 at pci0 dev 30 function 0 "Intel 82801BA
> Hub-to-PCI" rev 0x81 pci1 at ppb0 bus 2
> fxp0 at pci1 dev 8 function 0 "Intel PRO/100 VE" rev 0x81, i82562: apic
> 1 int 20 (irq 11), address 00:09:6b:f3:41:ad inphy0 at fxp0 phy 1:
> i82562EM 10/100 PHY, rev. 0 pciide0 at pci1 dev 9 function 0 "CMD
> Technology SiI3512 SATA" rev 0x01: DMA pciide0: using apic 1 int 21
> (irq 11) for native-PCI interrupt cbb0 at pci1 dev 10 function 0 "Ricoh
> 5C475 CardBus" rev 0x81: apic 1 int 22 (irq 9) cardslot0 at cbb0 slot 0
> flags 0 cardbus0 at cardslot0: bus 3 device 0 cacheline 0x0, lattimer
> 0x20 pcmcia0 at cardslot0
> ichpcib0 at pci0 dev 31 function 0 "Intel 82801DB LPC" rev 0x01
> pciide1 at pci0 dev 31 function 1 "Intel 82801DB IDE" rev 0x01: DMA,
> channel 0 configured to compatibility, channel 1 configured to
> compatibility wd0 at pciide1 channel 0 drive 0: < 6Y250L6> wd0:
> 16-sector PIO, LBA48, 239372MB, 490234752 sectors wd0(pciide1:0:0):
> using PIO mode 4, Ultra-DMA mode 5 atapiscsi0 at pciide1 channel 1
> drive 0 scsibus0 at atapiscsi0: 2 targets
> cd0 at scsibus0 targ 0 lun 0: <LITE-ON, LTR-48246S, SUS5> ATAPI 5/cdrom
> removable cd0(pciide1:1:0): using PIO mode 4, Ultra-DMA mode 2
> ichiic0 at pci0 dev 31 function 3 "Intel 82801DB SMBus" rev 0x01: apic
> 1 int 17 (irq 9) iic0 at ichiic0
> admtemp0 at iic0 addr 0x4c: adm1032
> spdmem0 at iic0 addr 0x50: 1GB DDR SDRAM non-parity PC2700CL2.5
> spdmem1 at iic0 addr 0x51: 1GB DDR SDRAM non-parity PC2700CL2.5
> auich0 at pci0 dev 31 function 5 "Intel 82801DB AC97" rev 0x01: apic 1
> int 17 (irq 9), ICH4 AC97 ac97: codec id 0x41445374 (Analog Devices
> AD1981B) ac97: codec features headphone, 20 bit DAC, No 3D Stereo
> audio0 at auich0
> usb1 at uhci0: USB revision 1.0
> uhub1 at usb1 "Intel UHCI root hub" rev 1.00/1.00 addr 1
> usb2 at uhci1: USB revision 1.0
> uhub2 at usb2 "Intel UHCI root hub" rev 1.00/1.00 addr 1
> usb3 at uhci2: USB revision 1.0
> uhub3 at usb3 "Intel UHCI root hub" rev 1.00/1.00 addr 1
> isa0 at ichpcib0
> isadma0 at isa0
> com0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
> com0: console
> com1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo
> pckbc0 at isa0 port 0x60/5
> pckbd0 at pckbc0 (kbd slot)
> pckbc0: using irq 1 for kbd slot
> wskbd0 at pckbd0: console keyboard, using wsdisplay0
> pms0 at pckbc0 (aux slot)
> pckbc0: using irq 12 for aux slot
> wsmouse0 at pms0 mux 0
> pcppi0 at isa0 port 0x61
> midi0 at pcppi0: <PC speaker>
> spkr0 at pcppi0
> lpt0 at isa0 port 0x378/4 irq 7
> npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16
> fdc0 at isa0 port 0x3f0/6 irq 6 drq 2
> fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec
> mtrr: Pentium Pro MTRR support
> vscsi0 at root
> scsibus1 at vscsi0: 256 targets
> softraid0 at root
> root on wd0a swap on wd0b dump on wd0b
> wd0f: uncorrectable data error reading fsbn 1658848 of 1658848-1658975
> (wd0 bn 33582279; cn 2090 tn 102 sn 3), retrying wd0f: uncorrectable
> data error reading fsbn 1658848 of 1658848-1658975 (wd0 bn 33582279; cn
> 2090 tn 102 sn 3), retrying wd0f: uncorrectable data error reading fsbn
> 1658848 of 1658848-1658975 (wd0 bn 33582279; cn 2090 tn 102 sn 3),
> retrying wd0: transfer error, downgrading to Ultra-DMA mode 4 wd0
> (pciide1:0:0): using PIO mode 4, Ultra-DMA mode 4 wd0f: uncorrectable
> data error reading fsbn 1658848 of 1658848-1658975 (wd0 bn 33582279; cn
> 2090 tn 102 sn 3), retrying wd0f: uncorrectable data error reading fsbn
> 1658944 of 1658848-1658975 (wd0 bn 33582375; cn 2090 tn 103 sn 36),
> retrying wd0f: uncorrectable data error reading fsbn 1658944 of
> 1658848-1658975 (wd0 bn 33582375; cn 2090 tn 103 sn 36)
> #  
> 
> -- 

Reply via email to