Re: HDD failing? :'(
Hi Ben,
Here is the "context". Hope it is enough/correct!
>From /var/log/syslog.0
Oct 13 13:36:33 nias -- MARK --
Oct 13 13:47:40 nias kernel: hdc: timeout waiting ^I^I^Ifor dbdma command stop
Oct 13 13:47:40 nias kernel: hdc: bad status at DMA end, dstat=8480
Oct 13 13:47:41 nias kernel: scsi0: ERROR on channel 0, id 0, lun 0, CDB: 0x03 00 00 00 40 00
Oct 13 13:47:41 nias kernel: Current sd0b:00: sns = 70 3
Oct 13 13:47:41 nias kernel: ASC= 2 ASCQ= 0
Oct 13 13:47:41 nias kernel: Raw sense data:0x70 0x00 0x03 0x00 0x00 0x00 0x00 0x0a 0x08 0x00 0x00 0x00 0x02 0x00 0x00 0x00 0x00 0x00
Oct 13 13:47:41 nias kernel: I/O error: dev 0b:00, sector 64
Oct 13 13:47:41 nias kernel: isofs_read_super: bread failed, dev=0b:00, iso_blknum=16, block=16
Oct 13 13:48:36 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 13:48:36 nias kernel: ISO 9660 Extensions: RRIP_1991A
.....
Oct 13 20:26:53 nias kernel: hda: bad status at DMA end, dstat=8400
Oct 13 20:26:53 nias kernel: hda: timeout waiting for DMA
Oct 13 20:26:53 nias kernel: hda: timeout waiting for DMA
Oct 13 20:26:53 nias kernel: hda: status error: status=0x58 { DriveReady SeekComplete DataRequest }
Oct 13 20:26:53 nias kernel:
Oct 13 20:26:53 nias kernel: hda: drive not ready for command
Oct 13 20:26:53 nias kernel: hda: status timeout: status=0xd0 { Busy }
Oct 13 20:26:53 nias kernel:
Oct 13 20:26:53 nias kernel: hda: drive not ready for command
Oct 13 20:27:08 nias kernel: ide0: reset: success
Oct 13 20:27:08 nias kernel: blk: queue c02de6a8, I/O limit 4095Mb (mask 0xffffffff)
Oct 13 20:27:21 nias kernel: hda: irq timeout: status=0xd0 { Busy }
Oct 13 20:27:21 nias kernel:
Oct 13 20:28:16 nias kernel: hda: DMA disabled
Oct 13 20:28:16 nias kernel: ide0: reset: success
Oct 13 20:30:36 nias kernel: hda: status timeout: status=0xd0 { Busy }
Oct 13 20:30:36 nias kernel:
Oct 13 20:30:36 nias kernel: hda: no DRQ after issuing WRITE
Oct 13 20:30:38 nias kernel: ide0: reset: success
Oct 13 20:30:58 nias kernel: hda: irq timeout: status=0xd0 { Busy }
Oct 13 20:30:58 nias kernel:
Oct 13 20:31:03 nias kernel: ide0: reset: success
( Here is where I run the Apple Hardware Tests in the CD )
And from var/log/kern.log
Oct 12 21:25:53 nias kernel: hdc: timeout waiting ^I^I^Ifor dbdma command stop
Oct 12 21:25:53 nias kernel: hdc: bad status at DMA end, dstat=8480
Oct 12 21:25:53 nias kernel: I/O error: dev 0b:00, sector 0
Oct 12 21:25:53 nias kernel: hdc: timeout waiting ^I^I^Ifor dbdma command stop
Oct 12 21:25:53 nias kernel: hdc: bad status at DMA end, dstat=8480
Oct 12 21:25:53 nias kernel: I/O error: dev 0b:00, sector 0
( Funny that the DVD errors also happened the day before :-? )
Oct 13 13:47:40 nias kernel: hdc: timeout waiting ^I^I^Ifor dbdma command stop
Oct 13 13:47:40 nias kernel: hdc: bad status at DMA end, dstat=8480
Oct 13 13:47:41 nias kernel: scsi0: ERROR on channel 0, id 0, lun 0, CDB: 0x03 00 00 00 40 00
Oct 13 13:47:41 nias kernel: Current sd0b:00: sns = 70 3
Oct 13 13:47:41 nias kernel: ASC= 2 ASCQ= 0
Oct 13 13:47:41 nias kernel: Raw sense data:0x70 0x00 0x03 0x00 0x00 0x00 0x00 0x0a 0x08 0x00 0x00 0x00 0x02 0x00 0x00 0x00 0x00 0x00
Oct 13 13:47:41 nias kernel: I/O error: dev 0b:00, sector 64
Oct 13 13:47:41 nias kernel: isofs_read_super: bread failed, dev=0b:00, iso_blknum=16, block=16
Oct 13 13:48:36 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 13:48:36 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 15:12:19 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 15:12:19 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 15:18:49 nias kernel: attempt to access beyond end of device
Oct 13 15:18:49 nias kernel: 0b:00: rw=0, want=34, limit=2
Oct 13 15:18:49 nias kernel: isofs_read_super: bread failed, dev=0b:00, iso_blknum=16, block=16
Oct 13 15:21:25 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 15:21:25 nias kernel: ISOFS: changing to secondary root
Oct 13 15:23:22 nias kernel: attempt to access beyond end of device
Oct 13 15:23:22 nias kernel: 0b:00: rw=0, want=34, limit=2
Oct 13 15:23:22 nias kernel: isofs_read_super: bread failed, dev=0b:00, iso_blknum=16, block=16
Oct 13 15:23:24 nias kernel: attempt to access beyond end of device
Oct 13 15:23:24 nias kernel: 0b:00: rw=0, want=34, limit=2
Oct 13 15:23:24 nias kernel: isofs_read_super: bread failed, dev=0b:00, iso_blknum=16, block=16
Oct 13 15:24:03 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 15:24:03 nias kernel: ISOFS: changing to secondary root
Oct 13 15:34:06 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 15:34:06 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 15:35:06 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 15:35:06 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 15:43:15 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 15:43:15 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 15:48:10 nias kernel: sr0: CDROM (ioctl) reports ILLEGAL REQUEST.
Oct 13 15:48:10 nias kernel: cdrom: open failed.
Oct 13 15:48:29 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 15:48:29 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 15:55:37 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 15:55:37 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 16:05:21 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 16:05:21 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 16:09:39 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 16:09:39 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 16:14:54 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 16:14:54 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 16:20:47 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 16:20:47 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 16:25:49 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 16:25:49 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 16:31:06 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 16:31:06 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 16:36:30 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 16:36:30 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 16:41:41 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 16:41:41 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 16:46:58 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 16:46:58 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 16:50:22 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 16:50:22 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 16:54:12 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 16:54:12 nias kernel: ISO 9660 Extensions: RRIP_1991A
Oct 13 19:29:18 nias kernel: sr0: CDROM (ioctl) reports ILLEGAL REQUEST.
Oct 13 19:29:18 nias kernel: cdrom: open failed.
Oct 13 19:29:31 nias kernel: sr0: CDROM (ioctl) reports ILLEGAL REQUEST.
Oct 13 19:29:31 nias kernel: cdrom: open failed.
Oct 13 19:29:56 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
Oct 13 19:29:56 nias kernel: ISO 9660 Extensions: RRIP_1991A
.....
Oct 13 20:26:53 nias kernel: hda: bad status at DMA end, dstat=8400
Oct 13 20:26:53 nias kernel: hda: timeout waiting for DMA
Oct 13 20:26:53 nias kernel: hda: timeout waiting for DMA
Oct 13 20:26:53 nias kernel: hda: status error: status=0x58 { DriveReady SeekComplete DataRequest }
Oct 13 20:26:53 nias kernel:
Oct 13 20:26:53 nias kernel: hda: drive not ready for command
Oct 13 20:26:53 nias kernel: hda: status timeout: status=0xd0 { Busy }
Oct 13 20:26:53 nias kernel:
Oct 13 20:26:53 nias kernel: hda: drive not ready for command
Oct 13 20:27:08 nias kernel: ide0: reset: success
Oct 13 20:27:08 nias kernel: blk: queue c02de6a8, I/O limit 4095Mb (mask 0xffffffff)
Oct 13 20:27:21 nias kernel: hda: irq timeout: status=0xd0 { Busy }
Oct 13 20:27:21 nias kernel:
Oct 13 20:28:16 nias kernel: hda: DMA disabled
Oct 13 20:28:16 nias kernel: ide0: reset: success
Oct 13 20:30:36 nias kernel: hda: status timeout: status=0xd0 { Busy }
Oct 13 20:30:36 nias kernel:
Oct 13 20:30:36 nias kernel: hda: no DRQ after issuing WRITE
Oct 13 20:30:38 nias kernel: ide0: reset: success
Oct 13 20:30:58 nias kernel: hda: irq timeout: status=0xd0 { Busy }
Oct 13 20:30:58 nias kernel:
Oct 13 20:31:03 nias kernel: ide0: reset: success
( And here is the same reboot point )
So basically, there are two things... stuff happening on HDC (the
superdrive) at around 13:XX and stuff on HDA, my HDD, at around 20:XX.
I am also sending the smartclt -a /dev/hda4 results, in case someone
wiser than me knows how to read more stuff than I do in them! :)
smartctl version 5.1-18 Copyright (C) 2002-3 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF INFORMATION SECTION ===
Device Model: FUJITSU MHS2060AT
Serial Number: NL24T3114CF3
Firmware Version: 8105
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: 6
ATA Standard is: ATA/ATAPI-6 T13 1410D revision 3a
Local Time is: Tue Oct 14 10:49:41 2003 EST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Off-line data collection status: (0x02) Offline data collection activity was
completed without error.
Auto Off-line Data Collection: Disabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete off-line
data collection: ( 492) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Automatic timer ON/OFF support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
No General Purpose Logging support.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 83) minutes.
Conveyance self-test routine
recommended polling time: ( 2) minutes.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 100 100 046 Pre-fail Always - 191114
2 Throughput_Performance 0x0005 100 100 030 Pre-fail Offline - 292
3 Spin_Up_Time 0x0003 100 100 025 Pre-fail Always - 25601
4 Start_Stop_Count 0x0032 099 099 000 Old_age Always - 382
5 Reallocated_Sector_Ct 0x0033 080 080 024 Pre-fail Always - 497
7 Seek_Error_Rate 0x000f 100 100 047 Pre-fail Always - 776
8 Seek_Time_Performance 0x0005 100 100 019 Pre-fail Offline - 0
9 Power_On_Hours 0x0032 068 068 000 Old_age Always - 17366468
10 Spin_Retry_Count 0x0013 100 100 020 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 099 099 000 Old_age Always - 251
192 Power-Off_Retract_Count 0x0032 099 099 000 Old_age Always - 28
193 Load_Cycle_Count 0x0032 049 049 000 Old_age Always - 189502
194 Temperature_Celsius 0x0022 100 100 000 Old_age Always - 39 (Lifetime Min/Max 21/52)
195 Hardware_ECC_Recovered 0x001a 100 100 000 Old_age Always - 9645
196 Reallocated_Event_Count 0x0032 080 080 000 Old_age Always - 480
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x000f 084 071 060 Pre-fail Always - 2898836
203 Run_Out_Cancel 0x0002 100 100 000 Old_age Always - 3732310457836
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
No self-tests have been logged
Thanks in advance!
--
J. Javier Maestro
<jjmaestro@computer.org>
http://rigel.homelinux.com
On Oct Mon 13 2003 20:28, Benjamin Herrenschmidt wrote:
>
> >
> > as:~# grep -i error /var/log/syslog
> > Oct 13 13:47:41 nias kernel: scsi0: ERROR on channel 0, id 0, lun 0, CDB: 0x03 00 00 00 40 00
> > Oct 13 13:47:41 nias kernel: I/O error: dev 0b:00, sector 64
> > Oct 13 20:26:53 nias kernel: hda: status error: status=0x58 { DriveReady SeekComplete DataRequest }
> >
> > nias:~# grep -i error /var/log/kern.log
> > Oct 12 21:25:53 nias kernel: I/O error: dev 0b:00, sector 0
> > Oct 12 21:25:53 nias kernel: I/O error: dev 0b:00, sector 0
> > Oct 13 13:47:41 nias kernel: scsi0: ERROR on channel 0, id 0, lun 0, CDB: 0x03 00 00 00 40 00
> > Oct 13 13:47:41 nias kernel: I/O error: dev 0b:00, sector 64
> > Oct 13 20:26:53 nias kernel: hda: status error: status=0x58 { DriveReady SeekComplete DataRequest }
>
> can you give some context around the HD errors ?
>
> Ben.
Reply to: