I've just noticed that a server I support has been logging SCSI errors
every Friday at about the same time. It's just a quick blip of error
messages, then on to normal business:
Nov 29 20:31:25 esd-tls01 kernel: SCSI disk error : host 0 channel 0 id 0 lun 0 return code = 28000002
Nov 29 20:31:25 esd-tls01 kernel: Current sd08:07: sense key Hardware Error
Nov 29 20:31:25 esd-tls01 kernel: Additional sense indicates Internal target failure
Nov 29 20:31:25 esd-tls01 kernel: I/O error: dev 08:07, sector 92510
Nov 29 20:31:25 esd-tls01 kernel: sym53c896-0-<0,*>: FAST-20 WIDE SCSI 40.0 MB/s (50.0 ns, offset 15)
Nov 29 20:31:29 esd-tls01 kernel: SCSI disk error : host 0 channel 0 id 0 lun 0 return code = 28000002
Nov 29 20:31:29 esd-tls01 kernel: Current sd08:07: sense key Hardware Error
Nov 29 20:31:29 esd-tls01 kernel: Additional sense indicates Internal target failure
Nov 29 20:31:29 esd-tls01 kernel: I/O error: dev 08:07, sector 92510
Nov 29 20:31:29 esd-tls01 kernel: sym53c896-0-<0,*>: FAST-20 WIDE SCSI 40.0 MB/s (50.0 ns, offset 15)
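For anyone reading the log lines above, here is a minimal Python sketch of how the two numeric fields can be unpacked. It assumes the Linux SCSI layer's usual packing of the return code into four bytes (driver, host, message, status, from high to low). It also assumes the conventional block-device numbering, where major 8 is the sd driver with 16 minors per disk. The function names are made up for illustration.

```python
def decode_scsi_result(code):
    """Split a 32-bit SCSI return code into its four byte fields.

    Assumed layout (high to low): driver, host, message, status.
    """
    return {
        "driver": (code >> 24) & 0xFF,
        "host": (code >> 16) & 0xFF,
        "msg": (code >> 8) & 0xFF,
        "status": code & 0xFF,
    }


def sd_name(dev):
    """Map a 'major:minor' pair (hex, as printed by the kernel) to an sd name.

    Assumes major 8 (sda..sdp) with 16 minors per disk; minor % 16 == 0
    means the whole disk, anything else is a partition number.
    """
    major, minor = (int(part, 16) for part in dev.split(":"))
    if major != 8:
        raise ValueError("only major 8 (first 16 SCSI disks) is handled here")
    disk, partition = divmod(minor, 16)
    name = "sd" + chr(ord("a") + disk)
    return name + (str(partition) if partition else "")


fields = decode_scsi_result(0x28000002)
print({k: hex(v) for k, v in fields.items()})
print(sd_name("08:07"))
```

Under those assumptions, 28000002 splits into a driver byte of 0x28 and a status byte of 0x02 (CHECK CONDITION in the SCSI standard), and dev 08:07 maps to sda7. That would be consistent with the error coming from the disk at host 0, id 0.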
Here's the output of "cat /proc/scsi/scsi":
Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: IBM Model: DGHS09Y CLAR09 Rev: 1A1A
Type: Direct-Access ANSI SCSI revision: 03
Host: scsi0 Channel: 00 Id: 01 Lun: 00
Vendor: SEAGATE Model: ST39102L CLAR09 Rev: AV05
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 06 Lun: 00
Vendor: ESG-SHV Model: SCA HSBP M8 Rev: 0.05
Type: Processor ANSI SCSI revision: 02
Host: scsi3 Channel: 00 Id: 00 Lun: 00
Vendor: DGC Model: RAID 5 Rev: 0847
Type: Direct-Access ANSI SCSI revision: 04
Host: scsi3 Channel: 00 Id: 00 Lun: 01
Vendor: DGC Model: RAID 5 Rev: 0847
Type: Direct-Access ANSI SCSI revision: 04
I've looked at my cron jobs, and I don't see anything that runs at about
that time on Fridays.
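One way to double-check the "every Friday at about the same time" pattern is to tally the error lines by weekday and hour. The sketch below is only an illustration: syslog timestamps carry no year, so the caller has to supply one (2002 is assumed in the example, a year in which Nov 29 falls on a Friday), and the function name is hypothetical.

```python
from collections import Counter
from datetime import datetime

WEEKDAYS = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"]


def tally_scsi_errors(lines, year):
    """Count 'SCSI disk error' lines per (weekday, hour).

    `year` must be supplied because classic syslog lines omit it.
    """
    counts = Counter()
    for line in lines:
        if "SCSI disk error" not in line:
            continue
        # The first three whitespace-separated fields are month, day, time.
        stamp = " ".join(line.split()[:3])
        when = datetime.strptime(f"{year} {stamp}", "%Y %b %d %H:%M:%S")
        counts[(WEEKDAYS[when.weekday()], when.hour)] += 1
    return counts


sample = [
    "Nov 29 20:31:25 esd-tls01 kernel: SCSI disk error : host 0 channel 0 id 0 lun 0 return code = 28000002",
    "Nov 29 20:31:25 esd-tls01 kernel: I/O error: dev 08:07, sector 92510",
    "Nov 29 20:31:29 esd-tls01 kernel: SCSI disk error : host 0 channel 0 id 0 lun 0 return code = 28000002",
]
for (day, hour), n in sorted(tally_scsi_errors(sample, 2002).items()):
    print(day, f"{hour:02d}:00", n)
```

Run against the full set of rotated log files, a single dominant (weekday, hour) bucket would support the idea of a scheduled job or a weekly self-test rather than random failures.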
One other weird thing... The console is unresponsive. I can't log in
on tty1. The other ttys seem fine. I tried killing the process on
tty1 so it would respawn (which it did), but that did not make tty1
usable.
Any ideas?
TIA,
Bill