A long time ago, in a galaxy far, far way, someone said...

[...]
> My hardware is:
> 
> - Dell NetPlex 486SX/25
> - 32MB RAM
> - AHA1542C SCSI Controller
> - 3COM 509B Ethernet Controller
> - 2-port 16550A Serial Card
> - 3 x SCSI Hard Drives External
> - 1 x SCSI Jaz Drive External
> - 1 x SCSI Sony DDS-3 DAT Drive External
> 
> My software is:
> 
> - Debian GNU/Linux 2.1 (slink)
> - Glibc 2.1 added from potato
> - Linux 2.2.11 kernel

[...]

> After the system has been up for a random length of time (usually
> about a week or so) it will crash in the middle of the night during a
> full backup to the DAT drive using cpio. The machine hangs in either
> an infinite loop or a kernel panic. I originally was running Debian
> 2.1 with a 2.0.36 kernel, and I would see the following scrolling
> endlessly off the screen after a crash:
> 
> Sending SCSI DID_RESET...
> Sending SCSI DID_RESET...
> Sending SCSI DID_RESET...
> Sending SCSI DID_RESET...
> Sending SCSI DID_RESET...
> other scsi messages, etc...
> 
> Since installing the 2.2 kernel and associated upgraded packages as
> detailed in the errata for slink, the crashes *seem* to occur less often,
> but this morning I saw:
> 
> aha1542_out failed...
> aha1542_out failed... failed to reset target...
> ...
> Kernel panic: unable to find empty mailbox for aha1542...
> 
> and the system was locked up. Since upgrading to the 2.2 kernel, I
> also notice periodic messages in the syslog (about one per day) like
> this:
> 
> aha1542.c: interrupt received but no mail
> 
> The system will run perfectly for a week or so, doing this same backup
> routine every night, and then it will just pull this trick on some random
> night.
> 
> I have tried:
> 
> - disconnecting all devices except the tape drive hard drives
> - installing the highest quality cables I can find for the external
>   devices (this machine currently has about $400 US worth of Granite
>   Digital cables hanging off of it).
> - installing a Granite Digital active terminator on the end of the SCSI
>   chain
> - verifying that there are no interrupt or IO port confilicts both in the
>   device jumper configurations and from the /proc filesystem

Tried a different (newer) kernel?  IIRC there have been changes to the
aha1542 driver since 2.2.11 - current is 2.2.13.

> I am completely at my wits end with this. I have searched DejaNews
> repeatedly for any discussions of kernel panics and crashes with
> Adaptec cards, Linux, SCSI in general, etc., and all I can find is one
> thread from about a year ago mentioning the same sorts of problems but
> no solution.
> 
> Is this a problem that anyone else has ever had with Linux and an
> AHA1542C in particular or SCSI in general? Can anyone recommend which
> part of the setup I should change or eliminate?

> Is it a bad card?

It's a possibility.

> Are Adaptec cards bad in general?

Not at all - they're considered to be among the best.

> Is the aha1542 scsi driver problematic?

It's a possibility.  Try a newer kernel.  I have a different revision 1542
(I think its a 1542CF) with a small HD and a CD-ROM drive hanging off it,
and have no problems.

> Is Linux SCSI in general problematic?

Not in my experience.  I have 3 computers with running Linux and SCSI, and
none of them have this problem.

-- 
----------------------------------------------------------------------
Phil Brutsche                                   [EMAIL PROTECTED]

"There are two things that are infinite; Human stupidity and the
universe. And I'm not sure about the universe." - Albert Einstein

Reply via email to