How did the multiprocessor system handle CPU crashes?
Surprisingly enough, very badly. When one CPU crashed, all the CPUs crashed.

“The philosophy of the 11/74 was high availability, not high reliability. As such, from a philosophical viewpoint, we wanted crash dumps of all the CPUs to catch software problems.”

“Pragmatically speaking, continuing would be difficult. The crashing CPU is in the kernel, owning at least $EXECL in all likelihood, and perhaps some other spin locks. Of course, any lock it owned was owned to protect an atomic transaction, and the crash caused some decay.”

“The fork list may not be intact, the Pool may not be intact, device states may be inconsistent, the context of the running task on the crashed CPU (which could be MCR or F11ACP) is lost in what may have been an atomic transaction inside the component (remember $LOCKL?), and a host of other problems may exist. [These] will simply cascade into a mass of wreckage where a crash dump ought to be.”

Source: Brian S. McCarthy (July 2005)
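
The point about a lock protecting an atomic transaction can be illustrated with a toy model. This is only a sketch and not the 11/74's actual code: the `Pool` class, its `free + allocated == total` invariant, the `lock_owner` field (a stand-in for a spin lock such as $EXECL), and the `CpuCrash` exception are all hypothetical names invented for the example. It shows a CPU halting between the two halves of an update, leaving the lock held and the data inconsistent for every other CPU.

```python
class CpuCrash(Exception):
    """Hypothetical stand-in for a CPU halting partway through a kernel operation."""


class Pool:
    """Toy allocator whose invariant is: free + allocated == total."""

    def __init__(self, total):
        self.total = total
        self.free = total
        self.allocated = 0
        self.lock_owner = None  # hypothetical stand-in for a spin lock

    def consistent(self):
        # True only when the two counters still add up to the pool size.
        return self.free + self.allocated == self.total

    def allocate(self, cpu, blocks, crash_midway=False):
        self.lock_owner = cpu       # take the lock
        self.free -= blocks         # first half of the transaction
        if crash_midway:
            # The CPU stops here: the lock is never released and the
            # second half of the update never happens.
            raise CpuCrash(f"CPU {cpu} halted while holding the lock")
        self.allocated += blocks    # second half of the transaction
        self.lock_owner = None      # release the lock


def demo():
    pool = Pool(total=100)
    pool.allocate(cpu=0, blocks=10)          # completes normally
    ok_before = pool.consistent()
    try:
        pool.allocate(cpu=1, blocks=5, crash_midway=True)
    except CpuCrash:
        pass                                 # the other CPUs keep running
    return ok_before, pool.consistent(), pool.lock_owner


if __name__ == "__main__":
    print(demo())
```

In this model the surviving CPUs would either spin forever on a lock that is never released or, if they broke the lock, operate on counters that no longer add up. That is the "decay" the quote describes, and it is why halting every CPU and taking a dump was the safer choice.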