Claims
- 1. A method, comprising:
receiving an instruction to execute a system boot operation; executing the system boot operation using data resident in a primary shared memory node; and initializing a secondary shared memory node upon completion of the system boot operation.
- 2. The method of claim 1, wherein the instruction to execute a system boot operation is received from software by a processing node.
- 3. The method of claim 1, wherein the secondary shared memory node is in a standby status after initialization.
- 4. The method of claim 1, further comprising initializing a third shared memory node upon completion of the system boot operation.
- 5. The method of claim 4, further comprising initializing a fourth shared memory node upon completion of the system boot operation.
- 6. A method, comprising:
accessing a primary shared memory node; executing software processes in a processing node; duplicating events occurring in the primary shared memory node in a secondary shared memory node; monitoring communication between the processing node and the primary shared memory node to recognize an error in communication between the processing node and the primary shared memory node; monitoring events occurring in the primary shared memory node to recognize an error in the events occurring in the primary shared memory node; if an error is recognized, writing a FAILED code to the primary shared memory node and designating the primary shared memory node as failed; and if an error is recognized, switching system operation to the secondary shared memory node.
- 7. The method of claim 6, further comprising: if an error has been recognized, retrying at least one member selected from the group consisting of monitoring communication between the processing node and the primary shared memory node and monitoring events occurring in the primary shared memory node.
- 8. The method of claim 7, wherein retrying is repeated a programmable number of times.
- 9. The method of claim 6, further comprising:
if an error has been recognized,
abandoning operation of the failed primary shared memory node once a FAILED code is written to it; writing a copy of the FAILED code to the secondary shared memory node; and writing a copy of the FAILED code to a shared memory node repair register.
- 10. The method of claim 9, further comprising: restoring system operation to a repaired primary shared memory node; and reinitializing the secondary shared memory node.
- 11. The method of claim 6, wherein switching system operation to the secondary shared memory node includes providing the processing node with unrestricted access to the secondary shared memory node.
- 12. The method of claim 6, further comprising:
monitoring communication between the processing node and the secondary shared memory node to recognize an error in the events occurring in the secondary shared memory node; monitoring events occurring in the secondary shared memory node to recognize an error in the events occurring in the secondary shared memory node.
- 13. The method of claim 6, further comprising:
duplicating events occurring in the memory node responsible for system operation in a third memory node; monitoring communications between the processing node and the third shared memory node to recognize an error in the events occurring in the third shared memory node; monitoring events occurring in the third shared memory node to recognize an error in the events occurring in the secondary shared memory node.
- 14. The method of claim 13, further comprising:
duplicating events occurring in the memory node responsible for system operation in a fourth memory node; monitoring communications between the processing node and the fourth shared memory node to recognize an error in the events occurring in the fourth shared memory node; monitoring events occurring in the fourth shared memory node to recognize an error in the events occurring in the secondary shared memory node.
- 15. The method of claim 12, further comprising:
if an error is recognized in the secondary shared memory node, writing a FAILED code to the secondary shared memory node and designating the secondary shared memory node as failed.
- 16. The method of claim 15, further comprising:
if an error is recognized in the secondary shared memory node, switching backup of the primary shared memory node to a third shared memory node.
- 17. The method of claim 12, further comprising:
if an error is recognized in the secondary shared memory node, attempting to copy events occurring in the primary shared memory node to the secondary shared memory node.
- 18. The method of claim 13, further comprising:
if an error is recognized in the third shared memory node, writing a FAILED code to the third shared memory node and designating the third shared memory node as failed.
- 19. The method of claim 13, further comprising:
if an error is recognized in the third shared memory node, attempting to copy events occurring in the primary shared memory node to the third shared memory node.
- 20. The method of claim 14, further comprising:
if an error is recognized in the fourth shared memory node, writing a FAILED code to the fourth shared memory node and designating the fourth shared memory node as failed.
- 21. The method of claim 14, further comprising:
if an error is recognized in the third shared memory node, attempting to copy events occurring in the primary shared memory node to the fourth shared memory node.
- 22. The method of claim 18, further comprising:
if an error is recognized in the third shared memory node, switching backup of the secondary shared memory node to a fourth shared memory node.
- 23. An apparatus, comprising:
a processing node; a dual-port adapter coupled to the processing node; a primary shared memory node coupled to the dual-port adapter; and a secondary shared memory node coupled to the dual-port adapter.
- 24. The apparatus of claim 1, wherein the dual-port adapter includes a dual-port PCI adapter.
- 25. The apparatus of claim 23, wherein the processing node includes a device selected from the group consisting of microprocessors, programmable logic devices, and microcontrollers.
- 26. The apparatus of claim 23, wherein the primary shared memory node is accessible by a plurality of processing nodes.
- 27. The apparatus of claim 23, wherein the secondary shared memory node is accessible by a plurality of processing nodes.
- 28. The apparatus of claim 23, further comprising:
another processing node coupled to the primary shared memory node and the secondary shared memory node; another dual-port adapter coupled to the another processing node; another primary shared memory node coupled to the another dual-port adapter; and another secondary shared memory node coupled to the another dual-port adapter.
- 29. The apparatus of claim 23, wherein the dual-port adapter includes a dual-port PCI adapter.
- 30. The apparatus of claim 23, wherein the dual-port adapter includes at least a logic control, a switching circuit, and a non-volatile memory.
CROSS-REFERENCES TO RELATED APPLICATIONS
[0001] This application is a continuation-in-part of, and claims a benefit of priority under 35 U.S.C. 119(e) and/or 35 U.S.C. 120 from, copending U.S. Ser. No. 60/220,974, filed Jul. 26, 2000, and 60/220,748, also filed Jul. 26, 2000, the entire contents of both of which are hereby expressly incorporated by reference for all purposes.
Provisional Applications (2)
|
Number |
Date |
Country |
|
60220974 |
Jul 2000 |
US |
|
60220748 |
Jul 2000 |
US |