Claims
- 1. A computer system comprising:
a microprocessor; a first memory board comprising random access memory (RAM); a second memory board comprising RAM; a first bridge device coupled to the microprocessor, the first bridge also coupled to the first and second memory boards by way of a memory bus; a switch device coupled between the first bridge device and each of the first and second memory boards, the switch adapted to selectively couple each of the first and second memory boards to the memory bus; a logic device coupled to the switch device and adapted to control the selective coupling of each of the first and second memory devices by the quick switch device; a second bridge device coupled to the first bridge device by way of a primary expansion bus; a read only memory (ROM) device coupled to the second bridge device, and wherein the ROM device stores programs executable by the microprocessor to configure the memory boards; and wherein either of the first and second memory boards may be selective removed from and added to the computer system during operation of the computer system.
- 2. The computer system as defined in claim 1 further comprising a plurality of microprocessors, each microprocessor coupled to the first bridge logic.
- 3. The computer system as defined in claim 2 wherein each of the microprocessors comprises an X86 based microprocessor.
- 4. The computer system as defined in claim 3 wherein the microprocessors further comprise Pentium® 4 Xeon microprocessors produced by Intel Corporation.
- 5. The computer system as defined in claim 4 further comprises a hard disk drive coupled to the microprocessors, and where the hard disk drive stores an operating system program executable by the microprocessors, and wherein the operating system does not support removal and adding of memory boards during operation of the computer.
- 6. The computer system as defined in claim 1 wherein the switch device further comprises a Pericom Corporation part No. PI5C34171C demultiplexer bus switch.
- 7. The computer system as defined in claim 1 wherein the first bridge logic device and the second bridge logic device are part of a ServerWorks™ Grand Champion™ HE chipset manufactured by Server Works, Inc.
- 8. The computer system as defined in claim 1 wherein the logic device coupled to the switch device further comprises a programmable array logic (PAL).
- 9. The computer system as defined in claim 8 wherein the PAL is further adapted to selective enable providing of auxiliary power to the first and second memory boards.
- 10. The computer system as defined in claim 8 wherein the PAL is further adapted to selective enable providing of primary power to the first and second memory boards.
- 11. The computer system as defined in claim 8 wherein the PAL is further adapted to interface with the first bridge logic device to request a stoppage of main memory traffic prior to coupling to or de-coupling from the memory bus by one of the first and second memory boards.
- 12. The computer system as defined in claim 1 wherein the programs stored on the ROM are further configured to determine compatibility of memory modules on a newly installed memory board.
- 13. The computer system as defined in claim 1 wherein the programs stored on the ROM are further adapted to configure memory controllers of memory boards remaining in the computer system after installation or removal.
- 14. The computer system as defined in claim 1 wherein the programs stored on the ROM are further adapted to configure the first bridge logic device to properly use main memory added to or removed from the computer system during operation.
- 15. The computer system as defined in claim 1 further comprises:
a first strobe signal coupled between the first bridge logic device and the first memory board, the first strobe signal associated with a first data group; a second strobe signal coupled between the first bridge logic device and the second memory board, the second strobe signal associated with the first data group; and wherein the first and second strobe signals are substantially identical.
- 16. A method comprising:
installing a main memory board into an operational computer system; coupling, under control of a logic device, the main memory board to a memory bus of the computer system; configuring, using software programs not a part of an operating system, the main memory board for use; and thereafter utilizing memory on the main memory board.
- 17. The method as defined in claim 16 wherein coupling the main memory board to a memory bus of the computer system further comprises enabling auxiliary power to the main memory board by the logic device.
- 18. The method as defined in claim 17 wherein coupling the main memory board to a memory bus of the computer system further comprises enabling primary power to the main memory board by the logic device.
- 19. The method as defined in claim 18 wherein coupling the main memory board to a memory bus of the computer system further comprises:
requesting stoppage of main memory traffic in the computer system; verifying stoppage of main memory traffic; then closing a plurality of quick switch devices to electrically couple the main memory board to control and data lines of a memory bus; and then allowing resumption of main memory traffic.
- 20. The method as defined in claim 16 wherein configuring, using software programs not a part of an operating system, the main memory board for use further comprises configuring the main memory board for use by software programs stored on a read only memory device (ROM) and executed by a microprocessor.
- 21. The method as defined in claim 20 wherein the configuring step further comprises:
configuring a memory controller on the memory board for dual card operation; configuring a memory controller on an existing memory board for dual card operation; and configuring a north bridge device for dual card operation.
- 22. A computer system comprising:
a microprocessor; a main memory array acting as a working memory of the computer system; a hot spare memory array acting as a back-up memory of the computer system; a first bridge device coupling the microprocessor to the main memory array and the hot spare memory array; wherein the computer system is adapted to make the hot spare memory the working memory when a rate of correctable errors in the main memory array exceeds a threshold.
- 23. The computer system as defined in claim 22 further comprising:
a logic device coupled to the first bridge device, the logic device adapted to periodically assert a control signal, and wherein assertion of the control signal invokes a read only memory (ROM) program that clears a log of correctable errors; and wherein the computer system makes the hot spare memory the working memory when a number of entries in the log exceeds a threshold prior to clearing of the log by ROM program.
- 24. The computer system as defined in claim 23 wherein the first bridge logic device maintains the log of correctable errors.
- 25. The computer system as defined in claim 24 wherein the logic device is a programmable array logic (PAL) configured to assert the control signal once an hour.
- 26. A method of operating a computer system comprising making a hot spare memory device a working memory for the computer system as a rate of correctable errors from a first memory device previously the working memory exceeds a threshold.
- 27. The method as defined in claim 26 further comprising:
logging a number of correctable errors experienced in the first memory device of the computer system; clearing the log of correctable errors of the first memory device periodically; and making the hot spare memory the working memory if the number of entries in the log exceeds a threshold before the clearing step.
- 28. The method as defined in claim 27 wherein logging a number of correctable errors experienced in a first memory device of the computer system further comprises logging the number of correctable errors by a bridge logic device.
- 29. The method as defined in claim 27 wherein clearing the log of correctable errors of the first memory device periodically further comprises:
asserting a control signal periodically; and based on assertion of the control signal invoking a program executable on a microprocessor of the computer system that clears the log in the bridge logic device.
- 30. The method as defined in claim 29 wherein the invoking step further comprises issuing a system management interrupt signal based on the assertion of the control signal.
- 31. The method as defined in claim 29 wherein periodically asserting the control signal further comprises asserting the control signal once per hour.
- 32. In a computer system having a main memory array, a method comprising logging uncorrectable errors experienced by the main memory array to a non-volatile storage location in the main memory array.
- 33. The method as defined in claim 32 wherein logging uncorrectable errors further comprises writing an indication of the uncorrectable errors to the serial presence detect (SPD) bytes of the main memory array.
- 34. The method as defined in claim 33 wherein the writing step further comprises writing the indication of the uncorrectable errors to the SPD bytes over an I2C bus.
- 35. The method as defined in claim 34 further comprising writing the indication of the uncorrectable errors in the upper 128 bytes of the SPD non-volatile random access memory.
- 36. In a computer system having a main memory, a method comprising:
executing a power on self test (POST) program in the computer system; checking a main memory for uncorrectable errors with the POST program; and disabling portions of the main memory that experience uncorrectable errors during execution of the POST program.
- 37. The method as defined in claim 36 wherein disabling portions of the main memory that experience uncorrectable errors during execution of the POST program further comprises swapping the computer system to a hot spare main memory.
- 38. The method as defined in claim 36 wherein disabling portions of the main memory that experience uncorrectable errors during execution of the POST program further comprises mapping out portions of the main memory that experience uncorrectable errors.
- 39. The method as defined in claim 36 further comprising triggering the executing of the POST program step based on a reboot of the computer caused by a watchdog timer operating within the computer system.
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This specification claims the benefit of Provisional Application Serial No. 60/377,863 (Attorney Docket No. 1662-65800) filed May 3, 2002 titled “Advanced Memory Protection,” which is incorporated by reference herein as if reproduced in full below. Further, this specification is related to Application Ser. No. ______ (Attorney Docket No. 1662-66600), filed concurrently herewith, titled “Hot Mirroring in a Computer System With Redundant Memory Subsystems,” which is also incorporated by reference herein as if reproduced in full below.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60377863 |
May 2002 |
US |