1. Technical Field
The present inventions relate to memory systems with primary and redundant memory.
2. Background Art
Computer systems typically include memory devices. Dynamic random access memories (DRAMs) are commonly used memory devices that store relatively large amounts of data. Memory controllers issue write requests and read requests to DRAMs. The data to be stored in response to a write request may originate from a processor or another chip. The data provided by the DRAM in response to a read request may be used by the processor or another chip. The memory controller may be in a physically separate chip from the processor or may be on the same chip as the processor.
Computer systems, including server systems, follow a technology trend in which memory subsystems are increasing in both absolute size and device density. Accompanying the larger memory subsystems is an increasing occurrence of both soft and hard errors in the DRAM devices used to implement the memory subsystem. As the memory subsystem grows, so does the statistical probability of a multi-bit error in any given quantum of data manipulated by the memory controller. In many cases, the memory controller operates on a fixed data size corresponding to a fraction of a cache-line size of the platform processor complex. For example, a memory controller designed for CPUs with a 64-byte line may store eight 64-bit fragments independently.
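As a rough illustration of the fragment arithmetic in the example above, the following Python sketch splits one 64-byte line into eight 64-bit fragments. The name `split_line` and the constants are hypothetical and are chosen only for illustration; they do not describe any particular memory controller.

```python
LINE_BYTES = 64      # cache-line size from the example above
FRAGMENT_BYTES = 8   # 64 bits per fragment

def split_line(line: bytes) -> list[bytes]:
    """Split one cache line into equal fragments that could be stored independently."""
    if len(line) != LINE_BYTES:
        raise ValueError("expected a full cache line")
    return [line[i:i + FRAGMENT_BYTES]
            for i in range(0, LINE_BYTES, FRAGMENT_BYTES)]

# Example line whose byte values are simply 0..63.
fragments = split_line(bytes(range(LINE_BYTES)))
```

Concatenating the fragments in order reproduces the original line.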
Recent server system products have exhibited several features targeted at limiting the system impact of both soft and hard errors in the DRAM memory subsystem. Today, it is common for memory controllers to implement an “Error Correcting Code” (ECC) algorithm, where additional bits of data are stored along with each cache-line fragment, such that any single bit error or combination of bit errors within an aligned nibble may be corrected in hardware. This mechanism permits a system to continue operating reliably in the presence of occasional single-bit soft errors, as well as in the presence of a hard error affecting up to an entire ×4 DRAM device. Extensions on this algorithm are available to protect against failed ×8 DRAM devices. But the ECC mechanism may break down when multiple soft errors are encountered on a single access to the memory store, because the limited correcting code (typically 8 check bits for every 64 data bits, or 16 check bits for every 128 data bits) may not be able to cover all permutations of two or more bit errors scattered across the affected data.
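The check-bit budgets quoted above can be compared with the classical Hamming bound for single-error correction. The sketch below is only an illustration of that bound, with hypothetical function names; it does not model the nibble-correcting code described above, which is a stronger code.

```python
def sec_check_bits(data_bits: int) -> int:
    """Smallest r with 2**r >= data_bits + r + 1 (Hamming bound, single-error correction)."""
    r = 1
    while 2 ** r < data_bits + r + 1:
        r += 1
    return r

def secded_check_bits(data_bits: int) -> int:
    """One additional overall parity bit adds double-error detection."""
    return sec_check_bits(data_bits) + 1

def overhead(check_bits: int, data_bits: int) -> float:
    """Storage overhead of the check bits relative to the data bits."""
    return check_bits / data_bits
```

Under this bound, 64 data bits need 8 check bits for single-error correction with double-error detection, and 128 data bits need 9. The 16 check bits per 128 data bits mentioned above exceed that minimum while keeping the same 12.5% overhead as 8 check bits per 64 data bits.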
Mirroring data refers to maintaining two copies of every datum in the main memory store. Mirroring every bit of data cuts the effective capacity of a given memory subsystem implementation in half. Known solutions available today also require that the available bandwidth of the memory subsystem be cut in half to provide the mirroring capability.
In another memory system, a memory controller is coupled to a first repeater hub through the primary channel and a second repeater hub through the mirror channel. Two subchannels are coupled to each repeater hub. Memory modules are coupled to the subchannels. Primary data is stored in the memory modules of the subchannels coupled to the first repeater hub and redundant data is stored in the memory modules of the subchannels coupled to the second repeater hub.
Memory systems with more than two channels have been proposed.
The inventions will be understood more fully from the detailed description given below and from the accompanying drawings of embodiments of the inventions which, however, should not be taken to limit the inventions to the specific embodiments described, but are for explanation and understanding only.
The inventions involve interleaved mirrored memory systems. An interleaved mirrored memory system is a mirrored memory system that typically operates in an interleaved mirrored memory state, but which may temporarily operate in a non-interleaved state in response to the detection of a triggering failure in the memory assembly. A discussion of triggering failures is provided below. When the failure is corrected, the system returns to the interleaved mirrored memory state. Each channel in the system is coupled to one or more memory assemblies. In an interleaved mirrored memory state, the memory of each channel stores both primary and redundant data sections, and corresponding primary and redundant data sections are stored in the memory of different channels.
A memory controller of the system controls the operation of the interleaved mirrored memory state and any non-interleaved states. The memory controller is in an interleaved mirrored memory mode when it is controlling the system in an interleaved mirrored memory state. The memory controller is in a non-interleaved mode when it is controlling the system in a non-interleaved state.
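The relationship between the two controller modes can be sketched as a two-state model. This is a minimal sketch under stated assumptions: the class `ControllerModel` and its method names are hypothetical, and the model ignores the intermediate steps a real transition would involve.

```python
from enum import Enum

class Mode(Enum):
    INTERLEAVED_MIRRORED = "interleaved mirrored"
    NON_INTERLEAVED = "non-interleaved"

class ControllerModel:
    """Hypothetical model of the controller mode, not an actual controller interface."""

    def __init__(self) -> None:
        # The system typically operates in the interleaved mirrored memory state.
        self.mode = Mode.INTERLEAVED_MIRRORED

    def on_triggering_failure(self) -> None:
        # A triggering failure moves the controller to a non-interleaved mode.
        self.mode = Mode.NON_INTERLEAVED

    def on_failure_corrected(self) -> None:
        # Once the failure is corrected, the interleaved mirrored mode is restored.
        self.mode = Mode.INTERLEAVED_MIRRORED

ctrl = ControllerModel()
```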
Several examples of systems operating in interleaved mirrored memory states are provided below. Examples of non-interleaved states are also discussed. However, the inventions are not limited to the details of these examples of systems in interleaved and non-interleaved states.
Various types of memory assemblies may be used. Examples of memory assemblies include memory modules and discs or portions of discs in hard drive systems, but the inventions are not so limited. Memory modules each include one or more memory devices. Merely as an example, and not a requirement, memory modules may be 72-bit dual in-line memory modules (DIMMs), with 64 bits for data and 8 bits for ECC. ECC is not required. The memory devices are not limited to a particular technology. For example, the memory devices may be DRAM chips manufactured according to currently widely used technology. As another example, the memory devices may be polymer memories. The inventions are not restricted to a particular signaling technique.
The term “data section” refers to data that is stored in a particular memory assembly or a portion of the memory assembly at a particular time. It is intended to be a broad, non-restrictive term. The data section may include data that is physically discontinuous in the memory assembly and that is discontinuous in logical memory. Typically, the contents and extent of the data sections change over time.
A. Examples with One Data Section per Memory Assembly
Although system 50 and some other systems described herein are illustrated with four memory assemblies per channel, this is not required. There may be more or fewer than four memory assemblies per channel. In some embodiments, the system will work even though some of the slots that may receive memory modules are empty.
The data to be stored in the memory assemblies is divided into primary data sections DA1, DA2, DB1, and DB2 and corresponding redundant data sections DA1′, DA2′, DB1′, and DB2′, which are identical or essentially identical to primary sections DA1, DA2, DB1, and DB2. It is expected that in most if not all actual implementations of the inventions, the redundant data sections would be identical to the corresponding primary data sections (for example, DA1′ is identical to DA1). However, a primary and corresponding redundant data section could differ in a known way. For example, the bits in the redundant data sections could be the inverse of the corresponding bits in the primary data sections.
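The two relationships described above, an identical copy and a bitwise-inverted copy, can be sketched as follows. The function names and the sample bytes are hypothetical and serve only to illustrate that an inverted redundant section still allows the primary data to be recovered exactly.

```python
def make_redundant(primary: bytes, inverted: bool = False) -> bytes:
    """Build a redundant copy: identical by default, or the bitwise inverse."""
    if inverted:
        return bytes(b ^ 0xFF for b in primary)
    return bytes(primary)

def matches(primary: bytes, redundant: bytes, inverted: bool = False) -> bool:
    """Check that a redundant section corresponds to its primary section."""
    return make_redundant(primary, inverted) == redundant

# Hypothetical contents of section DA1 and its inverted counterpart DA1'.
da1 = bytes([0x00, 0x0F, 0xA5])
da1_mirror = make_redundant(da1, inverted=True)
```

Applying the inversion a second time returns the original primary data.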
The primary data section is the one that the memory controller ordinarily reads, with the corresponding redundant data section being a backup. In some embodiments, the memory controller always reads the primary data section rather than the corresponding redundant data section if no failure is detected in the system. However, in other embodiments, such as are discussed below, the memory controller may choose to read the redundant data section rather than the primary data section even though there is no failure in the system.
In the illustrated example of
As an example, in some embodiments, the small part of DA1 is a half cache line and the small part of DB1 is a half cache line which together make up a whole cache line. (As an example, the small part of DA1 may be an odd half cache line and the small part of DB1 may be an even half cache line.) An advantage of doing this is that for reads, half of the data group can be read from the first channel at the same time that another half of the data group is read from the second channel. This results in twice the read bandwidth and half the read latency.
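The half-cache-line example can be sketched as follows. This is a simplified model under stated assumptions: the line is split into two contiguous halves (which half is called "odd" or "even" is not modeled), and the `bytes_per_slot` parameter is a hypothetical transfer width per channel per time slot.

```python
LINE_BYTES = 64                 # full cache line
HALF_BYTES = LINE_BYTES // 2    # half cache line held on each channel

def split_halves(line: bytes) -> tuple[bytes, bytes]:
    """Split a cache line into the half kept on each of the two channels."""
    if len(line) != LINE_BYTES:
        raise ValueError("expected a full cache line")
    return line[:HALF_BYTES], line[HALF_BYTES:]

def transfer_slots(total_bytes: int, channels: int, bytes_per_slot: int = 8) -> int:
    """Time slots needed when the bytes are spread evenly over parallel channels."""
    per_channel = -(-total_bytes // channels)      # ceiling division
    return -(-per_channel // bytes_per_slot)       # ceiling division

line = bytes(range(LINE_BYTES))
half_ch1, half_ch2 = split_halves(line)
```

In this model, reading the whole line over one channel takes twice as many slots as reading the two halves over two channels in parallel.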
In contrast, in prior art system 10, all the primary data sections are on the same channel. Therefore, it will take at least twice as long for data sections DA1 and DB1 to be read through system 10 as through system 50, assuming the controller, channel, and memory chips in systems 10 and 50 otherwise have the same capabilities.
The inventions are not restricted to any particular order of storing primary and redundant data sections with respect to the order of the memory assemblies. For example, in another embodiment, data sections DA1, DA2, DB1′, and DB2′ may be stored in memory assemblies M1, M3, M5, and M7, respectively, and data sections DB1, DB2, DA1′, and DA2′ may be stored in memory assemblies M2, M4, M6, and M8, respectively. As another example, data sections DB1′, DB2′, DA1, and DA2 may be stored in memory assemblies M1, M3, M5, and M7, respectively, and data sections DA1′, DA2′, DB1, and DB2 may be stored in memory assemblies M2, M4, M6, and M8, respectively.
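The two alternative orders above can be checked against the interleaved mirrored property with a short sketch. It assumes that odd-numbered memory assemblies (M1, M3, M5, M7) are on one channel and even-numbered assemblies on the other, which the text suggests but does not state here. The function names are hypothetical, and an ASCII apostrophe stands in for the prime mark.

```python
def channel_of(assembly: str) -> int:
    """Assumption: odd-numbered assemblies are on channel 1, even-numbered on channel 2."""
    return 1 if int(assembly[1:]) % 2 == 1 else 2

def is_interleaved_mirrored(layout: dict[str, list[str]]) -> bool:
    """True if each channel holds primary and redundant sections and every
    primary section has its redundant section on the other channel."""
    location = {s: channel_of(m) for m, sections in layout.items() for s in sections}
    for section, channel in location.items():
        if section.endswith("'"):
            continue
        mirror = section + "'"
        if mirror not in location or location[mirror] == channel:
            return False
    for channel in (1, 2):
        kinds = {s.endswith("'") for s, c in location.items() if c == channel}
        if kinds != {True, False}:
            return False
    return True

first_order = {"M1": ["DA1"], "M3": ["DA2"], "M5": ["DB1'"], "M7": ["DB2'"],
               "M2": ["DB1"], "M4": ["DB2"], "M6": ["DA1'"], "M8": ["DA2'"]}
second_order = {"M1": ["DB1'"], "M3": ["DB2'"], "M5": ["DA1"], "M7": ["DA2"],
                "M2": ["DA1'"], "M4": ["DA2'"], "M6": ["DB1"], "M8": ["DB2"]}
# A mirrored but non-interleaved layout: all primary sections on one channel.
all_primary_on_one_channel = {"M1": ["DA1"], "M3": ["DA2"], "M5": ["DB1"], "M7": ["DB2"],
                              "M2": ["DA1'"], "M4": ["DA2'"], "M6": ["DB1'"], "M8": ["DB2'"]}
```

Both orders from the text satisfy the check, while the layout with every primary section on a single channel does not.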
In
Primary memory assemblies are those memory assemblies that store only primary data sections; redundant memory assemblies are those memory assemblies that store only redundant data sections; and mixed memory assemblies are those memory assemblies that store both primary and redundant data sections. In
B. Examples with Two Data Sections per Memory Assembly
In the example of
Many other orders of data sections with respect to memory assemblies may be used. For example, in another embodiment data sections DA1 and DB1′ are stored in memory assembly M1, data sections DA2 and DB2′ are stored in memory assembly M3, data sections DA3 and DB3′ are stored in memory assembly M5, data sections DA4 and DB4′ are stored in memory assembly M7, data sections DB1 and DA1′ are stored in memory assembly M2, data sections DB2 and DA2′ are stored in memory assembly M4, data sections DB3 and DA3′ are stored in memory assembly M6, and data sections DB4 and DA4′ are stored in memory assembly M8. In this example, the memory assemblies are mixed, because they have both primary and redundant data.
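Using the definitions of primary, redundant, and mixed memory assemblies given earlier, the order just described can be classified with a short sketch. The function name `classify` is hypothetical, and an ASCII apostrophe stands in for the prime mark.

```python
def classify(sections: list[str]) -> str:
    """Classify a memory assembly by the kinds of data sections it stores."""
    if not sections:
        raise ValueError("assembly stores no data sections")
    has_primary = any(not s.endswith("'") for s in sections)
    has_redundant = any(s.endswith("'") for s in sections)
    if has_primary and has_redundant:
        return "mixed"
    return "primary" if has_primary else "redundant"

# The order described in the preceding paragraph.
layout = {
    "M1": ["DA1", "DB1'"], "M3": ["DA2", "DB2'"],
    "M5": ["DA3", "DB3'"], "M7": ["DA4", "DB4'"],
    "M2": ["DB1", "DA1'"], "M4": ["DB2", "DA2'"],
    "M6": ["DB3", "DA3'"], "M8": ["DB4", "DA4'"],
}
kinds = {assembly: classify(sections) for assembly, sections in layout.items()}
```

Every assembly in this order is classified as mixed.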
An advantage of having both primary and redundant data on the same memory assembly is that it can be implemented with merely one memory assembly per channel. For example,
In systems 70 and 80, there are two data sections per memory assembly. The two data sections are in different portions of the memory assembly. There are various ways in which these may be partitioned or distributed into different portions of a memory assembly.
Under some embodiments, some of the chips (or a single chip) of the memory module may hold one data section, while others of the chips (or a single chip) may hold another data section. For example,
As used herein, the term “rank” refers to those memory devices in a memory assembly that are selected together as a group. That selection may be through a “chip select” contact on the memory chips or through other techniques such as a chip select command multiplexed with other information. In some cases, there is only one rank per memory assembly and in other cases, there is more than one rank per memory assembly. Module 160 may include one, two, or more ranks. The memory assemblies of systems described in this disclosure may have one, two, or more ranks per module, depending on how they are implemented, except where they expressly are limited to a particular number of ranks (for example,
In some embodiments, the ranks correspond to the portions of the memory modules holding different data sections. For example, in
The difference between
In the case in which the chip select command is multiplexed, it may be carried on the channel. In the case of a separate “chip select” contact on the memory chips, the chip select information may be conveyed over conductors that may be considered part of the channel or separate from it. For example, in
Referring again to
There are various orders in which the data sections may be stored with respect to order of memory assemblies. For example, chart 2 shows an example of an order of data sections in system 70 (and M1 and M2 of system 80) that is different than that shown in chart 1.
As mentioned, in some embodiments, the memory controller may choose to read the redundant data section rather than the primary data section even though there is no failure in the system. One reason to do this is that it may be quicker to read from the redundant data section. For example, the desired primary data section may be in a rank that is busy or has just been accessed. In these embodiments, there may be an advantage in having primary and corresponding redundant data sections in different ranks, although there may also be disadvantages to doing this.
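One possible read-source policy along these lines is sketched below. It is a minimal sketch, not the controller's actual policy: the `Copy` record, the rank labels, and the notion of a set of busy ranks are all hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Copy:
    """Hypothetical record of where one copy of a data section resides."""
    rank: str
    failed: bool = False

def choose_read_source(primary: Copy, redundant: Copy, busy_ranks: set[str]) -> Copy:
    """Prefer the primary copy unless it has failed or its rank is busy
    while the redundant copy's rank is free."""
    if primary.failed and redundant.failed:
        raise RuntimeError("no readable copy")
    if primary.failed:
        return redundant
    if redundant.failed:
        return primary
    if primary.rank in busy_ranks and redundant.rank not in busy_ranks:
        return redundant
    return primary
```

For example, with the primary copy in rank "R0" and the redundant copy in rank "R1", passing `busy_ranks={"R0"}` selects the redundant copy, whereas an empty set selects the primary copy. If both copies shared a rank, the busy-rank test could never favor the redundant copy, which is one way to see the advantage of placing them in different ranks.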
The examples heretofore have been given with systems having two channels. However, the inventions may be implemented with systems having four channels. For example,
C. Examples with Two Data Sections per Rank
In some embodiments, there are two or more data sections per rank. Such systems may include one or more ranks per memory module. For example,
An advantage of using two data sections per rank as in
Examples of systems with two data sections per rank with more than one rank per assembly are provided below.
D. Examples with More Than Two Data Sections per Memory Assembly
In some embodiments, there are more than two data sections stored in a memory assembly. For example,
E. Examples of System Details
ECC circuitry 324 provides ECC control for memory controller 322 and may be part of memory controller 322 or separate from it. ECC control is optional. Operating system (OS) and/or other software 326 provide software control for memory controller 322. Typically, this software control would be indirectly applied to memory controller 322 through various intermediate circuitry. Failure indicating circuitry 328 provides an indication of certain failures in the memory subsystem. Examples of the failure indicating circuitry 328 include circuitry that sends a message to a display indicating the failure of a memory assembly and/or channel. Circuitry that controls a light emitting diode(s) (LED(s)) is another example. Power control circuitry 330 provides power for the memory modules. In some embodiments, when certain failures of the memory subsystem are detected, the power provided to all or some of the memory assemblies of a channel may be shut down. In other embodiments, as described, when certain failures are detected, a user manually removes power. Failure indicating circuitry 328 and power control circuitry 330 are not required. Other techniques may be used instead. The communication between memory controller 322 and failure indicating circuitry 328 and power control circuitry 330 may be indirect through various intermediate circuits.
Failure detection circuitry 350 detects triggering failures in memory assemblies. Failure detection circuitry 350 also includes circuitry to detect when the failure has been corrected. A triggering failure is one that causes the memory controller to change from interleaved mirrored memory mode to a non-interleaved mode. An example of a triggering failure is when data is lost where the data cannot be recovered without obtaining it from a redundant memory. A correctable failure is one that can be corrected without copying data from redundant memory. A correctable failure may be corrected, for example, through ECC codes without leaving the interleaved mirrored memory state. As described below, in some embodiments, several correctable failures in the same memory assembly can be interpreted as a triggering failure. Ordinarily, correctable failures will not be detected as triggering failures, but in some embodiments, some correctable failures may be treated as triggering failures. Further, in some embodiments, there could be some failures that are not correctable without replacing a memory assembly and copying data from a non-failed memory assembly that will not be treated as triggering failures. Ideally, there would be no such cases. Causes of triggering failures may include an original or developed defect in a memory assembly, extreme noise, and some soft errors.
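The distinction between correctable and triggering failures, including the case in which several correctable failures in one memory assembly are treated as a triggering failure, can be sketched as follows. The class name and the threshold value are hypothetical; the text does not specify how many correctable failures would be required.

```python
from collections import Counter

class FailureClassifier:
    """Hypothetical model of failure classification, not failure detection circuitry 350."""

    def __init__(self, correctable_limit: int = 3) -> None:
        # Assumed threshold: this many correctable failures in one assembly
        # are treated as a triggering failure.
        self.correctable_limit = correctable_limit
        self.correctable_counts: Counter = Counter()

    def report(self, assembly: str, correctable: bool) -> bool:
        """Record one failure and return True if it should be treated as triggering."""
        if not correctable:
            return True   # data must be recovered from redundant memory
        self.correctable_counts[assembly] += 1
        return self.correctable_counts[assembly] >= self.correctable_limit
```

Counts are kept per memory assembly, so correctable failures spread over different assemblies do not accumulate toward the threshold in this model.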
Referring to
When a triggering failure is detected (box 354), for example because an uncorrectable ECC error is encountered, a read is re-issued to the corresponding memory assembly or assemblies. Hardware in memory controller 322 re-assembles the data in the proper order when reads are issued to the non-failed memory assemblies. Hardware will identify the “victim” memory assembly responsible for the triggering failure, and reconfigure dynamically such that data sections in the victim remain or become redundant data sections. If necessary, designations of primary and redundant memory assembly pairs are swapped such that the preferred read destination does not include the victim.
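The swap of primary and redundant designations can be sketched as a small table update. The dictionary layout, the function name `redesignate`, and the sample assembly assignments are hypothetical and are used only to illustrate the idea that the victim never remains the preferred read destination.

```python
def redesignate(pairs: dict[str, dict[str, str]], victim: str) -> dict[str, dict[str, str]]:
    """Return a new designation table in which the victim assembly holds
    only redundant copies. `pairs` maps a data section name to the assemblies
    holding its primary and redundant copies."""
    result = {}
    for section, pair in pairs.items():
        if pair["primary"] == victim:
            # Swap so that the non-failed assembly becomes the read destination.
            result[section] = {"primary": pair["redundant"], "redundant": victim}
        else:
            # Sections whose victim copy is already redundant stay as they are.
            result[section] = dict(pair)
    return result

# Hypothetical designation table before the failure.
pairs = {
    "DA1": {"primary": "M1", "redundant": "M6"},
    "DB1": {"primary": "M2", "redundant": "M5"},
}
after = redesignate(pairs, victim="M1")
```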
Upon reception of the triggering failure notification, software may place some sort of call for service. In response, a person may notify the machine that a hot-swap event is imminent. The software routine will request fail-down to single-channel operation (box 356), isolating the channel containing the victim memory assembly. Configuration write (CFG-write) is a software action on hardware. The inventions are not limited to using this type of software control. Hardware or other types of software control could be used.
Hardware will sequence the transition from interleaved mirrored memory state operation to single-channel non-interleaved operation, ensuring that no transactions are lost, and that the change in memory configuration remains invisible to compatible software. Memory controller 322 will then electrically isolate the victim channel, disabling all clocking and tri-stating channel interface signals. Software communicates with the baseboard management controller to remove power from the victim channel. Software notifies the technician that the victim channel has been isolated, and communicates the location of the victim memory assembly. The location of the victim memory assembly may be exposed to software through, for example, the configuration register space of memory controller 322 or associated circuitry (for example, an MCH). Ways in which this may be communicated include a graphical representation sent to the display, or a baseboard controller access to illuminate an LED.
AUTO (traffic) refers to hardware automatically waiting for the traffic on the channel to complete through either finishing or canceling existing operations intended for the memory assemblies. Cancelled operations may be reissued to the non-victim channel.
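The draining step can be sketched in software terms, although the text describes it as a hardware action. The operation records, the `can_finish` flag, and the function name are hypothetical; the sketch shows only that each pending operation on the victim channel is either finished or cancelled and reissued to the other channel.

```python
def drain_channel(pending: list[dict], victim_channel: int, other_channel: int):
    """Split pending operations on the victim channel into those allowed to
    finish and those cancelled and reissued to the non-victim channel."""
    finished, reissued = [], []
    for op in pending:
        if op["channel"] != victim_channel:
            continue                       # traffic on the other channel is unaffected
        if op["can_finish"]:
            finished.append(op["id"])
        else:
            reissued.append({**op, "channel": other_channel})
    return finished, reissued

# Hypothetical pending operations at the moment fail-down is requested.
pending = [
    {"id": 1, "channel": 1, "can_finish": True},
    {"id": 2, "channel": 1, "can_finish": False},
    {"id": 3, "channel": 2, "can_finish": True},
]
finished, reissued = drain_channel(pending, victim_channel=1, other_channel=2)
```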
Once the channel has been isolated (box 358), the technician may swap out the failed memory assembly along with any other memory assemblies on that channel if desired.
Software notifies memory controller 322 through a configuration access to begin bringing the victim channel back on-line, and communicates with the baseboard controller to re-establish power to the victim channel. This sequence of events allows the memory controller to prepare for channel power-up by removing the required interface signals from tri-state (high impedance), and driving the necessary power-up state onto the interface. In some embodiments, memory controller 322 is responsible for driving clock signals used by the memory subsystem, and this reactivates that clocking as well.
Software waits an appropriate interval for power to stabilize and for any subordinate phase locked loop (PLL) and/or delay locked loop (DLL) circuitry in the memory subsystem to lock, typically a few milliseconds (box 362). At this point, software may utilize the “serial presence detect” mechanism to ensure that the victim slots are populated with the correct technology. If the swapped-in devices do not precisely match the technology in use on the channel still in use, the remainder of the sequence may produce spurious results. It would be possible for software to inform the technician that the devices are unsuitable, as well as precisely what is wrong with them.
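The compatibility check can be sketched as a field-by-field comparison of module descriptions. The descriptor fields below are hypothetical placeholders and do not reflect the actual serial presence detect data layout; the sketch shows only how software could report precisely which attributes differ.

```python
def mismatches(reference: dict, candidate: dict) -> list[str]:
    """List every attribute in which the swapped-in module differs from
    the module in use on the channel still operating."""
    keys = sorted(set(reference) | set(candidate))
    return [
        f"{key}: expected {reference.get(key)!r}, found {candidate.get(key)!r}"
        for key in keys
        if reference.get(key) != candidate.get(key)
    ]

# Hypothetical descriptors; the field names are illustrative assumptions.
in_use = {"technology": "DRAM", "capacity_mb": 512, "ranks": 2}
swapped_in = {"technology": "DRAM", "capacity_mb": 256, "ranks": 2}
problems = mismatches(in_use, swapped_in)
```

An empty list indicates that the swapped-in module matches in every compared attribute, so the sequence could proceed.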
Software then moves memory controller 322 into its re-configuring state, and utilizes internal configuration register accesses to restore the operational register settings in the memory assembly on the victim channel (box 364). This stage includes any impedance and timing calibration required prior to utilizing the replaced devices.
Once the victim channel is back up and calibrated, software moves memory controller 322 to its memory copy state, where it will automatically duplicate all of memory onto the channel including the victim memory assembly (box 368). Hardware will detect which channel was the victim such that the proper data interleave is restored by the copy operation.
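The copy step can be sketched as rebuilding the contents of the victim channel from the channel that stayed in service. The sketch assumes that redundant sections are identical copies of their primary sections and uses hypothetical names, with an ASCII apostrophe standing in for the prime mark.

```python
def counterpart(section: str) -> str:
    """Name of the mirror counterpart: DA1 <-> DA1'."""
    return section[:-1] if section.endswith("'") else section + "'"

def rebuild_channel(survivor: dict[str, bytes]) -> dict[str, bytes]:
    """Rebuild the victim channel so that each section on the surviving
    channel again has its counterpart on the other channel."""
    return {counterpart(name): bytes(data) for name, data in survivor.items()}

# Hypothetical contents of the channel that remained in service.
survivor = {"DA1": b"\x01\x02", "DA2": b"\x03\x04", "DB1'": b"\x05\x06", "DB2'": b"\x07\x08"}
restored = rebuild_channel(survivor)
```

After the copy, the restored channel holds the redundant copies of the sections whose primaries survived and the primary copies of the sections whose redundant copies survived, which restores the interleave.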
Hardware will automatically detect when the copy is complete (AUTO (complete)), and dynamically sequence itself back to interleaved mirrored memory state operation with read and write operations once again interleaved across both channels (box 370, AUTO(traffic), and box 352). As mentioned, the inventions are not restricted to the details of
There are different types of non-interleaved states. For example, in a fully non-interleaved state, the victim channel is completely shut down so that the system is no longer mirrored. In essence, the non-victim channel contains all the data sections and for practical purposes, they are all treated as primary data sections. When power is restored to the victim channel, the data is copied into its memory assemblies and the primary and redundant status of memory sections is reestablished. In a partially non-interleaved state, the victim channel and any memory assemblies therein remain operating except for the failed memory assembly (or assemblies). In that case, the data sections of the memory assembly in the non-victim channel that corresponds to the failed memory assembly remain in or change to primary status until the failure is corrected. In some cases and in some embodiments, the failure may be corrected by merely copying data from the corresponding non-failed memory assembly to the failed memory. In these cases, in this partially non-interleaved state, the victim channel does not have to be shut down as it is in
In each system shown in a figure, the memory controllers each have a different reference number to suggest that the memory controllers represented could be different. However, a memory controller may be flexible enough to have different implementations and work with some or all of the systems shown or described herein. The various channels shown in the figures may be the same or different. Which one is referred to as a first channel and which is called a second channel is arbitrary. The various memory assemblies in a particular system may be identical or different in details.
An embodiment is an implementation or example of the inventions. Reference in the specification to “an embodiment,” “one embodiment,” “some embodiments,” or “other embodiments” means that a particular feature, structure, or characteristic described in connection with the embodiments is included in at least some embodiments, but not necessarily all embodiments, of the inventions. The various appearances of “an embodiment,” “one embodiment,” or “some embodiments” are not necessarily all referring to the same embodiments.
If the specification states a component, feature, structure, or characteristic “may”, “might”, or “could” be included, that particular component, feature, structure, or characteristic is not required to be included. If the specification or claims refer to “a” or “an” element, that does not mean there is only one of the element. If the specification or claims refer to “an additional” element, that does not preclude there being more than one of the additional element.
The inventions are not restricted to the particular details listed herein. Indeed, those skilled in the art having the benefit of this disclosure will appreciate that many other variations from the foregoing description and drawings may be made within the scope of the present inventions. Accordingly, it is the following claims including any amendments thereto that define the scope of the inventions.
The present patent application is a divisional of application Ser. No. 10/290,888, filed Nov. 8, 2002, now U.S. Pat. No. 7,017,017. The present application and application Ser. No. 10/290,889 entitled “Interleaved Mirrored Memory Systems” were filed on the same day, have essentially identical specifications, and claim related subject matter.
Number | Date | Country |
---|---|---|
20050262388 A1 | Nov 2005 | US |

Relation | Number | Date | Country |
---|---|---|---|
Parent | 10290888 | Nov 2002 | US |
Child | 11181583 | | US |