The present disclosure relates generally to refreshing memory, and more specifically, to managing memory refreshes.
Dynamic random access memory (DRAM) may have more than one internal memory bank. The DRAM is normally refreshed responsive to a refresh command that is periodically issued to the DRAM by a memory controller. When the DRAM receives a refresh command, the internal memory banks wait for a time period to complete a ReFresh Cycle (tRFC) before any of the internal memory banks can be accessed by a read or a write operation. As the capacity of DRAM increases, the tRFC may also increase. For example a 1 Gigabyte (Gb), a 2 Gb, a 4 Gb, and a 8 Gb double data rate (DDR) 3 DRAM may have a tRFC of 110 nanoseconds (ns), 160 ns, 300 ns, and 350 ns, respectively.
In a memory subsystem that supports multiple memory ranks, refresh commands between any two memory ranks are typically not overlapped. Overlapping the refresh commands increases the probability that an incoming read command will wait (for the overlapping refresh cycles to complete) before the read command can be sent to the DRAMs. To avoid overlapping refresh commands, an interval timer may be used to equally space refresh commands issued by the memory controller. Under some circumstances (e.g., during link retraining), the memory controller may place the DRAM in a self-refresh mode. During the self-refresh mode, the DRAM may not wait for refresh commands from the memory controller to initiate a refresh. The DDR DRAM specification states that a memory controller issue at least one refresh command after the DRAM exits from a first self-refresh mode and before the DRAM enters into a second self-refresh mode.
When a memory controller determines that a number of transmission errors occurring at a high-speed memory bus exceeds a predetermined threshold, the memory controller may initiate a link retraining process to resynchronize the high-speed memory bus to reduce the transmission errors. As part of the link retraining process, the DRAM may be placed in a self-refresh mode. After the link retraining process has completed, the DRAM may come out of the self-refresh mode. If the memory controller determines that the number of transmission errors occurring at the high-speed memory bus again exceeds the predetermined threshold, the memory controller cannot initiate a second link retraining. Initiating a second retraining would cause the DRAM to be placed into a second self-refresh mode. However, placing the DRAM in a second self-refresh mode may violate the DDR DRAM specification, because the memory controller may not have issued at least one refresh command to the DRAM after the DRAM exited from the self-refresh mode and before the DRAM entered into the second self-refresh mode.
In a particular embodiment, a method of managing memory includes determining, at a memory controller device, that a number of transmission errors between a memory controller port and a memory redrive device exceeds an error threshold. The method also includes initiating a first link retraining process between the memory controller port and the memory redrive device. The method further includes placing one or more dynamic random access memory modules associated with the memory redrive device in a self-refresh mode. The method also includes removing the one or more dynamic random access memory modules from the self-refresh mode after the link retraining process has completed. The method further includes enabling overlapping refreshes of the one or more dynamic random access memory modules.
In another particular embodiment, a system includes one or more dynamic random access memory modules. The system also includes one or more memory redrive devices coupled to the one or more dynamic random access memory modules via one or more memory buses. The system also includes a memory controller device having one or more memory controller ports coupled to the one or more memory redrive devices via pairs of high speed buses. The memory controller device is capable of executing logic to determine that a number of transmission errors occurring at a particular pair of the high speed buses exceeds an error threshold. The memory controller device is capable of initiating a link retraining process between a memory controller port coupled to a memory redrive device via the particular pair of the high speed buses. The memory controller device is capable of placing in a self-refresh mode at least one dynamic random access memory module that is associated with the memory redrive device. The memory controller device is capable of removing the at least one dynamic random access memory module from the self-refresh mode after the link retraining process has completed. The memory controller device is capable of initiating at least one overlapping refresh of the one or more dynamic random access memory modules.
In another particular embodiment, a computer program product includes a computer usable medium having computer usable program code embodied therewith. The computer usable program code is executable by a processor to determine that a particular event has occurred at a computer. The computer usable program code is also executable by the processor to enable overlapping refreshes of one or more dynamic random access memory modules in response to determining that the particular event has occurred at the computer. The computer usable program code is further executable by the processor to initiate at least one overlapping refresh of the one or more dynamic random access memory modules. The computer usable program code is also executable by the processor to disable overlapping refreshes of the one or more dynamic random access memory modules after the at least two overlapping refreshes of the one or more dynamic random access memory modules has completed.
These and other advantages and features that characterize embodiments of the invention are set forth in the claims annexed hereto and forming a further part hereof. However, for a better understanding of the invention, and of the advantages and objectives attained through its use, reference should be made to the drawings and to the accompanying descriptive matter in which there are described exemplary embodiments of the invention.
Embodiments of the present invention may improve memory management by dynamically allowing memory refreshes of different ranks to overlap in a manner that may reduce latency and increase overall efficiency. In one example, a memory controller device may detect transmission errors, such as cyclic redundancy check (CRC) errors, occurring at a high speed bus that couples a memory redrive device to a memory controller port of the memory controller device. The high speed bus may be one of a serial management interface (SMI) bus, a fully buffered dual inline memory module (FB-DIMM) bus, and a fully buffered dual inline memory module 2 (FBD2) bus.
When a number of transmission errors between the memory controller port and the memory redrive device exceeds an error threshold, the memory controller device may initiate a first link retraining process between the memory controller port and the memory redrive device. The number of transmission errors may include a first number of transmission errors associated with a current transfer between the memory controller port and the memory redrive device. The number of transmission errors may also include a second number of transmission errors associated with two previous transfers between the memory controller port and the memory redrive device.
Initiating the first link training process may include placing one or more DRAM modules associated with the memory redrive device into a self-refresh mode. The memory controller device may remove the one or more DRAM modules from the self-refresh mode after the link retraining process has completed. The memory controller device may enable and initiate overlapping refreshes of the one or more DRAM modules.
Upon completion of the overlapping refreshes, the memory controller device may disable overlapping refreshes for the one or more DRAM modules. The memory controller device may initiate a second link retraining process between the memory controller port and the memory redrive device.
Turning particularly to the drawings,
The memory controller device 102 is configured to send commands to read and write data to and from the memory 104. The memory controller device 102 includes one or more memory controller ports, such as the representative memory controller ports 110-113. The memory controller device 102 may include logic 114 (e.g., operational instructions) that may be executable by the memory controller device 102 to perform various functions of the memory controller device 102.
The memory redrive devices 106-109 are coupled to the memory controller ports 110-113 via pairs of high speed buses 103 that include representative pairs of high speed buses 120-123. The high speed buses 120-123 may include: an SMI bus, an FB-DIMM bus, an FBD2 bus, another type of high-speed memory bus, or any combination thereof.
The pairs of the high speed buses 120-123 may experience transmission errors. In
One or more of the memory redrive devices 106-109 may receive command packets from one or more of the memory controller ports 110-113. For example, the memory redrive device 109 may receive command packets 132 from the memory controller port 113. One or more of the memory redrive devices 106-109 may decode the received command packets and reformat the decoded command packets into DRAM commands, such as the representative DRAM commands 133.
One or more of the memory redrive devices 106-109 may send the DRAM commands to the memory 104 via one or more of the memory buses 105. The memory buses 105 may include command (CMD) buses 124, 125, 126 and 127 and data buses 128, 129, 130, and 131. In a particular embodiment, the memory buses 105 may be DDR buses. After sending DRAM commands to the memory 104, one or more of the memory redrive devices 106-109 may communicate a result of sending the DRAM commands back to the memory controller device 102 via one or more of the high speed buses 120-123.
The memory controller device 102 may store data at the memory 104 and retrieve stored data from the memory 104. The memory 104 may include DRAM, such as the representative DRAM 115, 116, 117, and 118. In a particular embodiment, each of the DRAMs 115-118 may be packaged as a Dual Inline Memory Module (DIMM). The DRAMs 115-118 may be DDR memory, DDR2 memory, DDR3 memory, DDR4 memory, Quad Date Rate (QDR) memory, another type of high-speed data rate memory, or any combination thereof.
In operation, the memory controller device 102 may execute the logic 114 to detect the transmission errors 140 occurring at one of more of the pairs of high speed buses 120-123. When the memory controller device 102 determines that a number of the transmission errors 140 exceeds an error threshold 142, the memory controller device 102 may initiate a link retraining 144 of the link (e.g., the pair of high speed buses 121) between the affected memory controller port (e.g., the memory control port 111) and affected memory redrive device (e.g., the memory redrive device 107).
The link retraining 144 may resynchronize the affected memory controller port 111 and the affected memory redrive device 107 via their associated pair of high speed buses 121 to reduce the transmission errors 140. As part of the link retraining 144, the DRAM associated with the affected memory redrive device may be placed into a self-refresh mode. For example, when the link retraining 144 is initiated between the memory controller port 111 and the memory redrive device 107, the DRAM 116 may be placed into a self-refresh mode 146.
When the link retraining 144 has completed, the memory controller device 102 may remove the associated DRAM (e.g., the DRAM 116) from the self-refresh mode (e.g., the self-refresh mode 146). When the associated DRAM (e.g., the DRAM 116) is no longer in self-refresh mode, the memory controller device 102 may enable overlapping refreshes of the DRAMs 115-118. Subsequently or concurrently, overlapping refreshes of the associated DRAM (e.g., the DRAM 116) may be initiated. The overlapping refreshes of the associated DRAM may be initiated by the memory controller device 102, one of the memory controller ports 110-113, one of the memory redrive devices 106-109, or any combination thereof. When the overlapping refreshes of the associated DRAM (e.g., the DRAM 116) has have completed, the memory controller device 102 may disable overlapping refreshes of the DRAMs 115-118. By performing overlapping refreshes of the DRAM 116 after the link retraining 144, the memory controller device 102 may be able to perform at least one refresh of the DRAM 116 after the DRAM 116 exits from a first self-refresh and before the DRAM 116 re-enters into a second self-refresh.
Thus, the memory controller device 102 may enable overlapping refreshes of the DRAMs 115-118, initiate overlapping refreshes of one or more of the DRAMs 115-118, and disable overlapping refreshes of the DRAMs 115-118 during different situations, such as after a link retraining of a particular pair of high speed buses (e.g. the pair of high speed buses 121). A second situation where the memory controller device 102 may initiate overlapping memory refreshes may occur during an initialization of a computer. For example, after the system 100 has been configured during the initialization, the memory controller device 102 may enable overlapping refreshes of the DRAMs 115-118, initiate overlapping refreshes of one or more of the DRAMs 115-118, and disable overlapping refreshes of the DRAMs 115-118. Performing overlapping refreshes during initialization may be useful when testing the system 100 because link retraining may be initiated after the initialization has completed without initiating a refresh of one or more of the DRAMs 115-118.
A third situation where the memory controller device 102 may initiate overlapping memory refreshes may occur when the memory controller device 102 receives a command destined for one of the DRAMs 115-118 that is in a low power mode. For example, to reduce power consumption of the DRAMs 115-118, the memory controller device 102 may place one or more of the DRAMs 115-118 in a low power mode (e.g., when one or more of the DRAMs 115-118 has not been accessed for more than a particular time period). The low power mode may be similar to the self-refresh mode 146 because the DRAM 116 may self-refresh without receiving refresh commands from the memory controller 102. When the memory controller device 102 receives a command and determines that the command is destined for one or more of the DRAMs 115-118 that is in the low power mode, the memory controller device 102 may remove one or more of the DRAMs 115-118 from the low power mode. The memory controller device 102 may initiate overlapping refreshes of one or more of the DRAMs 115-118. The memory controller device 102 may forward the received command to the destination DRAM after the overlapping refreshes of the one or more DRAMs 115-118 have completed.
Thus, the memory controller device 102 or the memory redrive devices 106-109 may initiate overlapping refreshes of one or more of the DRAMs 115-118 during different situations, such as after link retraining, during initialization, or after receiving a command destined for one or more of the DRAMs 115-118 that are in a low power mode. By doing so, the memory control device 102 may issue at least one refresh command to one or more of the DRAMs 115-118 after one or more of the DRAMs 115-118 exit from a self-refresh mode, thereby complying with the DDR specification.
The first quad rank DIMM 204 may include a buffer 210 and memory ranks 212, 213, 214, and 215. The second quad rank DIMM 205 may include memory ranks 216, 217, 218, and 219. Each of the memory ranks 212-219 may include multiple DRAM modules. Each DRAM module may supply 4, 8, 16, . . . 2″ data bits, where n>1. Each DRAM module may be subdivided into eight internal banks. Each internal bank may be accessed, with up to four accesses in progress during a rolling “four active window.”
An internal bank may not be accessed during a memory refresh command because the memory refresh command may refresh all banks within a particular rank. During runtime operation, overlapping of refreshes may not be desirable because overlapping refreshes may decrease the probability that an incoming command may be sent to the memory for execution with minimal latency.
The memory controller device 201 may initiate overlapping refreshes of the quad rank memory DIMMs 204 and 205 during different situations, such as after link retraining, during initialization, or after receiving a command destined for one or more of the quad rank memory DIMMs 204 and 205 that are in a low power mode. In this manner, the memory control device 201 may issue at least one refresh command to the quad rank memory DIMMs 204 and 205 after the quad rank memory DIMMs 204 and 205 exit from a self-refresh mode.
The method begins at 302, by detecting transmission errors in communications between a memory controller and a memory redrive device. For example, the memory controller 102 of
At 304, a determination may be made as to whether a number of the transmission errors exceeds an error threshold. The number of the transmission errors may include a first number of transmission errors associated with a current transfer between the memory controller port and the memory redrive device and include a second number of transmission errors associated with two previous transfers between the memory controller port and the memory redrive device. For example, the error threshold may be four transmission errors per second. To illustrate, in
Advancing to 308, one or more DRAM modules associated with the memory redrive device may be placed in a self-refresh mode. For example, the memory controller 102 of
At 310, the one or more DRAM modules may be removed from the self-refresh mode. For example, in
Continuing to 312, overlapping refreshes of the one or more DRAM modules may be enabled. At 314, at least two overlapping refreshes of the one or more DRAM modules may be initiated. For example, the memory controller device 102 of
At 316, the overlapping refreshes of the DRAM modules may be disabled after completing the at least two overlapping refreshes. For instance, the memory controller device 102 of
Thus, a memory controller device may initiate overlapping refreshes of DRAM after a link retraining process during which the DRAM was placed in a self-refresh mode. The memory control device may issue at least one refresh command to the DRAM after the DRAM exits from a self-refresh mode.
The method begins at 402, by determining that a particular event has occurred. At 404, a determination may be made as to whether the particular event is one of a pre-specified set of events (e.g., link retraining completed, configuration completed during initialization, command received that is destined for a memory that is in a low power mode).
When the determination is “no,” i.e., the particular event is not one of the pre-specified set of events, the method may return to 402. When the determination is alternatively “yes,” i.e., the particular event is one of a pre-specified set of events, overlapping refreshes of one or more DRAM modules are enabled, at 406. For example, when the memory controller device 102 of
Advancing to 408, at least two overlapping refreshes of the one or more DRAM modules may be initiated. At 410, the overlapping refreshes of the one or more DRAM modules may be disabled after completing the at least two overlapping refreshes. For example, the memory controller device 102 of
Thus, a memory controller device may initiate overlapping refreshes of DRAM during different situations, such as after completing a link retraining process, during an initialization process, or after receiving a command destined for the DRAM when the DRAM is in a low power mode. By doing so, the memory control device can issue at least one refresh command to the DRAM after the DRAM exits from a self-refresh mode.
A computer 502 may be implemented as a server or multi-user computer system that is coupled via a network 504 to one or more client computers 506 and 508. For the purposes of the illustrated embodiment, each of the computers 502, 506, and 508 may represent any type of computer, computer system or other programmable electronic device. Moreover, each of the computers 502, 506, and 508 may be implemented using one or more networked computers, e.g., in a cluster or other distributed computing system. In the alternative, applications may be implemented within a single computer or other programmable electronic device, such as a desktop computer, a laptop computer, a handheld computer, a cell phone, a set top box, a non-networked, standalone configuration, etc.
The computer 502 typically comprises a central processing unit (CPU) 508 that includes at least one microprocessor coupled to a memory 510. The memory 510 may represent random access memory (RAM) devices comprising a main storage device of the computer 502, as well as any supplemental levels of memory, such as cache memories, non-volatile or backup memories (programmable or flash memories), read-only memories, another type of memory, or any combination thereof. In addition, the memory 510 may be considered to include memory storage physically located elsewhere in the computer 502, e.g., any cache memory in a processor in CPU 508, as well as any storage capacity used as a virtual memory, as stored on a mass storage device 512 or on another computer coupled to the computer 502.
The computer 502 may receive a number of inputs and outputs for communicating information externally. For interface with a user or operator, the computer 502 may include a user interface 514 incorporating one or more user input devices (e.g., a keyboard, a mouse, a trackball, a joystick, a touchpad, and/or a microphone, etc.) and a display (e.g., a CRT monitor, an LCD display panel, and/or a speaker, etc.). User input may also be received via another computer or terminal.
For additional storage, the computer 502 may also include one or more mass storage devices 512, such as a floppy or other removable disk drive, a hard disk drive, a direct access storage device (DASD), an optical drive (a CD drive, a DVD drive, etc.), and/or a tape drive, among others. Furthermore, the computer 512 may include a network interface 516 with one or more networks 504 (e.g., a LAN, a WAN, a wireless network, and/or the Internet, and so on) to permit the communication of information with other computers and electronic devices. It should be appreciated that the computer 502 may include suitable analog and/or digital interfaces between CPU 508 and each of components 510, 512, 514, and 516 as is well known in the art.
The computer 502 operates under the control of an operating system 518 and executes or otherwise relies upon various computer software applications, components, programs, objects, modules, data structures, etc., as will be described in greater detail below. Moreover, various applications, components, programs, objects, modules, etc. may also execute on one or more processors in another computer coupled to the computer 502 via the network 504, e.g., in a distributed or client-server computing environment, whereby the processing required to implement the functions of a computer program may be allocated to multiple computers over a network.
As shown in
In general, the routines executed to implement the embodiments of the invention, whether implemented as part of an operating system or a specific application, component, program, object, module or sequence of instructions, or even a subset thereof, will be referred to herein as “algorithms,” “program code,” or simply “programs.” Program code typically comprises one or more instructions that are resident at various times in various memory and storage devices in a computer. When read and executed by one or more processors in a computer, the instructions cause that computer to perform the steps necessary to execute steps or elements embodying the various aspects of embodiments of the invention.
While embodiments of the invention have been described in the context of fully functioning computers and computer systems, those skilled in the art will appreciate that the various embodiments of the invention are capable of being distributed as a program product in a variety of forms, and that embodiments of the invention apply equally regardless of the particular type of computer readable media used to actually carry out the distribution. Examples of computer readable media comprise, but are not limited to tangible, recordable type media and transmission type media. Examples of tangible, recordable type media include volatile and non-volatile memory devices, floppy and other removable disks, hard disk drives, magnetic tape, and optical disks (CD-ROMs, DVDs, etc.). Examples of transmission type media include digital and analog communication links.
In addition, various program code described herein may be identified based upon the application within which it is implemented in a specific embodiment of the invention. However, it should be appreciated that any particular program nomenclature is used merely for convenience, and thus embodiments of the invention should not be limited to use solely in any specific application identified and/or implied by such nomenclature.
Furthermore, given the typically endless number of manners in which computer programs may be organized into routines, procedures, methods, modules, objects, and the like, as well as the various manners in which program functionality may be allocated among various software layers that are resident within a typical computer (e.g., operating systems, libraries, API's, applications, applets, etc.), it should be appreciated that embodiments of the invention are not limited to the specific organization and allocation of program functionality described herein.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the disclosed embodiments. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the scope of the disclosure. Thus, the present disclosure is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope possible consistent with the principles and features as defined by the following claims.