Computers, smartphones, and other electronic devices rely on processors and memories. A processor executes code based on data to run applications and provide features to a user. The processor obtains the code and the data from a memory. The memory in an electronic device can include volatile memory (e.g., random-access memory (RAM)) and non-volatile memory (e.g., flash memory). Like the capabilities of a processor, the capabilities of a memory can impact the performance of an electronic device. This performance impact can increase as processors are developed that execute code faster and as applications operate on increasingly larger data sets that require ever-larger memories.
Apparatuses of and techniques for logging a memory address associated with faulty usage-based disturbance data are described with reference to the following drawings. The same numbers are used throughout the drawings to reference like features and components:
Processors and memory work in tandem to provide features to users of computers and other electronic devices. As processors and memory operate more quickly together in a complementary manner, an electronic device can provide enhanced features, such as high-resolution graphics and artificial intelligence (AI) analysis. Some applications, such as those for financial services, medical devices, and advanced driver assistance systems (ADAS), can also demand more-reliable memories. These applications use increasingly reliable memories to limit errors in financial transactions, medical decisions, and object identification. However, in some implementations, more-reliable memories can sacrifice bit densities, power efficiency, and simplicity.
To meet the demands for physically smaller memories, memory devices can be designed with higher chip densities. Increasing chip density, however, can increase the electromagnetic coupling (e.g., capacitive coupling) between adjacent or proximate rows of memory cells due, at least in part, to a shrinking distance between these rows. With this undesired coupling, activation (or charging) of a first row of memory cells can sometimes negatively impact a second nearby row of memory cells. In particular, activation of the first row can generate interference, or crosstalk, that causes the second row to experience a voltage fluctuation. In some instances, this voltage fluctuation can cause a state (or value) of a memory cell in the second row to be incorrectly determined by a sense amplifier. Consider an example in which a state of a memory cell in the second row is a “1.” In this example, the voltage fluctuation can cause a sense amplifier to incorrectly determine the state of the memory cell to be a “0” instead of a “1.” Left unchecked, this interference can lead to memory errors or data loss within the memory device.
In some circumstances, a particular row of memory cells is activated repeatedly in an unintentional or intentional (sometimes malicious) manner. Consider, for instance, that memory cells in an Rth row are subjected to repeated activation, which causes one or more memory cells in a proximate row (e.g., within an R+1 row, an R+2 row, an R−1 row, and/or an R−2 row) to change states. This effect is referred to as usage-based disturbance. The occurrence of usage-based disturbance can lead to the corruption or changing of contents within the affected row of memory.
Some memory devices utilize circuits that can detect usage-based disturbance and mitigate its effects. To monitor for usage-based disturbance, a memory device can store an activation count within each row of a memory array. The activation count keeps track of a quantity of accesses or activations of the corresponding memory row. If the activation count meets or exceeds a threshold, proximate rows, including one or more adjacent rows, may be at increased risk for data corruption due to the repeated activations of the accessed row and the usage-based disturbance effect. To manage this risk to the affected rows, the memory device can refresh the proximate rows.
The effectiveness of this protective feature is jeopardized, however, if an activation count malfunctions or is otherwise faulty. The activation count, for instance, can become corrupted when read or written during the array counter update procedure. In another aspect, the memory cells that store the activation count can fail to retain the stored value of the activation count.
The memory device can perform a repair process that replaces a faulty activation count in a permanent (or “hard”) manner or in a temporary (or “soft”) manner. The repair process, however, is initiated by a host device (or a memory controller). In some implementations, the host device may not have the means to directly detect the faulty activation count. Without the ability to write to or read from the memory cells that store the activation count, for instance, the host device may be unable to assess whether or not the activation count is faulty. Consequently, the host device may be unable to initiate the repair process when an activation count becomes faulty.
To address this and other issues regarding usage-based disturbance, this document describes techniques for logging a memory address associated with faulty usage-based disturbance data. In an example aspect, a memory device stores usage-based disturbance data within a subset of memory cells of multiple rows of a memory array. The memory device can detect, at a local-bank level, a fault associated with the usage-based disturbance data. This detection enables the memory device to log an address associated with the faulty usage-based disturbance data. To avoid increasing a complexity and/or a size of the memory device, some implementations of the memory device can perform the address logging at the multi-bank level with the assistance of an engine, such as a test engine. The memory device stores the logged address in at least one mode register to communicate the fault to a memory controller. With the logged address, the memory controller can initiate a repair procedure to fix the faulty usage-based disturbance data.
In example implementations, the apparatus 102 can include at least one host device 104, at least one interconnect 106, and at least one memory device 108. The host device 104 can include at least one processor 110, at least one cache memory 112, and a memory controller 114. The memory device 108, which can also be realized with a memory module, can include, for example, a dynamic random-access memory (DRAM) die or module (e.g., Low-Power Double Data Rate synchronous DRAM (LPDDR SDRAM)). The DRAM die or module can include a three-dimensional (3D) stacked DRAM device, which may be a high-bandwidth memory (HBM) device or a hybrid memory cube (HMC) device. The memory device 108 can operate as a main memory for the apparatus 102. Although not illustrated, the apparatus 102 can also include storage memory. The storage memory can include, for example, a storage-class memory device (e.g., a flash memory, hard disk drive, solid-state drive, phase-change memory (PCM), or memory employing 3D XPoint™).
The processor 110 is operatively coupled to the cache memory 112, which is operatively coupled to the memory controller 114. The processor 110 is also coupled, directly or indirectly, to the memory controller 114. The host device 104 may include other components to form, for instance, a system-on-a-chip (SoC). The processor 110 may include a general-purpose processor, central processing unit, graphics processing unit (GPU), neural network engine or accelerator, application-specific integrated circuit (ASIC), field-programmable gate array (FPGA) integrated circuit (IC), or communications processor (e.g., a modem or baseband processor).
In operation, the memory controller 114 can provide a high-level or logical interface between the processor 110 and at least one memory (e.g., an external memory). The memory controller 114 may be realized with any of a variety of suitable memory controllers (e.g., a double-data-rate (DDR) memory controller that can process requests for data stored on the memory device 108). Although not shown, the host device 104 may include a physical interface (PHY) that transfers data between the memory controller 114 and the memory device 108 through the interconnect 106. For example, the physical interface may be an interface that is compatible with a DDR PHY Interface (DFI) Group interface protocol. The memory controller 114 can, for example, receive memory requests from the processor 110 and provide the memory requests to external memory with appropriate formatting, timing, and reordering. The memory controller 114 can also forward to the processor 110 responses to the memory requests received from external memory.
The host device 104 is operatively coupled, via the interconnect 106, to the memory device 108. In some examples, the memory device 108 is connected to the host device 104 via the interconnect 106 with an intervening buffer or cache. The memory device 108 may operatively couple to storage memory (not shown). The host device 104 can also be coupled, directly or indirectly via the interconnect 106, to the memory device 108 and the storage memory. The interconnect 106 and other interconnects (not illustrated in
The illustrated components of the apparatus 102 represent an example architecture with a hierarchical memory system. A hierarchical memory system may include memories at different levels, with each level having memory with a different speed or capacity. As illustrated, the cache memory 112 logically couples the processor 110 to the memory device 108. In the illustrated implementation, the cache memory 112 is at a higher level than the memory device 108. A storage memory, in turn, can be at a lower level than the main memory (e.g., the memory device 108). Memory at lower hierarchical levels may have a decreased speed but increased capacity relative to memory at higher hierarchical levels.
The apparatus 102 can be implemented in various manners with more, fewer, or different components. For example, the host device 104 may include multiple cache memories (e.g., including multiple levels of cache memory) or no cache memory. In other implementations, the host device 104 may omit the processor 110 or the memory controller 114. A memory (e.g., the memory device 108) may have an “internal” or “local” cache memory. As another example, the apparatus 102 may include cache memory between the interconnect 106 and the memory device 108. Computer engineers can also include any of the illustrated components in distributed or shared memory systems.
Computer engineers may implement the host device 104 and the various memories in multiple manners. In some cases, the host device 104 and the memory device 108 can be disposed on, or physically supported by, a printed circuit board (e.g., a rigid or flexible motherboard). The host device 104 and the memory device 108 may additionally be integrated together on an integrated circuit or fabricated on separate integrated circuits and packaged together. The memory device 108 may also be coupled to multiple host devices 104 via one or more interconnects 106 and may respond to memory requests from two or more host devices 104. Each host device 104 may include a respective memory controller 114, or the multiple host devices 104 may share a memory controller 114. This document describes with reference to
Two or more memory components (e.g., modules, dies, banks, or bank groups) can share the electrical paths or couplings of the interconnect 106. The interconnect 106 can include at least one command-and-address bus (CA bus) and at least one data bus (DQ bus). The command-and-address bus can transmit addresses and commands from the memory controller 114 of the host device 104 to the memory device 108, which may exclude propagation of data. The data bus can propagate data between the memory controller 114 and the memory device 108. The memory device 108 may also be implemented as any suitable memory including, but not limited to, DRAM, SDRAM, three-dimensional (3D) stacked DRAM, DDR memory, or LPDDR memory (e.g., LPDDR DRAM or LPDDR SDRAM).
The memory device 108 can form at least part of the main memory of the apparatus 102. The memory device 108 may, however, form at least part of a cache memory, a storage memory, or a system-on-chip of the apparatus 102. The memory device 108 includes at least one instance of usage-based disturbance circuitry 120 (UBD circuitry 120) and at least one instance of usage-based disturbance data repair circuitry 122 (UBD data repair circuitry 122).
The usage-based disturbance circuitry 120 mitigates usage-based disturbance for one or more banks associated with the memory device 108. The usage-based disturbance circuitry 120 can be implemented using software, firmware, hardware, fixed circuit circuitry, or combinations thereof. The usage-based disturbance circuitry 120 can also include at least one counter circuit for detecting conditions associated with usage-based disturbance, at least one queue for managing refresh operations for mitigating the usage-based disturbance, and/or at least one error-correction-code (ECC) circuit for detecting and/or correcting bit errors associated with usage-based disturbance.
One aspect of usage-based disturbance mitigation involves keeping track of how often a row is activated or accessed since a last refresh. In particular, the usage-based disturbance circuitry 120 performs an array counter update procedure using the counter circuit to update an activation count associated with an activated row. During the array counter update procedure, the usage-based disturbance circuitry 120 reads the activation count that is stored within the activated row, increments the activation count, and writes the updated activation count to the activated row. By maintaining the activation count, the usage-based disturbance circuitry 120 can determine when to perform a refresh operation to reduce the risk of usage-based disturbance. For example, when the activation count meets or exceeds a threshold, the usage-based disturbance circuitry 120 can perform a mitigation procedure that refreshes one or more rows that are near the activated row to mitigate the usage-based disturbance.
Generally speaking, the techniques for logging a memory address associated with faulty usage-based disturbance data can be performed, at least partially, by the usage-based-disturbance data repair circuitry 122. More specifically, these techniques can be implemented using at least one detection circuit 124 and at least one address logging circuit 126. The address logging can be performed at a local-bank level 128 or at a multi-bank level 130, as further described below.
The detection circuit 124 detects an occurrence (or absence) of a fault associated with data that is referenced by the usage-based disturbance circuitry 120 to mitigate usage-based disturbance. This data is referred to as usage-based disturbance data. Generally speaking, the memory device 108 can perform a variety of error detection tests to determine whether or not the usage-based disturbance data (or memory cells that store the usage-based disturbance data) is faulty. Example error detection tests include a parity bit check, an error-correcting-code check, a checksum check, a cyclic redundancy check, another type of error detection procedure, or some combination thereof. In some implementations, the detection circuit 124 performs the error detection test and therefore directly detects the fault. In other implementations, the usage-based disturbance circuitry 120 performs the error detection test as part of the array counter update procedure. In this case, the detection circuit 124 stores information about any faults detected by the usage-based disturbance circuitry 120. The detection circuit 124 communicates the occurrence of the detected fault to the address logging circuit 126.
At the multi-bank level 130, the address logging circuit 126 logs (or captures) an address associated with the faulty usage-based disturbance data based on the detection circuit 124 indicating the occurrence of the detected fault. The address logging circuit 126 can further provide the logged address to other components of the memory device 108 so that the occurrence of the fault and the logged address can be communicated to the host device 104.
In example implementations, the detection circuit 124 is implemented at the local-bank level 128. This means that each detection circuit 124 detects the occurrence of faults within a corresponding bank of the memory device 108. The address logging circuit 126, in contrast to the detection circuit 124, is implemented at the multi-bank level 130. This means that one instance of the address logging circuit 126 can service two or more banks of the memory device 108. At the multi-bank level 130, the address logging circuit 126 can readily pass information about the detected fault in a manner that enables the host device 104 to initiate the repair procedure. The local-bank level 128 implementation of the detection circuit 124 and the multi-bank level 130 implementation of the address logging circuit 126 are further described with respect to
The usage-based-disturbance data repair circuitry 122 enables information about the occurrence of the fault and the address associated with the fault to be communicated to or accessed by the host device 104 (e.g., the memory controller 114). With this information, the host device 104 can initiate a repair procedure to fix the faulty data within the memory device 108. One type of repair procedure is a hard post-package repair (hPPR) procedure. For the hard post-package repair procedure, the memory controller 114 can request that the memory device 108 permanently repair a whole combination row, including the faulty data used for usage-based disturbance mitigation. With this repair procedure, however, the viability of existing data stored in the memory row is uncertain. Further, the permanent, nonvolatile nature of the hard post-package repair can entail blowing a fuse. The procedure is relatively lengthy and can often be performed only during power up and initialization, or with a full memory reset, instead of in real-time while the memory device 108 is functional and performing memory operations for the host device 104.
In contrast with the hard post-package repair, a soft post-package repair (sPPR) is a temporary repair procedure that is significantly faster. Further, although a soft post-package repair procedure produces a volatile repair, the soft post-package repair procedure can be performed in real-time responsive to detection of a failure. If a memory row is being repaired, the computing system may be responsible, however, for handling the data transfer (e.g., a full page of data) from the memory row corresponding to the faulty activation count to a spare counter and memory row combination. This data transfer can consume an appreciable amount of time while occupying the data bus. Other components of the memory device 108 are further described with respect to
The control circuitry 208 can include various components that the memory device 108 can use to perform various operations. These operations can include communicating with other devices, managing memory performance, performing refresh operations (e.g., self-refresh operations or auto-refresh operations), and performing memory read or write operations. In the depicted configuration, the control circuitry 208 includes the usage-based disturbance data repair circuitry 122, at least one array control circuit 210, at least one instance of clock circuitry 212, and at least one mode register 214. The control circuitry 208 can also optionally include at least one engine 216.
The array control circuit 210 can include circuitry that provides command decoding, address decoding, input/output functions, amplification circuitry, power supply management, power control modes, and other functions. The clock circuitry 212 can synchronize various memory components with one or more external clock signals provided over the interconnect 106, including a command-and-address clock or a data clock. The clock circuitry 212 can also use an internal clock signal to synchronize memory components and may provide timer functionality.
In general, the control circuitry 208 stores the addresses that are logged by the usage-based disturbance data repair circuitry 122 in a manner that can be accessed by the memory controller 114. With this information, the memory controller 114 can initiate an appropriate repair procedure. In an example implementation, the mode register 214 facilitates control by and/or communication with the memory controller 114 (or one of the processors 202). Using the mode register 214, the memory device 108 can communicate information to the memory controller 114. Such communications can cause entry into or exit from a repair mode or a command that provides a memory row address to target for a repair procedure. To facilitate this communication, the mode register 214 may include one or more registers having at least one bit relating to usage-based disturbance repair functionality.
When implemented and enabled, the engine 216 can access each row of the memory array 204 in a controlled manner. The manner in which the engine 216 accesses the rows of the memory array 204 can be in accordance with an automatic mode or a manual mode. Generally, given sufficient time, the engine 216 accesses all rows of the memory array 204. In some implementations, the engine 216 accesses the rows of the memory array 204 in a periodic or cyclic manner. An order in which the engine 216 access the rows can be a predetermined order, a rule-based order, or a randomized order. In some implementations, the engine 216 is implemented as a test engine, which can detect and/or correct errors within at least a subset of the data that is stored within the rows. Example engines include an error-check and scrub engine (ECS engine), an add-based engine, or a refresh engine.
The memory device 108 also includes the usage-based disturbance circuitry 120. In some aspects, the usage-based disturbance circuitry 120 can be considered part of the control circuitry 208. For example, the usage-based disturbance circuitry 120 can represent another part of the control circuitry 208. The usage-based disturbance circuitry 120 can be coupled to a set of memory cells within the memory array 204 that store usage-based disturbance data 218 (UBD data 218). The usage-based disturbance data 218 can include information such as an activation count, which represents a quantity of times one or more rows within the memory array 204 have been activated (or accessed) by the memory device 108. In example implementations, each row of the memory array 204 includes a subset of memory cells that stores the usage-based disturbance data 218 associated with that row, as further described with respect to
The interface 206 can couple the control circuitry 208 or the memory array 204 directly or indirectly to the interconnect 106. In some implementations, the usage-based disturbance circuitry 120, the usage-based disturbance data repair circuitry 122, the array control circuit 210, the clock circuitry 212, the mode register 214, and the engine 216 can be part of a single component (e.g., the control circuitry 208). In other implementations, one or more of the usage-based disturbance circuitry 120, the usage-based disturbance data repair circuitry 122, the array control circuit 210, the clock circuitry 212, the mode register 214, or the engine 216 may be implemented as separate components, which can be provided on a single semiconductor die or disposed across multiple semiconductor dies. These components may individually or jointly couple to the interconnect 106 via the interface 206.
The interconnect 106 may use one or more of a variety of interconnects that communicatively couple together various components and enable commands, addresses, or other information and data to be transferred between two or more components (e.g., between the memory device 108 and the processor 202). Although the interconnect 106 is illustrated with a single line in
In some aspects, the memory device 108 may be a “separate” component relative to the host device 104 (of
As shown in
In some implementations, the processors 202 may be connected directly to the memory device 108 (e.g., via the interconnect 106). In other implementations, one or more of the processors 202 may be indirectly connected to the memory device 108 (e.g., over a network connection or through one or more other devices). The memory array 204 is further described with respect to
Each of the rows 302 can store normal data 306 within a first subset of the memory cells associated with that row 302. The normal data 306 represents data that is read from or written to the memory device 108 during normal memory operations (e.g., during normal read or write operations). The normal data 306, for example, can include data that is transmitted by the memory controller 114 and is written to one or more rows 302 of the memory array 204.
In addition to the normal data 306, each of the rows 302 can store usage-based disturbance data 218 within a second subset of the memory cells associated with that row 302. The usage-based disturbance data 218 includes information that enables the usage-based disturbance circuitry 120 to mitigate usage-based disturbance. In an example implementation, the usage-based disturbance data 218 includes an activation count 308.
In this example, the first row 302-1 stores first normal data 306-1 within a first subset of memory cells of the first row 302-1 and stores first usage-based disturbance data 218-1 within a second subset of memory cells of the first row 302-1. The first usage-based disturbance data 218-1 includes a first activation count 308-1, which represents a quantity of times the first row 302-1 has been activated since a last refresh. As another example, the second row 302-2 stores second normal data 306-2 within a first subset of memory cells within the second row 302-2 and stores second usage-based disturbance data 218-2 within a second subset of memory cells within the second row 302-2. The second usage-based disturbance data 218-2 includes a second activation count 308-2, which represents a quantity of times the second row 302-2 has been activated since a last refresh. Additionally, the Rth row 302-R stores Rth normal data 306-R within a first subset of memory cells within the Rth row 302-R and stores Rth usage-based disturbance data 218-R within a second subset of memory cells within the Rth row 302-R. The Rth usage-based disturbance data 218-R includes an Rth activation count 308-R, which represents a quantity of times the Rth row 302-R has been activated since a last refresh.
The usage-based disturbance data 218 also includes information or is formatted (e.g., coded) in such a way as to support error detection. In this example, the usage-based disturbance data 218 includes a parity bit 310 to enable detection of a faulty activation count 308 using a parity check. For instance, the usage-based disturbance data 218-1, 218-2, and 218-R respectively includes parity bits 310-1, 310-2, and 310-R. Other implementations are also possible in which the usage-based disturbance data 218 is coded in a manner that supports any of the error detection tests described above, such as the error-correcting-code check. Although the techniques for logging a memory address associated with faulty usage-based disturbance data 218 are described with respect to parity-bit errors associated with the activation count 308, these techniques can generally be applied for logging addresses for any type of usage-based disturbance data 218 and any type of error detection associated with this data.
The memory module 402 can be implemented in various manners. For example, the memory module 402 may include a printed circuit board, and the multiple dies 404-1 through 404-D may be mounted or otherwise attached to the printed circuit board. The dies 404 (e.g., memory dies) may be arranged in a line or along two or more dimensions (e.g., forming a grid or array). The dies 404 may have a similar size or may have different sizes. Each die 404 may be similar to another die 404 or different in size, shape, data capacity, or control circuitries. The dies 404 may also be positioned on a single side or on multiple sides of the memory module 402.
One or more of the dies 404-1 to 404-D include the usage-based disturbance circuitry 120, the usage-based-disturbance data repair circuitry 122 (UBD DR circuitry 122), and bank groups 408-1 to 408-G, with G representing a positive integer. Each bank group 408 includes at least two banks 410, such as banks 410-1 to 410-B, with B representing a positive integer. In some implementations, the die 404 includes multiple instances of the usage-based disturbance circuitry 120, which mitigate usage-based disturbance across at least one of the banks 410. For example, multiple instances of the usage-based disturbance circuitry 120 can respectively mitigate usage-based disturbance across the bank groups 408-1 to 408-G. In this example, one instance of usage-based disturbance circuitry 120 mitigates usage-based disturbance across multiple banks 410-1 to 410-B of a bank group 408. In another example, multiple instances of the usage-based disturbance circuitry 120 can respectively mitigate usage-based disturbance for respective banks 410. In this case, each usage-based disturbance circuitry 120 mitigates usage-based disturbance for a single bank 410 within one of the bank groups 408-1 to 406-B. In yet another example, each usage-based disturbance circuitry 120 mitigates usage-based disturbance for a subset of the banks 410 associated with one of the bank groups 408-1 to 408-G, where the subset of the banks 410 includes at least two banks 410. The relationship between the banks 410-1 to 410-B and components of the usage-based disturbance data repair circuitry 122 are further described with respect to
Each detection circuit 124 can detect occurrence of a fault (or an error) associated with the usage-based disturbance data 218 stored within the corresponding bank 410. For example, the first detection circuit 124-1 can monitor for faults associated with the usage-based disturbance data 218 stored within the rows 302 of the first bank 410-1. Likewise, the second detection circuit 124-2 can monitor for faults associated with the usage-based disturbance data 218 stored within the rows 302 of the second bank 410-2.
The bank-shared circuitry 504 includes components that are associated with multiple banks 410. These components perform operations associated with multiple banks 410. Example components of the bank-shared circuitry 504 include the address logging circuit 126 and the engine 216 (if implemented). In this example, the usage-based disturbance circuitry 120 is also shown as part of the bank-shared circuitry 504. Alternatively, multiple instances of the usage-based disturbance circuitry 120 can be implemented as part of the bank-specific circuitry 502. In an example implementation, the address logging circuit 126 is positioned proximate to the engine 216.
On the die 404, the bank-specific circuitry 502 is positioned on two opposite sides of the bank-shared circuitry 504. Explained another way, the bank-shared circuitry 504 can be centrally positioned on the die 404. As such, the address logging circuit 126 can be positioned closer to a center of the die 404 compared to the edges of the die 404. Positioning the bank-shared circuitry 504 in the center enables routing between the bank-shared circuitry 504 and the bank-specific circuitry 502 to be simplified.
Consider a first axis 508-1 (e.g., X axis 508-1) and a second axis 508-2 (e.g., Y axis 508-2), which is perpendicular to the first axis 508-1. In
In the depicted configuration, the usage-based disturbance data repair circuitry 122 includes the detection circuits 124-1 to 124-B and the address logging circuit 126, which is coupled to the mode register 214. Although not explicitly shown in
The usage-based disturbance data repair circuitry 122 also includes an interface 602, which is coupled between the detection circuits 124-1 to 124-B and the address logging circuit 126. In general, the interface 602 provides a means for communication between a component at the local-bank level 128 (e.g., one of the detection circuits 124-1 to 124-B) and a component at the multi-bank level 130 (e.g., the address logging circuit 126). Various implementations of the interface 602 are further described with respect to
During operation, the detection circuits 124-1 to 124-B respectively generate control signals 604-1 to 604-B. The control signals 604-1 to 604-B at least indicate whether or not the respective detection circuits 124-1 to 124-B detect an occurrence of faulty usage-based disturbance data 218 within the corresponding banks 410-1 to 410-B.
The interface 602 generates a composite control signal 606 based on the control signals 604-1 to 604-B. The composite control signal 606 represents some combination of the local-bank address logging control signals 604-1 to 604-B. Using the composite control signal 606, the interface 602 can pass information provided by any one of the control signals 604-1 to 604-B to the address logging circuit 126.
The address logging circuit 126 can provide an address 608 and/or a fault detection flag 610 to the mode register 214 based on the composite control signal 606. The address 608 represents at least one of the addresses 304 for which the detection circuits 124-1 to 124-B determined is associated with the faulty usage-based disturbance data 218. The fault detection flag 610 indicates whether or not faulty usage-based disturbance data 218 has been detected. In one example implementation, the fault detection flag 610 represents a flag that is dedicated for detecting faults (or errors) associated with the usage-based disturbance data 218. In another example implementation, the fault detection flag 610 is implemented using another flag or signal that already exists within the memory device 108. For example, the fault detection flag 610 can be implemented using the reliability, availability, and serviceability (RAS) event signal or another alert signal. The fault detection flag 610 can also be referred to as an error flag, a parity flag, an activation count error flag, an activation count parity flag, and so forth.
The mode register 214 stores the address 608 and/or the fault detection flag 610. In some cases, the mode register 214 includes two registers that respectively store the address 608 and the fault detection flag 610. In another case, the mode register 214 includes one register that stores both the address 608 and the fault detection flag 610. The memory controller 114 can initiate one or more repair procedures based on the address 608 and/or the fault detection flag 610 stored by the mode register 214. In some implementations, the memory controller 114 can clear the fault detection flag 610 upon initiating a repair procedure. The usage-based disturbance data repair circuitry 122 can perform aspects of direct or indirect address logging, as further described with respect to
To communicate the address 608 from the local-bank level 128 to the multi-bank level 130, the interface 602 can be implemented using at least on internal bus 702 or at least one scan chain 704. The interface 602 can also include a conflict resolution circuit 706, which can resolve conflicts in which at least two detection circuits 124 detect an occurrence of faulty usage-based disturbance data 218 during a same time interval.
During operation, the usage-based disturbance circuitry 120 performs the array counter update procedure on an active row. As part of the array counter update procedure, the usage-based disturbance circuitry 120 or the detection circuits 124-1 to 124-B perform an error detection test to detect a fault associated with the usage-based disturbance data 218 (e.g., perform a parity check to detect a parity-bit failure associated with the activation count 308). If a fault is detected, the detection circuit 124 associated with the bank 410 in which the fault occurs determines the address 608 associated with the detected fault. For example, the detection circuit 124-1 determines that the address 608-1 is associated with the fault and/or the detection circuit 124-B determines that the address 608-B is associated with the fault. The detection circuits 124-1 to 124-B communicate the addresses 608-1 to 608-B to the address logging circuit 126 using the control signals 604-1 to 604-B.
While direct address logging 700 enables the address 608 associated with the faulty usage-based disturbance data 218 to be logged during the array counter update procedure and enables this address 608 to be stored in the mode register 214 with minimal delay, direct address logging 700 can increase a complexity and/or layout penalty associated with implementing the interface 602. This can increase the cost and/or size of the memory device 108. Alternatively, other implementations of the usage-based disturbance data repair circuitry 122 can perform indirect address logging, which is further described with respect to
In the depicted configuration, the address logging circuit 126 is coupled to the engine 216. Depending on the implementation, the detection circuits 124-1 to 124-B can be coupled to the usage-based disturbance circuitry 120, the engine 216, or both. Example implementations of the detection circuit 124 can include at least one fault detection circuit 802 and/or at lead one address comparator 804. The interface 602 can include at least one logic gate 806. The logic gate 806 can be implemented at the local-bank level 128 and generates the composite control signal 606 based on the control signals 604-1 to 604-B. The address logging circuit 126 can include at least one latch circuit 808, which can latch information provided by the engine 216 based on the composite control signal 606. Example implementations of the detection circuit 124, the interface 602, and the address logging circuit 126 are further described with respect to
During operation, the engine 216 performs operations on the rows 302 of the memory array 204. The engine 216 controls or determines the sequence in which the rows 302 are accessed. The address logging circuit 126 is coupled to the engine 216 and receives information about an address 810 that is accessed by the engine 216. The address logging circuity 126 can latch the address 810 at the multi-bank level 130 based on the composite control signal 606 indicating occurrence of a fault.
The detection circuits 124-1 to 124-B can determine the occurrence of the fault in different manners. In a first example implementation, the detection circuits 124-1 to 124-B perform the error detection test based on an occurrence of the engine 216 accessing the address 810. In this case, the error detection test is performed on rows 302 in a same order that the engine 216 accesses the rows 302. In a second example implementation, the error detection test is performed by the usage-based disturbance circuitry 120 or the detection circuits 124-1 to 124-B as part of or based on an occurrence of the array counter update procedure (or more generally a procedure that updates the usage-based disturbance data 218). The detection circuits 124-1 to 124-B store information associated with a detected fault and provide this information if the address 608 of the detected fault matches the address 810 that is accessed by the engine 216. The first example implementation of the detection circuits 124-1 to 124-B is further described with respect to
The detection circuits 124-1 to 124-B respectively include fault detection circuits 802-1 to 802-B. The fault detection circuits 802-1 to 802-B are coupled to the engine 216 and perform the error detection test to detect faulty usage-based disturbance data 218. A manner in which the error detection tests are performed across the rows 302, however, is dependent upon a manner in which the engine 216 accesses the rows 302, as further described below.
During operation, the engine 216 performs an operation at a particular row 302. The address 810 that is accessed by the engine 216 is provided to the detection circuits 124-1 to 124-B. If the address 810 is within a bank 410 that corresponds with the detection circuit 124, that detection circuit 124 performs the error detection test on the usage-based disturbance data 218 associated with the address 810. For example, the detection circuit 124 performs a parity check to evaluate a parity bit 310 associated with the activation count 308. If the address 810 is not within the bank 410 that corresponds with the detection circuit 124, that detection circuit 124 does not perform an error detection test.
If the detection circuit 124 determines that the usage-based disturbance data 218 associated with the address 810 is faulty, the detection circuit 124 indicates detection of this fault via the corresponding control signal 604. The interface 602 generates the composite control signal 606, which also indicates the detection of the fault. Based on the composite control signal 606 indicating detection of the fault, the latch circuit 808 latches the address 810 that is provided by the engine 216. The address logging circuit 126 provides the address 810 as the address 608 to the mode register 214 (not shown). In some cases, the address logging circuit 126 provides the composite control signal 606 as the fault detection flag 610.
In this example, the execution of the error detection test occurs during or after a time interval in which the engine 216 accesses the address 810. In this manner, the fault detection and address logging are synchronized across the local-bank level 128 and the multi-bank level 130 based on the address 810 that is accessed by the engine 216. In other implementations, the fault detection can occur before the engine 216 accesses the address 810, as further described with respect to
During operation, the usage-based disturbance circuitry 120 performs the array counter update procedure. As part of the array counter update procedure or based on the occurrence of the array counter update procedure, the usage-based disturbance circuitry 120 or the detection circuits 124-1 to 124-B perform the error detection test to detect faulty usage-based disturbance data 218. If faulty usage-based disturbance data 218 is detected, the address 608 of the faulty usage-based disturbance data 218 is stored within the content-addressable memory 1004 of the address comparator 804.
After the array counter update procedure is performed, the engine 216 accesses the address 810. The comparators 1002 of the address comparators 804-1 to 804-B compare the address 810 to the addresses 608-1 to 608-B stored in the content-addressable memory 1004. Consider an example in which the address 810 is the address 608-1 stored by the address comparator 804-1. In this case, the comparator 1002 of the detection circuit 124-1 determines that the address 810 matches the address 608-1, and generates the control signal 604-1 in a manner that indicates detection of faulty usage-based disturbance data 218. The interface 602 generates the composite control signal 606, which also indicates the detection of the fault. Based on the composite control signal 606 indicating detection of the fault, the latch circuit 808 latches the address 810 that is provided by the engine 216. The address logging circuit 126 provides the address 810 as the address 608 to the mode register 214 (not shown). In some cases, the address logging circuit 126 provides the composite control signal 606 as the fault detection flag 610.
In this example, the execution of the error detection test occurs before a time interval in which the engine 216 accesses the address 810. Although the fault detection and address logging can occur at different time intervals, reporting of the fault detection and address logging are synchronized across the local-bank level 128 and the multi-bank level 130 based on the address 810 that is accessed by the engine 216. In still other implementations, the detection circuits 124-1 to 124-B can include both the fault detection circuits 802 and the address comparators 804, as further described with respect to
This implementation of the detection circuits 124-1 to 124-B provides additional opportunities for the error detection tests to be executed, and therefore enables the usage-based disturbance data repair circuitry 122 to more quickly detect faulty usage-based disturbance data 218. For example, the fault detection circuits 802-1 to 802-B enable faulty usage-based disturbance data 218 to be detected based on an occurrence of the engine 216 accessing a row while the address comparator 804-1 to 804-B enables faulty usage-based disturbance data 218 to be detected based on an occurrence of an array counter update procedure. As seen in
This section describes example methods for implementing aspects of logging a memory address associated with faulty usage-based disturbance data with reference to the flow diagram of
At 1204, the row is accessed using an engine. For example, the engine 216 accesses the row 302. The engine 216 can access to the row and perform an operation on the normal data 306 that is stored within another subset of the memory cells of the row 302. In an example implementation, the engine 216 is implemented as an error check and scrub engine, which can detect errors within the normal data 306. In some implementations, the engine 216 does not directly perform operations associated with usage-based disturbance mitigation or does not perform operations on the usage-based disturbance data 218.
In general, the engine 216 is capable of accessing all of the rows 302 within the memory array 204. This enables the techniques associated with indirect address logging 800 to report the occurrence of faults associated with the usage-based disturbance data 218 in a controlled manner that avoids conflicts across multiple banks 410.
At 1206, an occurrence of a fault associated with the data stored within the row is detected at a local-bank level of the memory device. For example, the usage-based-disturbance data repair circuitry 122 detects, at the local-bank level, the occurrence of the fault associated with the usage-based disturbance data 218 that is stored within the row 302. In some implementations, the usage-based-disturbance data repair circuitry 122 can directly detect the fault by executing an error detection test at the local-bank level. The error detection test can be performed based on an occurrence of a procedure performed by the usage-based disturbance circuitry 120 to update the usage-based disturbance data 218 and/or based on an occurrence of the engine 216 accessing the row 302. In other implementations, the usage-based disturbance circuitry 120 can directly detect the fault by executing the error detection test and provide an indication to the usage-based-disturbance data repair circuitry 122 if the fault is detected.
At 1208, an address of the row is logged, at a multi-bank level of the memory device, based on the row being accessed by the engine and based on the detected occurrence of the fault. For example, the usage-based-disturbance data repair circuitry 122 logs, at the multi-bank level 130 of the memory device 108, the address 608 of the row 302 based on the row 302 being accessed by the engine 216 and based on the detected occurrence of the fault, which is reported from (or indicated by) the local-bank level 128 to the multi-bank level 130. In particular, the usage-based-disturbance data repair circuitry 122 can latch the address 810 that is accessed by the engine 216 based on the local-bank level 128 indicating occurrence of a fault that is associated with the address 810. The usage-based-disturbance data repair circuitry 122 can store the latched address 608 and/or the fault detection flag 610 in one or more mode registers 214 that can be accessed by the host device 104. With this information, the host device 104 can initiate a repair procedure that addresses the detected fault associated with the usage-based disturbance data 218 stored within the row 302.
For the figure described above, the order in which operations are shown and/or described are not intended to be construed as a limitation. Any number or combination of the described process operations can be combined or rearranged in any order to implement a given method or an alternative method. Operations may also be omitted from or added to the described methods. Further, described operations can be implemented in fully or partially overlapping manners.
Aspects of this method may be implemented in, for example, hardware (e.g., fixed-circuit circuitry or a processor in conjunction with a memory), firmware, software, or some combination thereof. The method may be realized using one or more of the apparatuses or components shown in
Computer-readable media includes both non-transitory computer storage media and communication media including any medium that facilitates transfer of a computer program (e.g., an application) or data from one entity to another. Non-transitory computer storage media can be any available medium accessible by a computer, such as RAM, ROM, Flash, EEPROM, optical media, and magnetic media.
In the following, various examples for implementing aspects of logging a memory address associated with faulty usage-based disturbance data are described:
Example 1: An apparatus comprising:
Example 2: The apparatus of example 1 or any other example, wherein the circuitry comprises:
Example 3: The apparatus of example 2 or any other example, wherein:
Example 4: The apparatus of example 1 or any other example, wherein the circuitry is configured to detect the occurrence of the fault prior to the engine accessing the row.
Example 5: The apparatus of example 4 or any other example, wherein:
Example 6: The apparatus of example 1 or any other example, wherein the circuitry is configured to detect the occurrence of the fault during or after the engine accesses the row.
Example 7: The apparatus of example 6 or any other example, wherein the circuitry is configured to perform, based on the engine accessing the row, an error detection test to detect the occurrence of the fault.
Example 8: The apparatus of example 1 or any other example, wherein:
Example 9: The apparatus of example 8 or any other example, wherein:
Example 10: The apparatus of example 1 or any other example, wherein the data associated with usage-based disturbance comprises an activation count that represents a quantity of times a corresponding row has been accessed since a last refresh.
Example 11: The apparatus of example 1 or any other example, wherein:
Example 12: The apparatus of example 1 or any other example, wherein:
Example 13: The apparatus of example 12 or any other example, wherein the engine comprises an error check and scrub engine configured to perform error detection on the other data.
Example 14: A method performed by a memory device, the method comprising:
Example 15: The method of example 14 or any other example, further comprising:
Example 16: The method of example 14 or any other example, further comprising:
Example 17: An apparatus comprising:
Example 18: The apparatus of example 17 or any other example, wherein the at least one first circuit is configured to:
Example 19: The apparatus of example 17 or any other example, wherein:
Example 20: The apparatus of example 17 or any other example, wherein:
Unless context dictates otherwise, use herein of the word “or” may be considered use of an “inclusive or,” or a term that permits inclusion or application of one or more items that are linked by the word “or” (e.g., a phrase “A or B” may be interpreted as permitting just “A,” as permitting just “B,” or as permitting both “A” and “B”). Also, as used herein, a phrase referring to “at least one of” a list of items refers to any combination of those items, including single members. For instance, “at least one of a, b, or c” can cover a, b, c, a-b, a-c, b-c, and a-b-c, as well as any combination with multiples of the same element (e.g., a-a, a-a-a, a-a-b, a-a-c, a-b-b, a-c-c, b-b, b-b-b, b-b-c, c-c, and c-c-c, or any other ordering of a, b, and c). Further, items represented in the accompanying figures and terms discussed herein may be indicative of one or more items or terms, and thus reference may be made interchangeably to single or plural forms of the items and terms in this written description.
Although aspects of logging a memory address associated with faulty usage-based disturbance data have been described in language specific to certain features and/or methods, the subject of the appended claims is not necessarily limited to the specific features or methods described. Rather, the specific features and methods are disclosed as a variety of example implementations of logging a memory address associated with faulty usage-based disturbance data.
This application claims the benefit of U.S. Provisional Patent Application Ser. No. 63/592,761, filed on Oct. 24, 2023, the disclosure of which is incorporated by reference herein in its entirety.
Number | Date | Country | |
---|---|---|---|
63592761 | Oct 2023 | US |