I. Technical Field
Embodiments described herein relate to semiconductor-based memory devices and systems.
II. Background Art
Devices, e.g., personal computers and servers, cellular and smart phones, PDAs, gaming consoles, home electronics and entertainment devices, etc., include memory(ies) such as random access memory (RAM) and static RAM (SRAM) for data storage. Prior solutions for SRAMs involve building a memory or memory system from individual instances or blocks of existing SRAM architectures, e.g., from memory block libraries. For example, one or more blocks of an existing 0.5 Mb to 2 Mb SRAM (e.g., up to 64 blocks or more) may be connected together to effectively create a larger memory system such as a 16k×1680 payload memory.
Yet, building a high-capacity, high-performance memory system using instances of SRAM blocks based on current memory architectures as building blocks is area inefficient. For instance,
Additionally, timing closure of a large memory system using large numbers of instances of SRAMs as building blocks is increasingly difficult due to system scale. The increasing size of internal memory arrays creates timing issues for routing signals in parallel both to and from each of the SRAM instances.
Methods, systems, and apparatuses are described for memories with feedthrough and retiming paths to support memory to memory requests, substantially as shown in and/or described herein in connection with at least one of the figures, as set forth more completely in the claims.
The accompanying drawings, which are incorporated herein and form a part of the specification, illustrate embodiments and, together with the description, further serve to explain the principles of the embodiments and to enable a person skilled in the pertinent art to make and use the embodiments.
Embodiments will now be described with reference to the accompanying drawings. In the drawings, like reference numbers indicate identical or functionally similar elements. Additionally, the left-most digit(s) of a reference number identifies the drawing in which the reference number first appears.
The present specification discloses numerous example embodiments. The scope of the present patent application is not limited to the disclosed embodiments, but also encompasses combinations of the disclosed embodiments, as well as modifications to the disclosed embodiments.
References in the specification to “one embodiment,” “an embodiment,” “an example embodiment,” etc., indicate that the embodiment described may include a particular feature, structure, or characteristic, but every embodiment may not necessarily include the particular feature, structure, or characteristic. Moreover, such phrases are not necessarily referring to the same embodiment. Further, when a particular feature, structure, or characteristic is described in connection with an embodiment, it is submitted that it is within the knowledge of one skilled in the art to affect such feature, structure, or characteristic in connection with other embodiments whether or not explicitly described.
In the discussion, unless otherwise stated, adjectives such as “substantially” and “about” modifying a condition or relationship characteristic of a feature or features of an embodiment of the disclosure, are understood to mean that the condition or characteristic is defined to be within tolerances that are acceptable for operation of the embodiment for an application for which it is intended.
Furthermore, it should be understood that spatial descriptions (e.g., “above,” “below,” “up,” “left,” “right,” “down,” “top,” “bottom,” “vertical,” “horizontal,” etc.) used herein are for purposes of illustration only, and that practical implementations of the structures described herein can be spatially arranged in any orientation or manner.
Still further, it should be noted that the drawings/figures are not drawn to scale unless otherwise noted herein.
Numerous exemplary embodiments are now described. Any section/subsection headings provided herein are not intended to be limiting. Embodiments are described throughout this document, and any type of embodiment may be included under any section/subsection. Furthermore, it is contemplated that the disclosed embodiments may be combined with each other in any manner. That is, the embodiments described herein are not mutually exclusive of each other and may be practiced and/or implemented alone, or in any combination.
The example techniques and embodiments described herein may be adapted to various types of systems and devices, such as but without limitation, communication devices (e.g., cellular and smart phones, etc.), communication systems and components (e.g., switches, routers, etc.), computers/computing devices, computing systems, electronic devices, gaming consoles, home electronics and entertainment devices, and/or the like, that use memory such as SRAM. While the embodiments herein may be described with respect to SRAM as conceptual and/or illustrative examples for descriptive consistency, other types of memory implementations are also contemplated by implementing the disclosed techniques.
The techniques described herein provide novel SRAM component instances (“blocks”) that contain muxes, ancillary logic, and pins placed so that contiguous instances of the SRAM components may be tiled together adjacently (e.g., no physical, external connections or logic therebetween) to form a memory system, thus minimizing any unused space around the memories and simplifying routing. The SRAM component instances may be configured with bypass select inputs to provide either a signal retiming function or a feedthrough function (i.e., data is pipelined in a retiming memory component or flows through a feedthrough memory component). In embodiments, stages of latency may be minimized by configuring more feedthrough memory components, and the operating system clock frequency may be maximized by configuring more retiming (i.e., pipeline) memory components.
The SRAM component instances may be organized into groups of ‘P’ feedthrough memory components per one retiming memory component. This ratio (‘P:1’) and the number of these groups may be configured based on the memory system requirements. For instance, a memory system operating with a relatively short clock period or cycle length, i.e., a higher clock frequency, may have fewer feedthrough memory components per retiming memory component (a lower ratio ‘P:1’) than a memory system having a relatively long clock period or cycle length, i.e., a slower clock frequency. As described herein, a bypass selector configuration input for memory component instances may be applied to configure individual memory components as being feedthrough or retimed (pipelined).
In embodiments, memory components included in a memory system may comprise one or more series of memory components, such as is described below with respect to
According to one or more embodiments described herein, all logic, including but not limited to muxes and ancillary logic, to support the memory component to memory component paths may be included within the memory component instances themselves. This allows for little or no logic and routing channels needed between the memory components for area savings. Accordingly, key timing paths, e.g., for memory data and memory transaction information, may also be configured as memory component to memory component dependent with little or no external buffering or spacing and no external registers to greatly simplify system level timing and allow for maximum system operating frequencies to be utilized. Input pins of signals for data and information that correspond to individual instances of memory components may be placed such that adjacent placement of memory components in a memory system is possible (e.g., pins at non-adjacent edges, pins connected to signal traces underneath memory components using vias, etc.).
As noted, ancillary logic (e.g., memory select logic, feedthrough buffering, register stages, etc.) and muxes required by the memory components are included within each of the memory components. In embodiments, a memory system described herein also includes a local selector, e.g., a mux or a 2:1 mux, that is configured to selectively output either: 1) read data stored in a given memory component for a valid read operation targeting the memory component, or 2) input read data that is received by the given memory component from either a previous memory component or a source external to the memory system for feedthrough by the memory component. By including a local selector such as 2:1 mux for selecting between local data to be read and received read data, large ‘L:1’ muxes, such as mux 106 shown
It is contemplated herein that in various embodiments and with respect to the illustrated figures of this disclosure, one or more components described and/or shown may not be included and that additional components may be included.
The techniques and embodiments described herein provide for improvements in memory systems as described above, including but not limited to SRAM memory systems.
For instance, methods, systems, devices, and apparatuses are provided for improved memory systems. A memory system in accordance with an example aspect is described. The memory system includes a first memory component that includes within, a first selection component, and a second memory component that includes within, a second selection component. The first memory component is configured to receive memory transaction information that is input to the memory system, as well as memory data, and provide to the second memory component the memory transaction information and, based on the first selection component, first memory data stored in the first memory component. The second memory component is configured to receive the memory transaction information and the first memory data from the first memory component, and provide the memory transaction information, as well as provide the first memory data or second memory data stored in the second memory component as an output of the second memory component based on the second selection component.
A memory system in accordance with another example aspect is also described. The memory system includes a first memory component that includes within, a first retiming register, and a second memory component that includes within, a second retiming register. The first memory component is configured to receive memory transaction information input to the memory system, and selectively provide to the second memory component the memory transaction information as a first buffered output, or as a first registered output from the first retiming register. The second memory component is configured to receive the memory transaction information from the first memory component, and selectively provide the memory transaction information as a second buffered output, or as a second registered output from the second retiming register.
A method performed in a memory system in accordance with yet another example aspect is described. The method includes receiving memory transaction information input to the memory system at a first memory component of the memory system, and selectively providing the memory transaction information from the first memory component to a second memory component of the system as a first buffered output or as a first registered output. The method also includes receiving the memory transaction information at the second memory component from the first memory component, and selectively providing the memory transaction information from the second memory component as a second buffered output or as a second registered output.
Various example embodiments are described in the following subsections. In particular, example memory system and memory component embodiments are described. This description is followed by example operational embodiments. Next, further example embodiments and advantages are described, and subsequently an example computer implementation is described. Finally, some concluding remarks are provided. It is noted that the division of the following description generally into subsections is provided for ease of illustration, and it is to be understood that any type of embodiment may be described in any subsection.
As noted above, memory systems and components, such as SRAM systems and SRAM blocks/components, may be configured in various ways to improve memory system layouts, routing, and timing. Techniques and embodiments are provided for implementation in and with devices and systems that utilize memories such as SRAMs. For instance, in embodiments, an SRAM system comprised of SRAM components according to the described techniques and embodiments may be implemented in devices and systems such as those enumerated herein.
In embodiments, by way of illustrative example and not limitation, a memory system comprises a plurality of memory components that communicate data and transaction information from memory component to memory component. A memory system may comprise two or more (e.g., up to ‘N’) memory components. These memory components are located adjacent to each other in a memory system, in embodiments, such that little or no logic or communication/data connections are included therebetween. For example, according the described techniques, ancillary logic (e.g., memory select logic, feedthrough buffering, register stages, etc.) and muxing required by the memory components are included within each of the memory components. Furthermore, the memory components are configured such that communication/data connections are passed from one memory component to the next, adjacent memory component. However, it should be noted that embodiments are contemplated herein which utilize various described techniques, such as but not limited to, memory component to memory component paths, feedthrough configurations, and/or retiming configurations, that allow for some logic and/or routing between memory components of a memory system and/or one or more series of adjacent memory components.
Systems and devices, such as memory systems and components, as well as SRAM systems and components, may be configured in various ways with feedthrough and retiming paths to support memory to memory requests according to the techniques and embodiments described herein.
A. Memory System Embodiments
Memory component 202, memory component 204, and memory component 206 are memory blocks that comprise memory system 200. Each memory component may include a storage array (shown in
Local logic 208, local logic 210, and local logic 212 are each local to (i.e., contained within) memory component 202, memory component 204, and memory component 206, respectively. As noted herein, each local logic component may include ancillary logic associated with its respective memory component, as well as muxes, additional logic, circuits, connections, etc., that are typical to memory components for standard operations. Further details for muxes, ancillary logic, local logic 208, local logic 210, and local logic 212 are described in the following subsection below. Local logic 208, local logic 210, and local logic 212 are each configured to receive memory data and memory transaction information via memory data and transaction information connector 214 from a memory component previous to their respective memory components. Local logic 208 is also configured to receive memory data and memory transaction information via memory data and transaction information connector 214 from a source 216 external to memory system 200 when memory component 202 is the first memory component of memory system 200. Local logic 208, local logic 210, and local logic 212 are each configured to provide memory data and memory transaction information via memory data and transaction information connector 214 to a memory component subsequent to their respective memory components. Local logic 212 is also configured to provide memory data and memory transaction information via memory data and transaction information connector 214 to a receiver 218 external to memory system 200 when memory component 206 is the last memory component of memory system 200. In embodiments, source 216 and receiver 218 may be separate components, may be the same component, or may reside in a single component and be two respective portions thereof
Local logic 208, local logic 210, and local logic 212 are each configured to register (i.e., retime/pipeline) or buffer (i.e., feedthrough via bypass) memory data and memory transaction information traversing their respective memory components based on a bypass indicator or bypass selector configuration input, in embodiments, as described below. Local logic 208, local logic 210, and local logic 212 are also each configured to provide data stored in their respective memory components to a subsequent memory component based on a valid memory read transaction, according to embodiments. Local logic 208, local logic 210, and local logic 212 are each further configured to provide memory data stored in a previous memory component and received therefrom to a subsequent memory component.
In the example embodiment shown in
In embodiments, memory data may comprise read data and/or write data, and memory transaction information may comprise read transaction information, write transaction information, and/or write data. It is contemplated herein that multiple memory transactions may be performed in a single clock cycles by including multiple instance of transaction information connector 214 in memory system 200.
Turning now to
Memory component 302, memory component 304, and memory component 306 may be similarly configured as the memory components of memory system 200 of
B. Memory Component Embodiments
As noted above, in embodiments, by way of illustrative example and not limitation, a memory system comprises a plurality of memory components (i.e., memory blocks) that communicate data and transaction information from memory component to memory component, and a memory system may comprise two or more (e.g., up to ‘N’) memory components. In embodiments, memory components are located adjacent to each other at respective edges such that no logic or communication/data connections are included therebetween. For example, according the described techniques, ancillary logic (e.g., memory select logic, feedthrough buffering, register stages, etc.) and muxing required by the memory components are included within each memory component. Furthermore, the memory components are configured such that transaction information and data are passed from one memory component to the next, adjacent memory component.
Memory component 402 includes a storage array 408, first selector control logic 410, and second selector control logic 412. Memory component 402 also includes a first retiming register 418, a second retiming register 422, a third retiming register 424, and a read output register 472. Memory component 402 also includes a first selector 416, a second selector 420, a third selector 426, a fourth selector 428, and a fifth selector 430. Memory component 402 may also include bypass logic 414, in some embodiments. First retiming register 418, second retiming register 422, third retiming register 424, first selector 416, second selector 420, third selector 426, fourth selector 428, fifth selector 430, first selector control logic 410, second selector control logic 412, and/or bypass logic 414 may collectively correspond to instances of local logic components of
It should be noted that various connections and connectors described herein may comprise one or more physical connections/connectors, e.g., as a multi-bit bus, or may be serial connections/connectors, as would be understood by one of skill in the relevant art(s) having the benefit of this disclosure. For clarity and ease of illustration, each connection/connector is only shown as a single line.
Memory component 402 is configured to receive a bank identification (ID) via bank ID input connector 464, read data via read data input connector 432, read transaction information via read request input connector 444, and write transaction information via write input connector 448. In embodiments, bank ID input connector 464 may comprise a portion of read request input connector 444 and write input connector 448. Memory component 402 is also configured to provide or output read data via a connector 442, read transaction information via a connector 454, and write transaction information via a connector 456.
Read data is received via read data input connector 432 by first selector 416 that also receives read data that is locally stored in storage array 408 via a connector 434 from read output register 472 through a connector 436, as shown. However, it should be noted that in some embodiments, read output register 472 may not be included in memory component 402. First selector 416 also receives a read data selector configuration input from first selector control logic 410 via a connector 466. First selector 416 provides its output to first retiming register 418 and to second selector 420 via a connector 440. First retiming register 418 provides its output to second selector 420 via a connector 438. Second selector 420 also receives a bypass selector configuration input via a connector 458. Second selector 420 provides its output via connector 442.
Read transaction information is received via read request input connector 444 by third selector 426 and second retiming register 422. Second retiming register 422 provides its output to third selector 426 and to fifth selector 430 via a connector 446. Third selector 426 also receives the bypass selector configuration input via connector 458. Third selector 426 provides its output via connector 454. Write transaction information is received via write input connector 448 by fourth selector 428 and third retiming register 424. Third retiming register 424 provides its output to fourth selector 428 and to fifth selector 430 via a connector 450. Fourth selector 428 also receives the bypass selector configuration input via connector 458. Fourth selector 428 provides its output via connector 456. Fifth selector 430 also receives a read/write transaction selector configuration input from second selector control logic 412 via a connector 468. Fifth selector 430 provides its output to storage array 408 via a connector 452.
One or more paths through memory component 402 for read data, read transaction information, and write transaction information may collectively comprise further embodiments of a portion of memory data and transaction information connector 214 of memory system 200 in
In embodiments, the bypass selector configuration input provided via connector 458 may be generated and provided by bypass logic 414. For example, bypass logic 414 may include an AND gate with an inverted input and a non-inverted input (or other equivalent logic). The inverted input may receive a design for test (DFT) bypass input on a connector 460, and the non-inverted input may receive a bypass select input on a connector 462. The output of bypass logic 414 is then provided as the bypass selector configuration input provided via connector 458. In other embodiments, the bypass selector configuration input may be provided via connector 458 from a component external to memory component 402 and bypass logic 414 may not be included in memory component 402. When the bypass select input is set to a first value (e.g., a logical ‘0’), memory component 402 is configured to re-time or pipeline data and information via internal registers, e.g., first retiming register 418, second retiming register 422, and third retiming register 424, before providing the data and information to a subsequent memory component (e.g., memory component 406) or to a receiver external to memory system 400. When the bypass select input is set to a second value (e.g., a logical ‘1’), memory component 402 is configured to bypass the internal registers (i.e., feedthrough) and only buffer data and information (i.e., with no re-timing) using, e.g., logical components such as second selector 420, third selector 426, and/or fourth selector 428, before providing the data and information. The value of the bypass select input may be static or dynamic according to embodiments, and may be based on a system clock frequency or period.
Each of first retiming register 418, second retiming register 422, third retiming register 424, and read output register 472 may be configured to receive a system clock input via a clock connector 470, although in embodiments each memory component of a memory system may have a respective clock input. The system clock and/or memory component clocks may be configured to be enabled/disabled according to first selector control logic 410 and/or second selector control logic 412. Furthermore, first selector control logic 410 and second selector control logic 412 may each receive the bank ID via bank ID input connector 464, write transaction information via write input connector 448, and/or read transaction information via read request input connector 444 as inputs (actual connections not shown for illustrative clarity). Each memory component of a memory system has a unique bank ID that may be static or dynamic according to embodiments.
Storage array 408 may be configured to store memory data, such as data to be written to storage array 408 (write data) and/or data that may be read from storage array 408 (read data). In embodiments, storage array 408 may comprise an array of memory storage elements, such as SRAM storage elements, as would be understood by one of skill in the relevant art(s) having the benefit of this disclosure. Storage array 408 may be of any storage size, such as by way of example and not limitation, 0.5 Mb to 2 Mb, although both larger and smaller storage sizes are contemplated for storage array 408. Storage array 408 may include an array input register 474 in embodiments that may be configured to receive the output of fifth selector 430 via connector 452, and to receive the system clock input via clock connector 470 to “clock in” data and/or information received by storage array 408. As noted herein, the output of fifth selector 430 on connector 452 comprises write transaction information (via connector 450) or read transaction information (via connector 446), and storage array 408 (including any additional respective control logic, not shown) may receive these outputs and, either store write data for a valid write request according to the write transaction information or read data out for a valid read request according to the read transaction information. Write transaction information may include a write request (e.g., a write valid bit(s), a write bank ID bit(s), a write address, etc.) and/or write data, and read transaction information may include a read request (e.g., a read valid bit(s), a read bank ID bit(s), a read address, etc.), as would be understood by one of skill in the relevant art(s) having the benefit of this disclosure. Write transaction information and read transaction information may include a “no operation” (NoOp) field/bit or values associated with a NoOp in existing fields/bits, according to embodiments.
In embodiments, first selector control logic 410 and second selector control logic 412 may be any combination of logical components, circuits, state machines, etc., configured to control the selection of inputs for first selector 416 and fifth selector 430 respectively. In some embodiments, first selector control logic 410 and second selector control logic 412 may collectively comprise a single component with multiple outputs. First selector control logic 410 and second selector control logic 412 may be configured to receive the bank ID via bank ID input connector 464, write transaction information via write input connector 448, and/or read transaction information via read request input connector 444 as inputs (actual connections not shown for illustrative clarity) and generate their respective outputs according to one or more of these inputs, in embodiments.
For instance, first selector control logic 410 may use one or more read valid bits and the read bank ID of the read transaction information, as well as the bank ID received via bank ID input connector 464 to output a value for read data selector configuration input signal on connector 466. For instance, if the read bank ID does not match the bank ID or if the read valid bit(s) do not indicate a valid memory read transaction, the value of the read data selector configuration input signal on connector 466 may be set to a first value that configures first selector 416 to select and output the read data received on read data input connector 432. If the read bank ID matches the bank ID and if the read valid bit(s) indicate(s) a valid memory read transaction, the value of the read data selector configuration input signal on connector 466 may be set to a second value that configures first selector 416 to select and output the read data that is stored in storage array 408 that is received via connector 434. In embodiments, signals, data, and/or information used by first selector control logic 410 to generate its output may be received by first selector control logic 410 in the same clock cycle as the inputs to first selector 416, or one or more system clock cycles ahead of these inputs.
Second selector control logic 412 may use one or more read valid bits and the read bank ID of the read transaction information, one or more write valid bits and the write bank ID of the write transaction information, and/or the bank ID received via bank ID input connector 464 to output a value for the read/write transaction selector configuration input from second selector control logic 412 via connector 468. For instance, if the read bank ID matches the bank ID and if the read valid bit(s) indicate a valid memory read transaction, the value of the read/write transaction selector configuration input signal on connector 468 may be set to a first value that configures fifth selector 430 to select and output the read transaction information received on connector 446. If the write bank ID matches the bank ID and if the write valid bit(s) indicate(s) a valid memory write transaction, the value of the read/write transaction selector configuration input signal on connector 468 may be set to a second value that configures fifth selector 430 to select and output the write transaction information received via connector 450. In embodiments, signals, data, and/or information used by second selector control logic 412 to generate its output may be received by second selector control logic 412 in the same clock cycle as the inputs to second retiming register 422 and/or third retiming register 424, or one or more system clock cycles ahead of these inputs. In embodiments, signals, data, and/or information used by second selector control logic 412 to generate its output may be received by second selector control logic 412 in the same clock cycle as the inputs to fifth selector 430, or one or more system clock cycles ahead of these inputs.
It should be noted that alternative approaches to determining if memory components are the intended recipient of memory transactions, e.g., other than matching bank IDs with memory transaction information, are contemplated herein, and the described examples above are not considered to be limiting.
In embodiments, first selector control logic 410 and second selector control logic 412 may also be configured to provide a NoOp configuration value (not shown) for configuring first selector 416 and fifth selector 430, respectively, if a NoOp indication is present in the read transaction information and the write transaction information.
First retiming register 418, second retiming register 422, third retiming register 424, and/or read output register 472 may be any type of flip-flops or banks/arrays thereof, registers or banks/arrays thereof, and/or any other logic component/circuit configured to hold or register data and operate according to a clock signal (such as latches, etc.). These registers may be of the same type or of different types in embodiments. As described above, each of first retiming register 418, second retiming register 422, third retiming register 424, and read output register 472 receives a respective data input and a system clock input. First retiming register 418, second retiming register 422, and third retiming register 424 are configured to register or “clock” (i.e., pipeline) data on their respective data paths for system timing purposes.
For example, first retiming register 418 is configured to “clock” or register read data received from first selector 416 on connector 440 before such read data is provided (via second selector 420, as shown in
Second retiming register 422 is configured to “clock in” or register read transaction information received via read request input connector 444 from either of a previous memory component (e.g., memory component 404, as shown in
Third retiming register 424 is configured to “clock in” or register write transaction information received via write input connector 448 from either of a previous memory component (e.g., memory component 404, as shown in
Read output register 472 is configured to register data stored in storage array 408 that is to be output by storage array 408 to first selector 416 in response to a valid read request.
First selector 416, second selector 420, third selector 426, fourth selector 428, and/or fifth selector 430 may be any type of mux or banks/arrays thereof and/or any other logic component/circuit configured to make a selection from inputs according to a configuration input signal. These selectors may be of the same type or of different types in embodiments. As described above, each of first selector 416, second selector 420, third selector 426, fourth selector 428, and/or fifth selector 430 receives respective data inputs and a configuration input.
First selector 416 is configured to select between read data that is locally stored in and read out from storage array 408 via connector 434, and input read data received on read data input connector 432, to be provided as read data output from memory component 402 via connector 442 according to the read data selector configuration input from first selector control logic 410 via connector 466. The read data received on read data input connector 432 may include read data stored in a previous memory component (e.g., memory component 404, as shown in
In embodiments, first selector 416 may be a 2:1 mux that is configured to function as a local point-to-point mux between memory component 402 and subsequent memory component 406. Each memory component of memory system 400 may include a respective first selector 416 that is configured to function as a point-to-point mux between a given memory component and its subsequent memory component. In this way, according to the embodiments and techniques herein, a memory system, e.g., memory system 400 of
Second selector 420, third selector 426, and fourth selector 428 are configured to select between respective registered (pipelined) inputs and respective bypass (feedthrough) inputs according to bypass selector configuration input via connector 458. When registered inputs are selected, the data/information received is said to be “retimed,” and when non-registered inputs are selected, the data/information received is said to be “bypassed.” For example, second selector 420 is configured to receive registered or retimed read data from first retiming register 418 and to receive bypassed (i.e., non-registered) read data from first selector 416. Third selector 426 is configured to receive registered or retimed read transaction information from second retiming register 422 and to receive bypassed (i.e., non-registered) read transaction information via read request input connector 444. Fourth selector 428 is configured to receive registered or retimed write transaction information from third retiming register 424 and to receive bypassed (i.e., non-registered) write transaction information via write input connector 448.
Fifth selector 430 is configured to select between read transaction information on connector 446 and write transaction information on connector 450 to be provided to storage array 408 via connector 452 according to the read/write transaction selector configuration input received from second selector control logic 412 via connector 468. For example, when provided with a first value for the read/write transaction selector configuration input from second selector control logic 412 via connector 468, read transaction information is provided to storage array 408, and when provided with a second value for the read/write transaction selector configuration input from second selector control logic 412 via connector 468, write transaction information is provided to storage array 408. Embodiments are also contemplated where storage array 408 may be other than a single-port storage array as illustrated in
In embodiments, either of the read data selector configuration input from first selector control logic 410 via connector 466 or the read/write transaction selector configuration input from second selector control logic 412 via connector 468 may indicate a NoOp and respectively configure first selector 416 and fifth selector 430 to provide NoOp output values. A NoOp output may comprise default data (e.g., all 1's, all 0's, or a combination thereof), may be no data (e.g., a high impedance value ‘z’, a floating or un-driven value ‘x’, etc.), or any other data value.
It is also contemplated that embodiments may include multiple instances of read request input connector 444 and/or write input connector 448 (request interfaces), and in such embodiments additional corresponding selector and retiming components, as similarly described above with respect to read request input connector 444 and write input connector 448, may also be included. In embodiments where the number of request interfaces is greater than the number of access requests per clock cycle supported by the storage array (e.g., storage array 408), additional logic (that may be external logic, not shown) may be included to ensure that nor more than the supported number of access requests per clock cycle are directed the memory (e.g., memory system 400).
Turning now to
Memory transaction information input to the memory system is received at a first memory component of the memory system (502). For instance, memory transaction information such as read transaction information, read data, and/or write transaction information, as described herein, may be received at memory component 402 of memory system 400 in
The memory transaction information is selectively provided from the first memory component to a second memory component of the system as a first buffered output or as a first registered output (504). For example, second selector 420 of memory component 402 in
The memory transaction information is received at the second memory component from the first memory component (506). For instance, as shown in
The memory transaction information is selectively provided from the second memory component as a second buffered output or as a second registered output (508). In embodiments, memory component 406 may selectively provide the memory transaction information in a similar or identical manner as described above for memory component 402 in 504. The memory transaction information provided by memory component 406 may be provided to a further subsequent memory component of memory system 400, or to a receiver external to the memory system in embodiments where memory component 406 is the last memory component of memory system 400. In embodiments where memory component 406 is the last memory component, memory component 406 may be the only memory component of memory system 400 to provide memory transaction information to the external receiver.
In some example embodiments, one or more of operations 502, 504, 506, and/or 508 of flowchart 500 may not be performed. Moreover, operations in addition to or in lieu of operations 502, 504, 506, and/or 508 may be performed. Further, in some example embodiments, one or more of operations 502, 504, 506, and/or 508 may be performed out of order, in an alternate sequence, or partially (or completely) concurrently with each other or with other operations.
Turning now to
Memory data from the first memory component is selectively provided to the second memory component as a second buffered output or as a second registered output (602). For instance, memory data may comprise read data, i.e., data that is requested to be read from a memory component of memory system 400 of
The memory data stored in the first memory component is received by the second memory component, the memory data being read from the first memory component according a read operation associated with the memory transaction information (604). For example, as shown in
The memory data is provided by the second memory component as an output of the second memory component (606). For instance, memory component 406 may provide the memory data (e.g., the received read data) in a similar or identical manner as described herein for memory component 402. The read data provided by memory component 406 may be read data that was stored in storage array 408 of memory component 402, or that was stored in a memory component before memory component 402. The memory data provided by memory component 406 may be provided to a further subsequent memory component of memory system 400, or to a receiver external to the memory system in embodiments where memory component 406 is the last memory component of memory system 400. In embodiments where memory component 406 is the last memory component, memory component 406 may be the only memory component of memory system 400 to provide memory data to the external receiver.
In some example embodiments, one or more of operations 602, 604, and/or 606 of flowchart 600 may not be performed. Moreover, operations in addition to or in lieu of operations 602, 604, and/or 606 may be performed. Further, in some example embodiments, one or more of operations 602, 604, and/or 606 may be performed out of order, in an alternate sequence, or partially (or completely) concurrently with each other or with other operations.
Turning now to
Write data is stored at the second memory component, where memory transaction information comprises write data and a write request to the second memory component (702). For example, memory transaction information, as described herein, may comprise write data and write transaction information received by memory component 402 of
In some example embodiments, operation 702 of flowchart 700 may not be performed. Moreover, operations in addition to or in lieu of operation 702 may be performed. Further, in some example embodiments, operation 702 may be performed partially (or completely) concurrently with other operations described herein.
As noted above, the techniques and embodiments described herein provide for memories with feedthrough and retiming paths to support memory to memory requests. In
Timing diagram 800 includes four sections: a clock section 802, a setup section 804, a bypass select enabled section 806, and a bypass select disabled section 808. Information and data in these sections of timing diagram 800 are conceptually represented as binary data “010 . . . ” for illustrative purposes only, and these conceptual representations are not intended to be limiting or to represent actual data or information. Additionally, signals in timing diagram 800 are illustrated with respect to the arrival of the signals, but it should be understood that the signals may remain valid for one or more clock periods after arrival. Furthermore, some signals may be omitted in certain embodiments, such as the register clock enable signals described below (for embodiments in which retiming registers are not disabled). Still further, the timing shown in timing diagram 800 is not to be considered limiting, but rather is an illustrative example that demonstrates the techniques and embodiments described herein. That is, embodiments described herein may be configured such that the signals shown in timing diagram 800, such as but not limited to, read transaction information signals and write transaction information signals, may arrive during other clock periods than as shown.
Clock section 802 shows a clock signal and clock periods 0-5 for reference with the remaining sections of timing diagram 800. The clock signal may be a system clock signal input, e.g., via a clock connector 470, or a per memory component clock signal as described herein. The clock signal may have any operating frequency and corresponding period or cycle according to design implementations for memory systems as described herein.
Setup section 804 corresponds to the receipt of signals for a memory transaction by a memory component of a memory system, such as memory component 402 of memory system 400 in
In clock period ‘1’, a read address signal (i.e., read transaction information) is received, for example via read request input connector 444, and a register clock enable signal may be provided to a register that receives read transaction information (e.g., second retiming register 422 of memory component 402). This allows second retiming register 422 an initial setup time before the read transaction information is registered and clocked out to storage array 408. It should be noted that according to this description, second retiming register 422 is configured as an input register for storage array 408 in addition to being a retiming register. The read address signal may also be received by first selector control logic 410 and second selector control logic 412 as described herein, according to embodiments.
In clock period ‘2’, storage array 408 may receive the read transaction information (i.e., a read request) from second retiming register 422 via connector 446, fifth selector 430, and connector 452 shown in
In clock period ‘3’, storage array 408 may read local data stored therein that is associated with the read address of the read transaction information, and then provide the stored data (local read data) to read output register 472. A register clock enable signal may also be provided to read output register 472 in clock period ‘3’. This allows read output register 472 an initial setup time before the local read data is registered and clocked out.
In clock period ‘4’, the local read data is registered and clocked out by read output register 472, and is provided to first selector 416 via connector 434. Additionally, first selector 416 receives memory data (i.e., read data) that is provided to memory component 402 via read data input connector 432.
Accordingly, the signals described in setup section 804 are received and setup for being provided to a subsequent memory component (e.g., memory component 406) or to a receiver that is external to memory system 400 (e.g., receiver 218 of
Bypass select enabled section 806 corresponds to timing for outputs of a memory transaction by a memory component of a memory system (such as a read transaction for memory component 402 of memory system 400 in
Accordingly, when configured for feedthrough and retiming registers are bypassed, the read data that is output by memory component 402 is available after four clock periods from the initial receipt of the first portion of the read transaction information (i.e., a four pipeline stage read transaction).
Bypass select disabled section 808 corresponds to timing for outputs of a memory transaction by a memory component of a memory system (such as memory component 402 of memory system 400 in
Accordingly, when configured for retiming/pipelining and retiming registers are utilized and not bypassed, the read data that is output by memory component 402 is available after five clock periods from the initial receipt of the first portion of the read transaction information (i.e., a five pipeline stage read transaction).
As noted above, the selection of memory components, such as memory component 402, as being either a bypass memory component (as in bypass select enabled section 806) or a pipeline/registered memory component (as in bypass select disabled section 808) may be dependent on the operating clock period (i.e., cycle time) of the memory system. For example, according to embodiments, signals such as memory data and memory transaction information described herein may be required to traverse a path between two registers in an amount of time that is less than the clock cycle time. This allows such signals to “clocked out” and “clocked in” by registers within the clock cycle time to meet timing constraints, as would become apparent to one of skill in the relevant art(s) having the benefit of this disclosure. Given a clock cycle time, these signals may be able to traverse one or more (‘P’) feedthrough memory components before being registered in a retiming memory component as described herein.
In the disclosed embodiments by way of example, several timing paths may be considered in determining the ratio (‘P:1’) of feedthrough memory components per pipeline memory component. These timing paths may include, but are not limited to, a feedthrough path, a registered feedthrough path, a storage array output register path, and an input access register path.
The feedthrough path is active when the bypass function of a memory component is active, and therefore read transaction information signals and write transaction information signals are not registered outputs (but are buffered outputs). The feedthrough path comprises logical component delay from input to output of a memory component (e.g., the path comprising read request input connector 444 and connector 454 via third selector 426).
The registered feedthrough path is active when the bypass function of a memory component is inactive, and therefore signals for locally stored read data being read from the memory component, read transaction information signals, and write transaction information signals are registered outputs. The registered feedthrough path comprises register to output access time (e.g., from first retiming register 418, from second retiming register 422, or from third retiming register 424, or in some embodiments, the longest timing path of these three).
The storage array output register path is active when the bypass function of a memory component is active, and therefore signals for locally stored read data being read from the memory component are not registered subsequent to being clocked out from a storage array as outputs. The storage array output register path comprises output access time from the storage array output register (e.g., read output register 472 of
The input access register path comprises input setup time for read transaction information signals and write transaction information signals that are registered as inputs (e.g., respectively by second retiming register 422 and third retiming register 424 of
Consider an illustrative example memory system having a ratio ‘P:1’, where ‘P’ equals 2, and a given clock cycle time. Such an example memory system requires that the sum of time for the storage array output register path, the feedthrough path, and the input access register path be less than or equal to the clock cycle time; and that the sum of time for the registered feedthrough path, two times the feedthrough path, and the input access register path be less than or equal to the clock cycle time. However, as clock cycle times may vary by memory system, higher or lower ratios ‘P:1’, where P is greater than zero, may be configured according to following equations:
Path 3+((P−2)×Path 1)+Path 4≦clock cycle time, (Equation 1)
and
Path 2+((P−1)×Path 1)+Path 4≦clock cycle time, (Equation 2)
where Path 1 is the feedthrough path, Path 2 is the registered feedthrough path, Path 3 is the storage array output register path, and Path 4 is the input access register path
It should be noted that a ratio ‘P:1’ where ‘P’ equals 0 is also contemplated herein. In such embodiments, each memory component of a memory system would be a retiming/pipeline memory component. For example, with respect to memory system 400 of
As noted above, systems and devices, including memory systems and components, may be configured in various ways with feedthrough and retiming paths to support memory to memory requests according to the techniques and embodiments provided. For instance, in embodiments, memory components of a memory system may provide memory data and memory transaction information from memory component to memory component.
Additionally, memory components of a memory system may be adjacently placed with no physical connectors, muxing, and/or ancillary logic therebetween. Such configurations allow for memory system circuit advantages of up to approximately 16-18% or more improvements in the density of memory systems, or alternatively, the inclusion of up to approximately 16-18% or more additional memory components in a given memory system area.
From a system level perspective, the described techniques and embodiments allow for additional memory depth not previously available in existing solutions. Products in which memory systems configured according to the described techniques and embodiments are utilized may realize cost savings due to reduced die area and improved physical design and integration considerations, such as but not limited to, routing, placement, less resource intensive memory system timing closure, etc. Additionally, product performance improvements due to additional memory available may also be realized.
The described techniques and embodiments may be utilized in any large memory systems, and the advantages described herein become even more beneficial as semiconductor devices continue to use increasingly larger and larger memory systems. The described techniques and embodiments provide value and performance benefits to products and industries that drive for increasing amounts of memory capacity. However, any chip architecture using fairly large memories and/or multi-read memories can benefit from the density and memory capacity advantages provided herein.
The further example embodiments and advantages described in this Section may be applicable to embodiments disclosed in any other Section of this disclosure.
Embodiments and techniques, including methods, described herein may be performed in various ways such as, but not limited to, being implemented by hardware, software, firmware, and/or any combination thereof. For example, embodiments may be implemented as memory systems, such as SRAM systems, specifically customized hardware, ASICs, electrical circuitry, and/or the like.
Memory system 200 of
The embodiments described herein, including circuitry, devices, systems, methods/processes, and/or apparatuses, may be implemented in or using well known processing devices, communication systems, servers, and/or, computers, such as a processing device 900 shown in
Processing device 900 can be any commercially available and well known communication device, processing device, and/or computer capable of performing the functions described herein, such as devices/computers available from International Business Machines®, Apple®, Sun®, HP®, Dell®, Cray®, Samsung®, Nokia®, etc. Processing device 900 may be any type of computer, including a desktop computer, a server, etc., and may be a computing device or system within another device or system.
Processing device 900 includes one or more processors (also called central processing units, or CPUs), such as a processor 906. Processor 906 is connected to a communication infrastructure 902, such as a communication bus. In some embodiments, processor 906 can simultaneously operate multiple computing threads, and in some embodiments, processor 906 may comprise one or more processors.
Processing device 900 also includes a primary or main memory 908, such as random access memory (RAM). Main memory 908 has stored therein control logic 924 (computer software), and data.
Processing device 900 also includes one or more secondary storage devices 910. Secondary storage devices 910 include, for example, a hard disk drive 912 and/or a removable storage device or drive 914, as well as other types of storage devices, such as memory cards and memory sticks. For instance, processing device 900 may include an industry standard interface, such a universal serial bus (USB) interface for interfacing with devices such as a memory stick. Removable storage drive 914 represents a floppy disk drive, a magnetic tape drive, a compact disk drive, an optical storage device, tape backup, etc.
Removable storage drive 914 interacts with a removable storage unit 916. Removable storage unit 916 includes a computer useable or readable storage medium 918 having stored therein computer software 926 (control logic) and/or data. Removable storage unit 916 represents a floppy disk, magnetic tape, compact disk, DVD, optical storage disk, or any other computer data storage device. Removable storage drive 914 reads from and/or writes to removable storage unit 916 in a well-known manner.
Processing device 900 also includes input/output/display devices 904, such as touchscreens, LED and LCD displays, monitors, keyboards, pointing devices, etc.
Processing device 900 further includes a communication or network interface 920. Communication interface 920 enables processing device 900 to communicate with remote devices. For example, communication interface 920 allows processing device 900 to communicate over communication networks or mediums 922 (representing a form of a computer useable or readable medium), such as LANs, WANs, the Internet, etc. Network interface 920 may interface with remote sites or networks via wired or wireless connections.
Control logic 928 may be transmitted to and from processing device 900 via the communication medium 922.
Any apparatus or manufacture comprising a computer useable or readable medium having control logic (software) stored therein is referred to herein as a computer program product or program storage device. This includes, but is not limited to, processing device 900, main memory 908, secondary storage devices 910, and removable storage unit 916. Such computer program products, having control logic stored therein that, when executed by one or more data processing devices, cause such data processing devices to operate as described herein, represent embodiments.
Techniques, including methods, and embodiments described herein may be implemented by hardware (digital and/or analog) or a combination of hardware with one or both of software and/or firmware. Techniques described herein may be implemented by one or more components. Embodiments may comprise computer program products comprising logic (e.g., in the form of program code or software as well as firmware) stored on any computer useable medium, which may be integrated in or separate from other components. Such program code, when executed by one or more processor circuits, causes a device to operate as described herein. Devices in which embodiments may be implemented may include storage, such as storage drives, memory devices, and further types of physical hardware computer-readable storage media. Examples of such computer-readable storage media include, a hard disk, a removable magnetic disk, a removable optical disk, flash memory cards, digital video disks, random access memories (RAMs), read only memories (ROM), and other types of physical hardware storage media. In greater detail, examples of such computer-readable storage media include, but are not limited to, a hard disk associated with a hard disk drive, a removable magnetic disk, a removable optical disk (e.g., CDROMs, DVDs, etc.), zip disks, tapes, magnetic storage devices, MEMS (micro-electromechanical systems) storage, nanotechnology-based storage devices, flash memory cards, digital video discs, RAM devices, ROM devices, and further types of physical hardware storage media. Such computer-readable storage media may, for example, store computer program logic, e.g., program modules, comprising computer executable instructions that, when executed by one or more processor circuits, provide and/or maintain one or more aspects of functionality described herein with reference to the figures, as well as any and all components, capabilities, and functions therein and/or further embodiments described herein.
Such computer-readable storage media are distinguished from and non-overlapping with communication media (do not include communication media). Communication media embodies computer-readable instructions, data structures, program modules or other data in a modulated data signal such as a carrier wave. The term “modulated data signal” means a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, communication media includes wireless media such as acoustic, RF, infrared and other wireless media, as well as wired media and signals transmitted over wired media. Embodiments are also directed to such communication media.
The techniques and embodiments described herein may be implemented as, or in, various types of devices. For instance, embodiments may be included, without limitation, in processing devices (e.g., illustrated in
While various embodiments have been described above, it should be understood that they have been presented by way of example only, and not limitation. It will be apparent to persons skilled in the relevant art that various changes in form and detail can be made therein without departing from the spirit and scope of the embodiments. Thus, the breadth and scope of the embodiments should not be limited by any of the above-described exemplary embodiments, but should be defined only in accordance with the following claims and their equivalents.
This application claims the benefit of U.S. Provisional Application No. 62/195,587, filed on Jul. 22, 2015, the entirety of which is incorporated by reference herein.
Number | Date | Country | |
---|---|---|---|
62195587 | Jul 2015 | US |