The present invention relates generally to integrated circuit memory devices and, more particularly, to a write/read priority blocking scheme using a parallel static address decode path.
Generally, electronic circuits have significant data storage capacities. Such capacities may be achieved with large memories formed of several memory blocks for physical or logical reasons. For example, such memories may include SRAM (Static Random-Access Memory) or DRAM (Dynamic Access Memory). A memory controller enables the other functions of the electronic circuit to view all the memory blocks as a single memory, in terms of address.
In one implementation, memory blocks may have a single-port architecture. In other words, as seen from the other electronic circuit functions, a single-port block can only perform one read operation or one write operation at a time. This memory block architecture avoids the need for overly complex memory architectures or architectures consuming too much circuit surface area. On the other hand, it may sometimes be desirable for some functions of the electronic circuit to simultaneously perform a read operation and a write operation. In this case, other architectures, such as two-port, dual-port, and multiple port cells have also become popular.
For memory architectures performing address decoding that are presented with both read and write addresses at the same bank of logical entries, a banking function can be performed, which enables simultaneous read and write operations to different memory banks. In the case of multiport arrays, simultaneous reads to any logical entry can be achieved. However, in the case of a bank conflict where a request is made for simultaneous read and write to the same logical bank, a decision must be made to determine whether the access will be a read or a write. This decision can be made external to the memory array, in which case the desired address is simply sent to the memory without a conflict; however, the memory array could also perform this function with the appropriate logic if it is known whether reading or writing takes priority.
In the case of a write access taking priority over a read access, the write address can be used to prevent a full read address decode when the read and write addresses are the same. One approach would be to use the actual write decode in the critical decode path for the address blocking; however, this would introduce timing complexity in the critical path, specifically loading down the partially decoded write MSB signals with additional wire and device load. As a result, the write decode path timing would start to differ from that of the read decode path, and such a difference would worsen across process variations.
In an exemplary embodiment, a write block read apparatus for a memory device includes a dynamic read address decoder that receives static read address bits as inputs thereto and having an output used to implement a read operation of a memory location corresponding to the read address bits; a dynamic write address decoder that receives static write address bits as inputs thereto and having an output used to implement a write operation of a memory location corresponding to the write address bits; and a static write address decoder, configured in parallel with the dynamic write address decoder, the static write address decoder configured to receive a portion of the static write address bits as inputs thereto, and wherein the static write address decoder is coupled to the dynamic read address decoder so as to block the read operation upon an address conflict with the write operation.
Referring to the exemplary drawings wherein like elements are numbered alike in the several Figures:
As indicated above, in the case of a write access taking priority over a read access, the write address can be used to prevent a full read address decode when the read and write addresses are the same. Such a scheme is utilized in embodiments disclosed herein, where a version of the write decode is used to block the read address from fully decoding. That is, embodiments herein disclose utilizing a replicated, partially decoded version of the write address to block the read address from completing its decode when the write and read addresses access the same logical bank. Unlike the above described approach of using the actual write decode in the critical decode path, the present scheme maintains the existing dynamic decode hierarchy for memory access, thus generating predictable timing behavior in the critical path.
Referring initially to
The memory architecture 100 supports a simultaneous 2-port read to any 2 addresses within a macro half (i.e., 256 entries), as well as a simultaneous write to 1 bank and 2 reads to different banks. As further illustrated in
Wordline (WL) address decoders 104 are configured to select a wordline (WL_T, WL_C) for a given memory cell 106 by activating the associated access transistor. The resulting wordline is chosen by multiplexing the reads and write MSB/LSB address signals (r0_lsb, r0_msb, wrt_lsb, wrt_msb, r1_lsb, r1_msb) from the output of the first and second level address decoders 102. Additional circuitry associated with the memory architecture 100, such as precharging devices and bitline (data) read/write devices are not described in further detail herein, as they are known to one skilled in the art.
An address conflict for this architecture occurs when the macro receives both write and read addresses to the same bank. That is, if the same bank is selected for both reading and writing, then a conflict will result on the local bitline (LBL_T/LBL_C) such that the read address enables the memory cell content to be read onto the bitline while the write address enables the write data from the bitline to be written to the cell. Such a conflict situation is exemplified by the “X” shown on LBL_T of
Referring now to
As shown in
In operation, the write block read apparatus 200 has priority to block a read operation when the write address MSB are equal to the read address MSB (i.e., when the write and read addresses correspond to the same bank. Moreover, through the use of the parallel static write address decoder 219, the dynamic write address decoder 213 remains dynamic to track with the dynamic read address decoder 211. As described in further detail below, an additional feature of the present embodiments is the use of a Vdd (logic) voltage level signal to block a Vcs (SRAM) voltage level signal. That is, two separate voltage domain levels are used in the present approach, wherein an exemplary logic rail voltage level may be Vdd=1.0 volts, and an exemplary memory rail voltage level may be Vcs=1.1 volts.
In further detail now,
Referring now to
The devices described thus far in the dynamic AND gate 400 operate with the aforementioned memory rail voltage level Vcs. In addition, however, the static write block read_B signal (having a logic high value at the logic rail voltage level of Vdd) is coupled to an NFET 418 configured between the dotted node and NFET 404. In a “normal” or non-blocking mode of operation, the static write block read_B signal is maintained at an inactive or logic high level (Vdd). In other words, NFET 418 is rendered conductive in a non-blocking mode of operation such that the dynamic AND gate 400 acts as a conventional AND gate. For example, when the value of dynamic read first level predecode signal A is “0” (regardless of the value of dynamic read first level predecode signal B), PFET 402 is conductive and NFET 404 is non-conductive, thus charging node 406 to Vcs. As a result, the inverter stage 408/410 generates a “0” on the output of the gate.
Continuing with the non-blocking mode of operation, if the value of dynamic read first level predecode signal A is “1”, then PFET 402 is non-conductive and NFET 404 is conductive, which may allow node 406 to be discharged from Vcs, depending upon the value of dynamic read first level predecode signal B. If dynamic read first level predecode signal B is “0”, then PFET 414 is conductive and NFET 416 is non-conductive. Thus, there is no path to ground for the node 406 to be discharged, and it is maintained at Vcs by PFET 414 and keeper PFET 412. On the other hand, if both dynamic read first level predecode signals A and B are “1” (again assuming a non-blocking mode), then each of the NFETs 404, 418 and 416 are conductive, providing a discharge path to ground for node 406. The relative strength of the NFET devices is sufficient to overcome the keeper PFET 412, which results in node 406 being discharged to ground and a “1” being generated on the output of the gate 400.
The particular placement of the NFET 418 in the pull-down stack between NFET 404 and NFET 416 allows an active low signal to render NFET 418 non-conductive, which cuts off a discharge path for node 406. Thus, even if both dynamic read first level predecode signals A and B are “1”, if the static write block read_B signal is active low (i.e., a write block read condition), the node is maintained at Vcs by PFET 412, meaning that the dynamic read MSB signal is blocked (i.e., kept low). Again, as stated above this blocking signal is statically decoded, at a Vdd voltage level.
As will thus be appreciated, the above described embodiments implement a write block read addressing scheme internally to the macro but without using the actual write decode in the critical decode path for the address blocking. This results in avoiding the introduction of timing of complexity in the critical path, by specifically loading down the partially decoded write MSB signals with additional wire and device load. With respect to physical overhead M4 metal level wire tracks may be used to deliver long static decode signals to second level read circuits. The static write blocking signal must encapsulates the dynamic read first level decode signals arriving at the read second level decoder, meeting setup and hold requirements.
Further, the introduction of additional devices/static signals in the read decode stack enables the blocking function, with a separate static path enabling custom tuning to meet setup/hold requirements. The blocking function blocks a Vcs (SRAM voltage level) based dynamic decode, using a Vdd (logic voltage) based signal. Through the use of parallel write static and decode paths, the write static decode path maintains the same number of stages as the dynamic read and write decode paths for better tracking of timing.
While the invention has been described with reference to a preferred embodiment or embodiments, it will be understood by those skilled in the art that various changes may be made and equivalents may be substituted for elements thereof without departing from the scope of the invention. In addition, many modifications may be made to adapt a particular situation or material to the teachings of the invention without departing from the essential scope thereof. Therefore, it is intended that the invention not be limited to the particular embodiment disclosed as the best mode contemplated for carrying out this invention, but that the invention will include all embodiments falling within the scope of the appended claims.
Number | Name | Date | Kind |
---|---|---|---|
5021688 | Leforestier et al. | Jun 1991 | A |
5287323 | Takahashi et al. | Feb 1994 | A |
5717653 | Suzuki | Feb 1998 | A |
6049487 | Plants et al. | Apr 2000 | A |
6243287 | Naffziger et al. | Jun 2001 | B1 |
6725325 | Nishiyama et al. | Apr 2004 | B2 |
6826088 | Sohn et al. | Nov 2004 | B2 |
20020031043 | Tsuruto et al. | Mar 2002 | A1 |
20040027857 | Ooishi | Feb 2004 | A1 |
20040095830 | Tanaka | May 2004 | A1 |
20060291283 | Jin et al. | Dec 2006 | A1 |
20100232250 | Bull et al. | Sep 2010 | A1 |
20110310691 | Zhou et al. | Dec 2011 | A1 |
Number | Date | Country |
---|---|---|
1456757 | Sep 2004 | EP |
2007012128 | Jan 2007 | JP |
1020020054209 | Jul 2002 | KR |
102007002841 | Jan 2007 | KR |
Entry |
---|
List of IBM Patents or Patent Applications Treated as Related; Sep. 30, 2014, pp. 1-2. |
Paul A. Bunce, et al., “Write/Read Priority Blocking Scheme Using Parallel Static Address Decode Path,” U.S. Appl. No. 14/501,078, filed Sep. 30, 2014. |
B. Akesson, et al., “Composability and Predictability for Independent Application Development, Verification, and Execution”, Multiprocessor System-on-Chip: Hardware Design and Tool Integration, Chapter 2, Springer New York, 2011, pp. 25-56. |
Jianhui Yue, et al., “Making Write Less Blocking for Read Accesses in Phase Change Memory”, Modeling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), IEEE 20TH International Symposium, 2012, pp. 269-277. |
Number | Date | Country | |
---|---|---|---|
20150302902 A1 | Oct 2015 | US |