The present application is related to and claims priority from and herein incorporates by reference the U.S. provisional patent application Ser. No. 60/445,872, entitled “Dual-port memory using a single port memory for multiple clock domain”, by inventor, Raza S. BABAR, filed on Feb. 7, 2003, and U.S. provisional patent application Ser. No. 60/437,303, entitled “A logic for implementing a dual clock domain read access with predictable timing for bi-directional I/Os”, by the same inventor, filed on Dec. 31, 2002.
The present disclosure generally relates to computer memory systems.
Random Access Memory cells or RAM cells have become increasingly popular due in part to the attractive costs of these devices. Many such conventional RAM cells are ordinarily only addressable from a single port. In such a memory apparatus, the input of an address causes a single select line or a coincidence of two select lines, such as a row select line and a column select line, to cause the addressed cell to be selected. Upon selection of a cell, data may then be sensed from or written into the selected storage cell on one or two bit sense lines. Alternatively, in some RAM cells, a single selection line causes the particular cell to be selected, and other selection circuitry activated by the same address causes the bit-sense lines to be selected. However, there are never any options for reading and writing at the same time. The same selection lines and the same bit sense lines are always utilized to access and write or read the content of a particular location. Consequently, a single port RAM memory device cannot be simultaneously addressed and accessed from separate sources.
Certain applications require higher memory access speeds. Examples of such high speed applications may be graphic related memory systems such as those used in a computer display systems, data transfer and buffering devices used in high speed communication systems, and memory systems used in conjunction with arithmetic logic units. For such applications, multiport random access memory devices have been developed to provide increased accessibility to the memory contents of the random access memory unit and more flexibility for inter-processor communications. One example of a multi-port memory device is a dual-port RAM cell accessible simultaneously by two independent entities. In digital integrated circuits (ICs), this implies a dual-port memory cell that can be accessed at the same time through two different ports. Each port utilizes independent sets of addresses and control lines to access the memory array.
Although the conventional dual port memory system 100 of
Therefore, systems and methods are desired that provide predictable timing for bi-directional access in multi-clock domain memory devices providing high data transfer rates required of the traditional multiport RAMs, without the disadvantages of large circuits size, high development cost, and other limitations inherent in memory systems such the ones shown in
Accordingly, embodiments of the invention are directed to a system and method for providing predictable timing for read operation from a multiport memory device operating over multiple clock domains. An embodiment of the system of the present invention is directed to a multiport memory system comprising a single port memory core synchronized to a first clock, a first multiplexer logic coupled to the core memory and a plurality of access ports, where the multiplexer logic operates as an arbitrator between the plurality of access ports attempting to access the core memory.
Each access ports includes an uncertainty detect logic that measures data path latency, and an uncertainty adjust logic that operates to selectively add data path delay to increase the measured path latency to a predictable value.
An embodiment of a method according to the present invention provides predictable timing for read access in a dual clock domain, bi-directional input/output (I/O) device. The method comprises generating a tag, sending the generated tag through a data path crossing a clock domain, determining clock path latency based on a data path travel time of the generated tag, and adding at least one pipeline stage to the data path to increase the path latency up to a predictable time.
In one embodiment, the present invention consists of two main blocks, the uncertainty detect logic and the uncertainty adjust logic. The uncertainty detect logic sends a tag information to the other clock domain. It then waits until it receives the tag back. Based on the number of clock cycles it takes the tag to go through the FIFO paths in the two directions, it adjusts the pipelining stages.
The logic detect can be a counter, which counts the number of cycles the tag takes through the FIFO. It then provides that information to the select logic. The select logic then selects the multiplexer for the appropriate pipeline delays.
The accompanying drawings, which are incorporated in and form a part of this specification, illustrate embodiments of the present invention, and together with the description, serve to explain the principles of the invention. Unless noted, the drawings referred to this description should be understood as not being drawn to scale.
The present invention provides new systems and methods for bi-directional inputs and outputs (I/Os) access to the content of multiport memory devices operating over multi clock domains by providing predictable timing for device access.
In a typical dual port memory operation, after sending a read request, there is a predictable time (number of clock cycles) after which the data is guaranteed to arrive. However, when the signals cross the clock boundaries for synchronization, clock cycle uncertainty is introduced. This clock cycle uncertainty results in an unpredictable time after which the data can be guaranteed. Over time, the clock signals move with respect to each other. This movement of the clock cycles can cause the potential proximity of the two clock signal edges. As a result, the read or write control signals may not be processed in the current clock cycle, and data read from the memory core may not be available over the current clock cycle.
In one embodiment of the present invention, the solution to the unpredictable read access timing issue may be based on the use of a data_valid pin.
Although the use of a data_valid signal may resolve some of the problems due to clock uncertainty, it requires an additional pin to indicate the availability of the data. The additional pin results in a dual port memory device having a pin structure different from that of a conventional dual port memory device.
As shown in
The operation of the port sync logic 406 and port sync logic 408 are transparent to any outside devices. The dual port memory device 400 implemented according to an embodiment of the present invention allows concurrent writes on each port, concurrent reads on each port and simultaneous read and write operation on each port in multiple clock domains.
In operation, the FIFOs are used to synchronize the signals crossing a clock boundary. For example, in the case of a write operation, data, address, and the write control signal are written from an access port synchronized to a first clock, into the write data and control FIFO 504. Thus, the incoming data and control signals are synchronized to the port sync logic's clock. The write data and control signal are then synchronized to the core memory's 402 clock through the write data and control FIFO 504.
In one embodiment, each FIFO may be implemented using a memory. In an alternative embodiment, the FIFO function may be implemented using synchronizing registers. It should be apparent to one skilled in the art that alternative implementations of the present invention may use other means of synchronizing the signals crossing a clock boundary.
The read or write operations are performed and the outgoing data and control signals are passed to the read FIFO 502. In a read operation, the incoming read control signal is first synchronized from the port clock cycle to the core using the write data and control FIFO 504. Next, the read control signal is processed with respect to the core memory 402 clock. Once the read access operation is performed, the outgoing data and control signals are passed to the read FIFO 502 and synchronized to the port clock domain using the read FIFO 502.
In one embodiment, if the port uses the same clock as the core, the FIFOs can then be bypassed. This ability to by-pass the FIFOs allows for reduced latency for that particular port.
The operation of the port sync logic 406 and port sync logic 408 may be transparent to any outside devices. Therefore, the dual port memory device 400 implemented according to an embodiment of the present invention operates to allow concurrent writes on each port, concurrent reads on each port and simultaneous read and write operation from each port in multiple clock domains. However, in the exemplary embodiment of the present invention presented herein above, when a read, write, control or address signal crosses a clock boundary, uncertainty is introduced into the system. Whenever a system runs on or is synchronized to more than one clock, a degree of uncertainty is inherent in the system. The clock uncertainty is introduced when the two clocks are very close to each other. As a result, signals may not be registered or evaluated in time, causing a delay in the response by at least one clock cycle. Thus, whenever a signal crosses the clock boundaries, some uncertainty may be introduced due to the possibility that a clock cycle may be lost because of the potential close proximity of the signal to the clock edges.
In a conventional dual port memory operation, after sending a read request, there is a predictable time (number of clock cycles) after which the data is guaranteed to arrive. In certain embodiment of the present invention, for example, the write clock which the write the request is based on may be slightly behind the read clock (e.g. see
As previously mentioned, a data_valid pin may be added to indicate whether the data is valid or not in the current clock cycle. However, in order to maintain the traditional dual port memory device design, a new solution to overcome read access clock uncertainty in a multiport memory device operating over multiple clock domains is desired.
According to an embodiment of the present invention, the uncertainty detect logic 710 includes a path counter 712 and a select logic block 714. In an alternative embodiment, a tag or control character generator may be used to generate a tag, control character or other types of unique identifier that is distinguishable from data.
In operation, the uncertainty detect logic 710 generates and sends a tag information across the multiple clock domains. In one embodiment, a tag or an otherwise recognizable character is generated by the path counter 712 or a control or tag generator and transmitted over the data path crossing the multiple clock domains. By measuring the travel time of the generated tag over the data path using the path counter 712 (also referred to as the counter), the path latency may be periodically calculated. In another embodiment, the measurement of the path latency may done in a continuous way by sending the tag along with each request and measuring its corresponding travel time. Thus, the path latency is calculated by sending the generated tag through the write data and control FIFO 604 (
The path latency results calculated by the path counter 712 is input to the select logic 714. The select logic 714 in turn controls the MUX 704. The MUX 704 controls the number of pipeline stages in the output path. Thus, the MUX 704 operates to adjust or select the appropriate pipeline stage to introduce the necessary added delay required for providing a predictable data access or read time. Based on the calculation of the path latency by the uncertainty detect logic 710, the uncertainty adjust logic 702 adjusts the path latency by adding pipelining stages that lengthen the travel time of the tag through the path and introduce additional delay in the travel path for the signal. For example, if the maximum path latency is 5 clock cycles and the generated tag travels the path in 4 clock cycles, the uncertainty adjust logic 702 adds 1 pipelining stage to the total path length in order to provide a 5 cycle latency. If the path latency is 5 cycles, no additional delay is introduced in the path.
Therefore, the uncertainty detect logic counts and compares the number of cycles or path latency and accordingly it adjusts the select logic by introducing appropriate amount of delay in the system which is performed by controlling multiplexer 704.
It should be apparent to one skilled in the art that alternative implementations of the present invention may combine the uncertainty detect logic and the uncertainty adjust logic into a single unit. Additionally, the functionalities of the path counter 712, the select logic 714, the MUX 704, etc, may be implemented by alternative devices with equivalent functionality. For example, each of the pipeline stages 708 or 706 may be implemented using a latch, a register or a memory. In one embodiment, the select logic 714 may be implemented using a comparator.
In operation 806, the path latency is calculated in terms of number of clock cycles by counting the number of clock cycles it took for the generated tag to travel through the data path.
In operation 808, based on the measured path latency, additional delay is introduced into the data path if needed by selecting additional pipeline stages to be added to the path of the data. The added delay time provides a predictable read time. If the separation between the rising clock edges of the two clocks (WCLK and RCLK, see
It would be apparent to one skilled in the art that the systems and methods disclosed in the present invention are equally applicable to multiport memory devices operating over multiple clock domains.
The teachings of the various embodiments of the present invention provide predictable timing for bidirectional I/Os from a multiport memory device. The foregoing descriptions of specific embodiments of the present invention have been presented for purposes of illustration and description. They are not intended to be exhaustive or to limit the invention to the precise forms disclosed, and many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and its practical application, to thereby enable others skilled in the art to best utilize the invention and various embodiments with various modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the Claims appended hereto and their equivalents.
Number | Name | Date | Kind |
---|---|---|---|
4899352 | Cucchi et al. | Feb 1990 | A |
4965794 | Smith | Oct 1990 | A |
4970499 | Ryherd et al. | Nov 1990 | A |
5166926 | Cisneros et al. | Nov 1992 | A |
5247626 | Firoozmand | Sep 1993 | A |
5418781 | Kaufman et al. | May 1995 | A |
5444853 | Lentz | Aug 1995 | A |
5502833 | Byrn et al. | Mar 1996 | A |
5586294 | Goodwin et al. | Dec 1996 | A |
5625625 | Oskouy et al. | Apr 1997 | A |
5638506 | Peterson et al. | Jun 1997 | A |
5649230 | Lentz | Jul 1997 | A |
5778250 | Dye | Jul 1998 | A |
5802546 | Chisholm et al. | Sep 1998 | A |
5804986 | Jones | Sep 1998 | A |
5805808 | Hasani et al. | Sep 1998 | A |
5812820 | Loram | Sep 1998 | A |
5815501 | Gaddis et al. | Sep 1998 | A |
5822383 | Muntz et al. | Oct 1998 | A |
5822776 | De Korte et al. | Oct 1998 | A |
5881269 | Dobbelstein | Mar 1999 | A |
5898687 | Harriman et al. | Apr 1999 | A |
5912898 | Khoury | Jun 1999 | A |
5914757 | Dean et al. | Jun 1999 | A |
5963499 | Leong et al. | Oct 1999 | A |
5970229 | Thomas et al. | Oct 1999 | A |
5991295 | Tout et al. | Nov 1999 | A |
6005795 | Hawkins et al. | Dec 1999 | A |
6005796 | Sywyk et al. | Dec 1999 | A |
6047339 | Su et al. | Apr 2000 | A |
6049487 | Plants et al. | Apr 2000 | A |
6104802 | Perlmutter | Aug 2000 | A |
6115379 | Flanders et al. | Sep 2000 | A |
6131123 | Hurst et al. | Oct 2000 | A |
6163810 | Bhagavath et al. | Dec 2000 | A |
6163842 | Barton | Dec 2000 | A |
6181595 | Hawkins et al. | Jan 2001 | B1 |
6181634 | Okita | Jan 2001 | B1 |
6212597 | Conlin et al. | Apr 2001 | B1 |
6246682 | Roy et al. | Jun 2001 | B1 |
6246692 | Dai et al. | Jun 2001 | B1 |
6262912 | Sywyk et al. | Jul 2001 | B1 |
6269413 | Sherlock | Jul 2001 | B1 |
6292877 | Ryan | Sep 2001 | B1 |
6295295 | Wicklund | Sep 2001 | B1 |
6304936 | Sherlock | Oct 2001 | B1 |
6339596 | Kozaki et al. | Jan 2002 | B1 |
6363077 | Wong et al. | Mar 2002 | B1 |
6425045 | LaBerge | Jul 2002 | B2 |
6430088 | Plants et al. | Aug 2002 | B1 |
6430626 | Witkowski et al. | Aug 2002 | B1 |
6504204 | Hsu et al. | Jan 2003 | B1 |
6515991 | McKeown | Feb 2003 | B1 |
6545935 | Hsu et al. | Apr 2003 | B1 |
6560668 | Ryan et al. | May 2003 | B2 |
6578118 | Raza et al. | Jun 2003 | B1 |
6581144 | Raza et al. | Jun 2003 | B1 |
6631455 | Raza et al. | Oct 2003 | B1 |
6731566 | Sywyk et al. | May 2004 | B1 |
6760327 | Manchester et al. | Jul 2004 | B1 |
6816955 | Raza et al. | Nov 2004 | B1 |
6822960 | Manchester et al. | Nov 2004 | B1 |
6834378 | Augsburg et al. | Dec 2004 | B2 |
6907479 | Karnstedt et al. | Jun 2005 | B2 |
6934289 | Lee | Aug 2005 | B2 |
7016349 | Raza et al. | Mar 2006 | B1 |
7193994 | Payson | Mar 2007 | B1 |
7242686 | Dougherty et al. | Jul 2007 | B1 |
7289946 | Lee | Oct 2007 | B1 |
20010014053 | Li | Aug 2001 | A1 |
20010044882 | Pille et al. | Nov 2001 | A1 |
20020010897 | Stark | Jan 2002 | A1 |
20020057893 | Wood et al. | May 2002 | A1 |
20020091916 | Dowling | Jul 2002 | A1 |
20020188816 | Johnson et al. | Dec 2002 | A1 |
20040039938 | Katz et al. | Feb 2004 | A1 |
20040088439 | Lee et al. | May 2004 | A1 |
Number | Date | Country |
---|---|---|
01285088 | Nov 1989 | JP |
WO2004075447 | Feb 2004 | WO |
WO2004075447 | Sep 2004 | WO |