The present invention relates generally to skew control circuit and more specifically, for controlling the skew between at least three clock signals.
A logic device may comprise several sub-circuits each having an associated clock domain. The clock domains of two sub-circuits exchanging data have to be in synchronization to avoid data loss and/or data faults. The sub-circuits may be configured to communicate in a hierarchical structure, wherein each sub-circuit communicates with one or more daughter sub-circuits and one mother sub-circuit (except for the root sub-circuit). For example, a microprocessor may comprise four processor cores, wherein two of the four processor cores may exchange data via a first second level cache and the other two of the four processor cores may exchange data via a second level cache. To allow for a data exchange between the two branches, a third level cache is provided, which communicates with the first second level cache and the second level cache. Thus, the first second level cache has two daughter sub-circuits, namely, two of the four processor cores, and one mother sub-circuit, namely, the third level cache. The seven sub-circuits (one third level cache, two second level caches, and four processor cores) each have an associated clock domain. The clock domains are (directly or indirectly) driven by a common global clock source. However, the local clock signal of one clock domain of one sub-circuit may be early with respect to another clock-domain of a sub-circuit communicating with the aforementioned sub-circuit. The difference may also be called “skew”. Delay lines may be provided between the global clock source and the local clock sources of said clock domains to ensure proper data exchange between the sub-circuits. Additional timing restrictions may have to be observed using known skew adjusting circuits and methods to allow for parallel skew adjusting and data transmissions.
According to an embodiment, a method for operating a control circuit of a skew control circuit for controlling the skew between at least three clock signals, the method comprising: determining an order of occurrence of edges of the clock signals; selecting one of the programmable delay elements based on the determined order; and adjusting the delay caused by the selected programmable delay element.
The embodiments depicted and described herein recognize the challenges of controlling a skew between at least three clock signals. The improvement lies in the ability to avoid shortening the delay for both clocks at the same time, but decrease the delay of the later clock first and then decrease the delay of the earlier clock.
Implementation of embodiments of the invention can take a variety of forms, and exemplary implementation details are discussed subsequently with reference to the Figures.
It has become more and more common to provide means for actively reducing the skew between different clock domains in modern semiconductor circuits. Programmable delay elements may be used for this purpose. For example, as shown in
Adjusting the delays of the programmable delay elements may have an additional influence on the setup times. Minimizing the skew between the clock domains B and C shown in
The setup time t_s_BC for transmitting data from clock domain B to clock domain C is reduced from t_s_BC_o=t_cycle+|skew_BC| to t_s_BC=t_cycle+|skew_BC|−Δd. Correspondingly, the setup time for transmitting data from clock domain to C to clock domain B is increased from t_s_CB_o (not shown)=t_cycle−|skew_BC| to t_s_CB=t_cycle−|skew_BC|+Δd.
The setup time t_s_BC for transmitting data from clock domain B to clock domain C is reduced from t_s_BC_old (not shown)=t_cycle+skew_BC to t_s_BC=t_cycle+skew_BC−Δd. Accordingly, the setup time for transmitting data from clock domain to C to clock domain B is increased from t_s_CB_o=t_cycle−skew_BC to t_s_CB=t_cycle−skew_BC+Δd.
Performing skew reduction by decreasing delays induced by programmable delay elements instead of increasing the delays may be preferred to avoid increasing an overall delay and to reduce jitter.
Modern integrated semiconductor circuits may comprise more than two clock domains. For example,
In a system comprising more than two clock domains, sometimes two or more clock signals have to be adjusted at the same time to minimize the skew.
As shown in
Accordingly, the minimal setup time for transmitting data from clock domain C to clock domain B may be calculated to be t_s_CB_min=t_cycle−2·Δd. This reduction by two times the delay step Δd may also be called double cycle compression. Double cycle compression does not occur when delays are incremented.
The delay of another programmable delay element may be adjusted at least one clock cycle later. The selected programmable delay element may be the programmable delay element associated with the clock signal with the highest latency, i.e. the clock signal with the latest arrival time. The clock signal with the highest latency may be the clock signal with the last rising edge or last falling edge within a clock cycle. Thus, ordering may be done in a descending order of clock latencies. It may be started with the latest clock signal down to the earliest clock signal of the clock signals to be sped up.
In the example shown in
According to the example shown in
Based on the determined skew, a known skew reduction algorithm may be used to generate increment signals Inc_A, Inc_B, Inc_C or decrement signals Dec_A, Dec_B, Dec_C to be transmitted to the counters for adjusting the delays d_A, d_B and d_C, respectively. For example, a skew reduction algorithm is described in U.S. patent application Ser. No. 15/593,057 or U.S. Ser. No. 15/593,079 both filed on May 11, 2017, the content thereof being incorporated by reference.
The following table shows the possible combination of decrement signals:
Only in two cases more than one delay has to be decremented. Thus, only in two cases double-compression may occur.
Afterwards or if the query 801 resulted in a negative answer, it is determined if the skew reduction algorithm prescribes decreasing d_B and d_C, i.e. if the decrement signals dec_B and dec_C are both 1 (step 805). If this is the case, it is determined if the rising (falling) edge of clock signal clk_B_d arrives later than the rising (falling) edge of clock signal clk_C_d (step 806). If this is the case, decreasing the delay of the programmable delay element PD_C is deferred to a later clock cycle (step 807). If not, decreasing the delay of the programmable delay element PD_B is deferred to a later clock cycle (step 808).
The signals transmitted by the different skew detectors may be used to determine the order of the rising (falling) edges of the clock signals.
In the table shown above, the columns AT_* refer to the arrival time of the rising (falling) edges at the respective clock domain and the columns t_* define the number of clock cycles by which decreasing the delay value of the programmable delay element associated with the respective clock domain should be deferred to avoid double compression.
Deferral elements DE are provided for deferring decrementing the value stored in the counter C_C by m clock cycles, wherein m has been determined based on the skew between the different clock domains.
The deferral elements may comprise a simple flip-flop for storing the decrement signal dec_c and a transmission gate operated by a deferral control circuit.
Number | Name | Date | Kind |
---|---|---|---|
5268656 | Muscavage | Dec 1993 | A |
5465347 | Chao | Nov 1995 | A |
6150865 | Fluxman | Nov 2000 | A |
6583649 | Nakamura | Jun 2003 | B2 |
7251765 | Kushiyama | Jul 2007 | B2 |
7770049 | Searles | Aug 2010 | B1 |
7971088 | Jung | Jun 2011 | B2 |
8031823 | Jenkins et al. | Oct 2011 | B2 |
8205182 | Zlatanovici | Jun 2012 | B1 |
8817936 | Dominguez | Aug 2014 | B2 |
9356589 | Ebuchi et al. | May 2016 | B2 |
10348279 | Arp | Jul 2019 | B2 |
10564664 | Arp | Feb 2020 | B2 |
20030197537 | Saint-Laurent | Oct 2003 | A1 |
20080150605 | Chueh et al. | Jun 2008 | A1 |
20130336082 | Khawshe | Dec 2013 | A1 |
20140306746 | Meneghini | Oct 2014 | A1 |
20150323958 | Arabi | Nov 2015 | A1 |
Entry |
---|
IBM, Appendix P, List of patents or patent applications treated as related, filed herewith, 2 pages. |
Kumar et al., “A New Method to Minimize the Clock Skew to Enhance Performance of Digital System”, International Journal of Engineering Science and Technology vol. 2(6), 2010, 2479-2482. |
Sasaki et al., “A Circuit for On-chip Skew Adjustment with Jitter and Setup Time Measurement”, IEEE Asian Solid-State Circuits Conference Nov. 8-10, 2010 / Beijing, China, 4 pages. |
Number | Date | Country | |
---|---|---|---|
20190020334 A1 | Jan 2019 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 15649771 | Jul 2017 | US |
Child | 15807634 | US |