The invention relates to the basic elements or cells of which integrated circuits are built. In particular the invention relates to the prevention of processing errors that could otherwise be caused by delays in relation to a triggering signal, such as a clock signal.
Processing of data in integrated circuits may take place in pipelines where the outputs of one or more previous combinational logic units are to constitute the inputs of one or more subsequent combinational logic units. The intermediate results are stored in registers between said combinational logic units. A clock signal synchronizes the propagation of digital values through the pipeline; in particular the storing of values into the registers. Some of the basic building blocks for use as said registers are latches and flip-flops.
An important difference between a latch and a flip-flop is that a latch is transparent during the whole pulse of a clock signal (or enabling signal, or any other pulsed control signal): its output may acquire the appropriate value even as a function of an input value that arrives late, i.e. even if the input value changes its state after the beginning of the current clock pulse. A flip-flop will lock its output as a function of the input value(s) at a triggering edge of the clock pulse, so any changes in input values after that can only affect the output at the next triggering edge of the clock pulse. Single edge triggered flip-flops can only change their state at one (rising or falling) edge of the clock pulse, while double edge triggered flip-flops may change their state at both rising and falling clock edges.
In an ideal circuit each circuit element makes its output settle early enough before the data is read into the subsequent circuit element on the appropriate clock pulse. In practical circuits this is not always the case. If a previous circuit element in the pipeline is late in making its output settle, the subsequent circuit element may operate incorrectly or in an unpredictable way. In the worst case a time error like this may seriously derail the whole execution of an algorithm, and even in controlled cases it may require rolling back the processing and flushing erroneous data from the pipeline, causing delay and unnecessary consumption of energy.
Integrated circuits that run on extremely low supply voltages are particularly prone to time errors, because the low supply voltage boosts the effect of e.g. random structural differences between transistors and other micro-scale components in the circuit that theoretically should be identical to each other but in practice are not. One solution would be to maintain the supply voltage high enough, with a suitable safety margin, but that would work against the aim of minimizing the energy consumption of the circuit.
Latches may be more tolerant against time errors than flip-flops, because a late arriving input bit to a latch just causes a corresponding delay before the output of the latch settles at the appropriate value (as long as the delay was shorter than the clock pulse enabling the latch). If a subsequent combinational logic is quick enough, it may even catch up the delay so that at the input of the next circuit element the processing is on time again. This is called time borrowing: the operation of the latch allows a preceding circuit element to “borrow” some processing time from a subsequent circuit element. Conventional flip-flops do not allow time borrowing because they lock their output at an edge of the clocking pulse. However, flip-flops have certain other advantages that would advocate preferring them in designing integrated circuits.
A solution known as the Razor is a kind of flip-flop capable of at least detecting time errors.
An objective of the present invention is to provide a register circuit for temporarily storing a digital value in a pipeline using the flip-flop approach but simultaneously allowing time borrowing. Another objective of the invention is to provide such an approach that would be applicable to both single edge and double edge triggered flip-flops. A further objective of the invention is that a register circuit according to the invention can be built both from library cells and from custom cells. Yet another objective of the invention is that the register circuit is capable of announcing the occurrence of time borrowing to other parts of the integrated circuit.
The objectives of the invention are achieved by providing first and second pulse-enabled subregister stages that together constitute a register circuit of flip-flop type, and by deliberately delaying the triggering signal to the first pulse-enabled subregister stage with respect to the triggering signal going to the second pulse-enabled subregister stage.
A register circuit according to the invention is characterized by the features recited in the characterizing part of the appended independent claim directed to a register circuit.
An integrated circuit according to the invention is characterized in that it comprises at least one register circuit of the kind described above.
A method according to the invention is characterized by the features recited in the characterizing part of the appended independent claim directed to a method.
The invention covers also an embodiment in the form of a set of library instructions for designing a part of an integrated circuit, said set of library instructions being stored on a machine-readable tangible medium. They comprise one or more set of one or more machine-readable instructions that, when executed by a computer adapted for designing integrated circuits, are configured to effect the designing of a register circuit of the kind described above.
Further embodiments and advantages of the invention are described in the depending claims.
The verb “to comprise” is used in this text as an open limitation that does not exclude the existence of also other features in addition to those that are literally mentioned.
The accompanying drawings, which are included to provide a further understanding of the invention and constitute a part of this specification, illustrate embodiments of the invention and together with the description help to explain the principles of the invention. In the drawings:
It is obvious to a person skilled in the art that with the advancement of technology, the basic idea of the invention may be implemented in various ways. The invention and its embodiments are thus not limited to the examples described above, instead they may vary within the scope of the claims.
The embodiments of the invention are meant for use in integrated circuits in which digital data is processed in pipelines consisting of sequences of combinational logic units. In particular, embodiments of the invention are meant for use as register circuits that are used to temporarily store digital values obtained from an output of a preceding circuit element, for example from an output of a preceding combinational logic unit. Temporary storing means here that the output value of the preceding circuit element is read into the register circuit and made (and kept) available at the output of the register circuit for a duration of time that is determined by a triggering signal, which for synchronous circuits is typically called a clock signal. Without losing generality the term “clock” signal can be used also for asynchronous circuits, and in this description these terms are used essentially as synonyms of each other.
Embodiments of the invention are generally categorized as register circuits of flip-flop type. This means that when the timing of the processing goes as intended, the digital value at the data input of the register circuit settles before an allowable time limit defined by a triggering edge of the clock signal, and the data output of the register circuit is locked to that value at the triggering edge. The allowable time limit is not at the triggering edge but slightly before it; the shortest allowable interval between the allowable time limit and the triggering edge is called the setup time of the register circuit of flip-flop type. Since the setup time is (essentially) constant and characteristic to the register circuit in question, the triggering edge of the clock signal may be said to “define” the allowable time limit, because the allowable time limit is always one setup time earlier than the triggering edge. The setup time is only a small fraction of the half cycle of the clock signal.
On the data propagation path between the data input 201 and data output 202 there is a sequence of a first pulse-enabled subregister stage 203 and a second pulse-enabled subregister stage 204. Being pulse-enabled, the first and second subregister stages are transparent in the sense that whenever their enabling pulse is active, any change in the input of the respective pulse-enabled subregister stage is immediately reflected at its output. When the enabling pulse is not active, the output of the pulse-enabled subregister stage maintains the value it had when the enabling pulse was active last time, and is insensitive to any changes in its input value until the enabling pulse goes active again. Being in sequence on the data propagation path between the data input 201 and data output 202 means that data coming to the data input 201 must pass through the first pulse-enabled subregister stage 203 and the second pulse-enabled subregister stage 204 in this order before becoming available at the data output 202.
If a conventional flip-flop has first and second pulse-enabled subregister stages concatenated on a data propagation path, these are typically clocked with opposite phases of the same clock signal. In the register circuit of
The register circuit of
With these generalizations in mind it can be said that a triggering edge of the clock signal defines an allowable time limit before which a digital value must appear at the data input 201 to become temporarily stored according to the normal, expected operation of the register circuit.
Additionally the register circuit of
We may first assume that the register circuit of
The D line illustrates an arbitrary passage of an input data signal, which in this example changes its value once in every clock cycle. The changes of input data associated with moments 301, 302, 303, and 306 arrive in time, while the changes of input data associated with moments 304 and 305 are late. The allowable time limit of the register circuit is not separately shown in
Since the first latch remains enabled whenever the CLKD signal is low, its output signal q1 follows the incoming data signal at all changes in
At moment 304 the first latch is still enabled, because the CLKD signal is low. Even at the time of the late change 307 in the input data signal the first latch is still enabled, because—as explained above—despite being late the change 307 comes before the time window closes at the immediately following rising edge in the CLKD signal. The output signal q1 of the first latch goes low and stays low when the first latch is disabled by the CLKD signal going high. Meanwhile the second latch has been enabled by the high value or the CLK signal, so its output follows the falling edge of q1 and the changed value of input data becomes available as a corresponding value in the output signal Q of the whole register circuit, even if it was not on time (not simultaneous with the triggering edge in the clock signal at moment 304) but slightly late.
The fact that the value of i1 (and correspondingly q1) remained unchanged even at moment 304 does not tell as such, whether this was because a change in the input data signal D was late or whether the value of D will simply stay unchanged in this clock cycle. Thus at moment 304 sharp the timing event observation logic 205 does not know yet, whether there was a timing event or not. However, when the late change 307 in input data then takes place within the time window, the timing event observation logic 205 notices this and makes the TEO signal active. The next time when the timing event observation logic 205 must be ready to detect any eventual timing event is at the next triggering (rising) edge of the CLK signal, so it is advantageous to reset the TEO signal before that, for example at the falling edge of the CLK signal in the middle between moments 304 and 305 like in
Indeed there comes another timing event associated with moment 305 in
In the embodiment of
In the embodiment of
A hybrid embodiment could be presented where the timing event observation logic 205 would have parallel inputs for receiving values on the data propagation path both before and after the first pulse-enabled subregister stage 203. Such an embodiment could involve the advantage that the timing event observation logic 205 could arrive at the correct result even in cases where the timing event took place so close to the end of the time window that it would be uncertain, whether the first pulse-enabled subregister stage 203 had been disabled already, i.e. whether the change in the input data value made it through the first pulse-enabled subregister stage 203. Since such a timing event represents already a relatively long delay in receiving the change in the input data signal, possibly necessitating significant corrective action in the whole integrated circuit, the timing event observation logic 205 may be configured to output a special kind of an indicator signal in such situations. Another possible reason for equipping the timing event observation logic 205 with parallel inputs for both i1 (=D) and q1 might be that this way the timing event observation logic 205 can be designed in a particularly advantageous way.
The embodiment shown in
Compared to those embodiments in which the delayed clock signal is derived internally within the register circuit, the embodiment of
As is shown with dashed lines and the and/or conjunction in
The pulsed clock signal CP is taken as such to clock the second pulse-enabled subregister stage 204. A delayed form CPD (clock pulse, delayed) of the pulsed clock signal is produced in a delay line 801, which in
The monitor circuit 803 in
In the embodiments described so far it has been assumed that the register circuit is of the single edge triggered flip-flop type. However, the same principle can be applied to register circuits that are of the double edge triggered flip-flop type.
The second pulse-enabled subregister stage of the register circuit in
Concerning late-arriving changes in the input data D, the standard double edge triggered flip-flop of
In the register circuit of
The data input 201 of the register circuit is coupled to the data inputs of the upper and lower latches 1101 and 1102 through respective multiplexers 1104 and 1105 that are clocked by the delayed clock signal CLKD. The arrangement and selecting of the inputs of the multiplexers 1104 and 1105 is such that on a high value of the delayed clock signal CLKD the data input 201 is coupled through multiplexer 1104 to the data input of the upper latch 1101, the enabling input of which is coupled to receive the delayed clock signal CLKD as such. On the high value of the delayed clock signal CLKD the data input of the lower latch 1102 is coupled to receive a feedback signal 11q from the data output of the lower latch 1102 itself. On a low value of the delayed clock signal CLKD the data input 201 is coupled through multiplexer 1105 to the data input of the lower latch 1102, while the data input of the upper latch is coupled to receive a feedback signal 10q from the data output of the upper latch 1101 itself. The enabling input of the lower latch 1102 is coupled to receive the inverse of the delayed clock signal CLKD.
As an example, if a change in input data should arrive to the register circuit of
The upper part of the register circuit in
A transition detector 1110 is coupled to the TE data input 1106 and to the output of the XOR gate 1109. It is configured to produce the timing event observation signal TEP in response to the value in the TE data input 1106 changing while the timing window indicator TEP-W is active.
The timing event observation logic of
Clock gating is a technique used in integrated circuits to save energy by avoiding unnecessary changes of state. If digital data is processed in a pipeline where the intermediate results given by combinational logic units are stored in registers, it may happen that the intermediate result stored in a particular (set of) register(s) does not change at all at a certain triggering edge, or even at a number of consecutive triggering edges. In such a case it is advisable to temporarily disable the clocking of the subsequent stages in the same pipeline, because their outputs would remain the same anyway.
The clock gating of
However, let us then assume that the register circuits 1401 and 1403 are of the double edge triggered flip-flop type that was generally described earlier with reference to
It is an objective of the invention to present a method and circuit for performing clock gating for register circuits of the double edge triggered flip-flop type so that a proper number of triggering edges can be ensured in all occasions. It is a further objective of the invention to present such a method and circuit that are well suited for register circuits that are not only of the double edge triggered flip-flop type but also capable of time borrowing.
Said objectives of the invention are achieved by making the clock gating circuit remember its state at the last moment of disabling and to continue from there at each edge of the ungated clock signal, allowing the phase of the gated clock signal to invert if necessary.
It is characteristic to a clock gating circuit of the kind referred to here that the clock gating circuit is configured to freeze the toggling (gated) clock signal at its current value in response to an enabling signal assuming the value that disables delivering said toggling clock signal, and to respond to said enabling signal thereafter assuming the value that enables delivering said toggling clock signal by beginning the continued toggling of said toggling clock signal at the next edge of the input clock signal.
It is characteristic to a clock gating method of the kind referred to here that it comprises freezing the toggling (gated) clock signal at its current value in response to an enabling signal assuming the value that disables delivering said toggling clock signal, and responding to said enabling signal thereafter assuming the value that enables delivering said toggling clock signal by beginning the continued toggling of said toggling clock signal at the next edge of the input clock signal.
The inventive method and circuit for clock gating are based on the insight that for clocking double edge triggered flip-flops it is actually not necessary to require the clock signal to have the same phase. Also oppositely phased clock signals will work, at least as long as it can be ensured that the structure of the double edge triggered flip-flops can prevent having two transparent register circuits in line enabled simultaneously.
Clock gating of the kind described above will be discussed in more detail in the following with reference to
The incoming CLK signal goes through two parallel branches, one of which comprises an inverter 1701, to the respective inputs of a first multiplexer 1702. The output of the first multiplexer 1702 goes to the data input of a first latch 1703, the non-inverted output of which provides the gated output clock signal GCLK. The last-mentioned is also taken to a first input of a second multiplexer 1704, the other input of which is coupled to receive the inverted output of the first latch 1703. The selection signal of the second multiplexer 1704 is the CLK signal.
The output of the second multiplexer 1704 goes to the data input of a second latch 1705, the (non-inverted) output of which constitutes the selection signal of the first multiplexer 1702. The enabling signal EN could be taken as such to the first latch 1703 and inverted to the second latch 1705; in
In the example case of
After moment 1801 the EN signal goes low; generally it can be said that the enabling signal assumes the value that disables delivering the gated clock signal to those clocked digital circuits that would otherwise receive it. This disables the first latch 1703, so its output signal Q1 is frozen at its current value. Simultaneously the second latch 1705 is enabled, so it begins to read the values of its input signal D2. These, in turn, come from the second multiplexer 1704 which, having its input values frozen, repeatedly alternates between passing them through in the rhythm of the input clock signal CLK. As long as the second latch 1705 remains enabled, the alternating values in its data input go through to its data output. The resulting toggling Q2 signal clocks the first multiplexer 1702, which—due to the inverter 1701 in one of its input branches—ensures that the input signal D1 to the first latch 1703 maintains the value it had when the EN signal went low, even if the CLK signal as such continues to toggle all the time.
After moment 1803 the EN signal goes high again. This happens while the signal D2 (and consequently also Q2) was low, so the value of the signal D1 that the now re-enabled first latch 1703 reads at its input comes from the non-inverted input branch of the first multiplexer 1702. No immediate change takes place in the gated output clock signal GCLK (which is line Q1 in
However, an important difference to conventional clock gating circuits becomes apparent at the end of the gated clock disabling interval that begins after moment 1804 in
Comparing line Q1 in
An inevitable consequence of said typical behavior is that the phase of the gated output clock signal may become inverted from what it was, and indeed does so whenever the EN signal goes high during a different phase of the CLK signal than what the CLK signal had when the EN signal went low. This is easy to see in
Inverting the phase of the gated clock signal does not matter, however, if all flip-flops in the clocked logic circuits, to which the CLK and GCLK signals are delivered, are of the double edge triggered type. For a double edge triggered flip-flop it is only important that there is an edge in the clocking signal, whether it is a rising or a falling edge does not matter.
As a general characterization, a clock gating circuit of the kind described above comprises a phase memory that is configured to memorize the phase of the input clock signal in response to the EN signal going low (i.e. assuming the value that disables delivering the gated clock signal further). The clock gating signal is configured to use the memorized phase as the phase of the gated clock signal in response to the EN signal thereafter going high again. In
The same principle can be used even if the integrated circuit comprises circuit elements that need a clock pulse window signal for defining the time window during which late-arriving changes in input data should be detected.
The clock gating circuit of
Any integrated circuit could comprise, and benefit from, a clock gating circuit of the kind described above. In particular, advantages are gained in an integrated circuit that comprises one or more double edge triggered flip-flops coupled to receive the output gated clock signal.
Advantageous features of a clock gating signal of the kind described above are recited in concise form in the following numbered clauses.
1. A clock gating circuit for selectively enabling and disabling the delivery of a toggling clock signal to one or more clocked digital circuits, comprising a clock signal input for receiving an input clock signal, a gated clock signal output for delivering said toggling clock signal to said one or more clocked digital circuits in synchronism with edges of said input clock signal, and an enabling input for receiving an enabling signal, two values of which are commands to enable and disable delivering said toggling clock signal respectively, wherein the clock gating circuit is configured to freeze said toggling clock signal at its current value in response to said enabling signal assuming the value that disables delivering said toggling clock signal, and to respond to said enabling signal thereafter assuming the value that enables delivering said toggling clock signal by beginning the continued toggling of said toggling clock signal at the next edge of said input clock signal.
2. A clock gating circuit according to numbered clause 1, comprising a phase memory configured to memorize the phase of said input clock signal in response to said enabling signal assuming the value that disables delivering said toggling clock signal, so that the clock gating circuit is configured to use the memorized phase as the phase of the toggling clock signal in response to said enabling signal thereafter assuming the value that enables delivering said toggling clock signal.
3. A clock gating circuit according to numbered clause 2, wherein said phase memory comprises a first multiplexer with two inputs and an output, so that said input clock signal is coupled to one of said inputs as such and to the other of said inputs inverted.
4. A clock gating circuit according to numbered clause 3, wherein said clock gating circuit is configured to clock said first multiplexer in rhythm with said input clock signal during those periods when said enabling signal has the value that disables delivering said toggling clock signal and to stop the clocking of said first multiplexer during those periods when said enabling signal has the value that enables delivering said toggling clock signal.
5. A clock gating circuit according to any of the numbered clauses 2 to 4, comprising first and second latches, of which the first latch is configured to be enabled by said enabling signal and the second latch is configured to be enabled by an inverse of said enabling signal, and also comprising a second multiplexer configured to be clocked by said input clock signal and to selectively couple, as determined by said clocking input clock signal, either a non-inverted or an inverted output of said first latch to an input of said second latch, wherein an output of said second latch is configured to operate said phase memory.
6. A clock gating circuit according to any of the numbered clauses 1 to 5, comprising a clock pulse window input for receiving an input clock pulse window signal, and a gated clock pulse window signal output for delivering a toggling clock pulse window signal to at least some of said one or more clocked digital circuits in synchronism with edges of said input clock window pulse signal, wherein the clock gating circuit is configured to freeze said toggling clock pulse window signal at its current value in response to said enabling signal assuming the value that disables delivering said toggling clock signal, and to respond to said enabling signal thereafter assuming the value that enables delivering said toggling clock signal by beginning the continued toggling of said toggling clock pulse window signal at the next edge of said input clock pulse window signal.
7. An integrated circuit comprising a clock gating circuit according to any of the numbered clauses 1 to 6.
8. An integrated circuit according to numbered clause 7, comprising one or more double edge triggered flip-flops coupled to receive said toggling clock signal.
9. A set of library instructions for designing a part of an integrated circuit, said set of library instructions being stored on a machine-readable tangible medium and comprising one or more set of one or more machine-readable instructions that, when executed by a computer adapted for designing integrated circuits, are configured to effect the designing of a clock gating circuit according to any of numbered clauses 1 to 6.
A method embodiment of the invention is meant for temporarily storing a digital value obtained from an output of a preceding circuit element and for making such a temporarily stored digital value available for a subsequent circuit element at a controlled moment of time. The method comprises receiving said digital value for temporary storage, and receiving a triggering signal, a triggering edge of which defines an allowable time limit before which a digital value must appear at said data input to become available for said subsequent circuit element. The method comprises also using a sequence of first and second pulse-enabled subregister stages to temporarily store said digital value, and providing said triggering signal to said first pulse-enabled subregister stage delayed with respect to the triggering signal received by said second pulse-enabled subregister stage. The length of the delay is a fraction of a cycle of said triggering signal. Further the method comprises outputting a timing event observation signal as an indicator of said digital value at said data input having changed within a time window that begins at said allowable time limit and is shorter than one cycle of said triggering signal.
Changes and modifications are possible to the embodiments described so far without parting from the scope of the appended claims. For example,
Another class of variations is related to the number of stages inside the register circuit. For example, even if the register circuit has been consistently shown to comprise exactly two stages (the first and second pulse-enabled subregister stages), saying that the register circuit comprises a sequence of these stages does not exclude it from having additionally third, fourth, etc. subregister stages along the data path between the data input and the data output. Also, while the register circuits have been described as storing one-bit digital values, a register circuit according to the invention may temporarily store multi-bit digital values. This can be achieved for example by having parallel data paths between the data input and the data output, each data paths being configured to temporarily store one bit of the multibit value.
Another class of variations is related to the use of a margin adder circuit for making the time window longer at its beginning, at its end, or at both its beginning and its end. The margin adder circuit may be coupled to the timing event observation logic and configured to displace in time at least one edge of the time window with reference to a corresponding edge of the triggering signal.
An example of such a variation is shown schematically in
Some additional circuitry could be used in the margin adder circuit to displace (to advance) also the left-hand edge of the time window, so that it would occur slightly before the rising edge in the CLK signal. An example of displacing both edges of the time window, for making the time window longer than without said displacing, is shown on the right in
The use of a margin adder circuit involves the advantage of compensating for arbitrary jitter in the timing of operation in individual circuit elements. If the timing window is defined strictly by the edges of a CLK (and CLKD) signal, it may happen that a timing event that actually occurred within the time window and should have been dealt with accordingly does not cause a timing event observation signal because the timing event observation logic actually operated according to a time window that was shorter than meant. The extra margin added by a margin adder circuit helps to ensure that a timing event observation signal is always generated when needed.
Handling multibit values may involve also time interleaving; in a way, a double edge triggered flip-flop may be considered as a register circuit for temporarily storing two-bit digital values, so that the first bit is temporarily stored and made available on the rising edge of the clock signal and the second bit is temporarily stored and made available on the falling edge of the clock signal.
Two or more register circuits for temporarily storing single-bit digital values can share some logic parts. For example, two or more register circuits of this kind may have common parts related to the detection of late-arriving changes, like common transition detectors.
In an integrated circuit that comprises a number of register circuits of the kind described above the TEO signals should be collected from all said register circuits and handled in an intelligent way. It may be sufficient just to know, for a plurality of register circuits, that a timing event was observed in one of them, without knowing exactly which one. A higher-level logic that is responsible for correct operation of one or more pipelines may take certain actions, which are known as such. For example, a certain delay may be introduced in the common clock signal so that circuit elements downstream from the one in which the timing event occurred have sufficient time to recover and ensure that they received the valid data.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/FI2017/050853 | 12/1/2017 | WO | 00 |