This disclosure relates to an integrated circuit (IC) chip with multiple cores. More particularly, this disclosure relates to an IC chip with a given core and another core that are asymmetrically oriented with respect to each other.
Lockstep systems are fault-detection CPU systems that redundantly execute the same set of operations at the same time in parallel. The redundancy allows error detection. In particular, if there are at least two systems (dual modular redundancy), the outputs from the lockstep operations can be compared to determine whether there has been a fault.
To run in lockstep, each redundant system is set up to progress from one well-defined state to the next well-defined state. When a new set of inputs reaches the system, each redundant system processes the new set of inputs, generates new outputs and updates a respective state. This set of changes (new inputs, new outputs and new state) is considered to define that step, and is treated as an atomic transaction. In other words, either all of the set of changes happen, or none of the set of changes happen, and not something in between.
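As a non-limiting illustration, the atomic step described above can be sketched in software. The class `RedundantSystem` and the step function `accumulate` are hypothetical names introduced only for this sketch and do not appear in the disclosure. The sketch assumes that a step is modeled as a function from (state, inputs) to (new state, outputs).

```python
from dataclasses import dataclass
from typing import Callable, Tuple

# Hypothetical step function type: (state, inputs) -> (new_state, outputs).
StepFn = Callable[[int, int], Tuple[int, int]]


@dataclass
class RedundantSystem:
    """One redundant system that advances from one well-defined state to the next."""
    state: int
    step_fn: StepFn

    def step(self, inputs: int) -> int:
        # Compute the new state and the outputs first. If step_fn raises,
        # nothing below runs and the stored state is left untouched.
        new_state, outputs = self.step_fn(self.state, inputs)
        # Commit only after the whole step has succeeded.
        self.state = new_state
        return outputs


def accumulate(state: int, inputs: int) -> Tuple[int, int]:
    """Illustrative step: reject negative inputs, otherwise add and double."""
    if inputs < 0:
        raise ValueError("negative input rejected")
    new_state = state + inputs
    return new_state, 2 * new_state
```

In this sketch, a step that fails partway leaves the system in its previous well-defined state, which is one simple way to obtain the all-or-nothing behavior. Two instances fed the same inputs progress through the same sequence of states.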
One example relates to an integrated circuit (IC) chip. The IC chip can include a given core at a position in the IC chip that defines a given orientation, wherein the given core is designed to perform a particular function. The IC chip can include another core designed to perform the particular function, the other core being flipped and rotated by 180 degrees relative to the given core such that the other core is asymmetrically oriented with respect to the given core. The IC chip can further include a compare unit configured to compare outputs of the given core and the other core to detect a fault in the IC chip.
Another example relates to another IC chip. The IC chip can include a non-transitory machine readable memory configured to store data. The IC chip can also include a pair of matched cores configured to operate in lockstep. The pair of matched cores can include a given core and another core. The other core can be flipped and rotated by 180 degrees relative to the given core such that the other core is asymmetrically oriented with respect to the given core. The IC chip can further include a compare unit configured to compare outputs of the given core and the other core to detect a fault in the IC chip.
Yet another example relates to a method that can include positioning a given core of a pair of matched cores of an IC chip. The method can also include flipping another core of the pair of matched cores relative to the given core. The method can further include rotating the other core of the pair of matched cores relative to the given core by an angle of 180 degrees. The method can yet further include positioning the flipped and rotated other core on the IC chip such that the other core is asymmetrically oriented with respect to the given core.
An integrated circuit (IC) chip can include a given core that defines a given orientation, wherein the given core is designed to perform a particular function. The IC chip can include another core designed to perform the particular function. The given and the other core can be formed with the same microchip architecture, such that the other core is a replica of the given core. The other core can be flipped and rotated by 180 degrees relative to the given core such that the given core and the other core are asymmetrically aligned with respect to each other. In this manner, no point in space is equidistant from replicated regions of the given core and the other core. The IC chip can also include a compare unit configured to compare outputs of the given core and the other core to detect a fault in the IC chip. By asymmetrically orienting the given and other core with respect to each other, the probability of a Common Cause Fault (CCF) can be reduced.
The master core 52 and the checker core 54 can be implemented as a pair of matched cores (or more generally, matched macroblocks). As used herein, the term “matched cores” (or matched macroblocks) indicates a pair of cores that have the same logical design and perform the same function. That is, each core in the pair of matched cores is a replica of the other core. Thus, the master core 52 and the checker core 54 can be implemented as the same logical design and with the same circuit components. The master core 52 and the checker core 54 can operate in lockstep. That is, the master core 52 and the checker core 54 can be implemented as a redundant pair of cores that execute the same instructions at substantially the same time. The IC chip 50 can be implemented, for example, on a safety device, wherein false positives and/or false negatives need to be avoided as much as possible in the manner described herein.
For example, the master core 52 and the checker core 54 can receive an input and generate an output in response to the input. In some examples, the input can be provided from a memory 56. The memory 56 could be a non-transitory machine readable medium, such as volatile or non-volatile random access memory (RAM). In other examples, the input could be provided from an external system (e.g., a sensor).
The IC chip 50 can also include a compare unit 58. The compare unit 58 can be implemented as a macroblock of the IC chip 50. The compare unit 58 can compare the output of the master core 52 and the checker core 54 to determine if both the master core 52 and the checker core 54 are functioning properly.
As noted, the IC chip 50 can be implemented in a safety system. For instance, the IC chip 50 can be employed to comply with safety standards set forth in the International Organization for Standardization (ISO) 26262 specifications. In a given example (hereinafter, “the given example”), the IC chip 50 can be implemented on/as a controller for an airbag deployment system. In the given example, the memory 56 can receive an input signal from a crash sensor and/or an accelerometer. The master core 52 and the checker core 54 can each execute an algorithm that can monitor the input to determine whether an output signal that causes deployment of an airbag should be activated.
In the given example, in one situation, it is presumed that the input to the IC chip 50 would be indicative of “no crash detected” (e.g., normal operations). In such a situation, both the master core 52 and the checker core 54 would generate output data that can be provided to the memory 56 and/or the compare unit 58. The compare unit 58 can compare the output data. As noted, in this situation (if functioning properly), both the master core 52 and the checker core 54 will output data indicating that the airbag is not to be deployed.
In the same situation, if either the master core 52 or the checker core 54 generates output data indicating that the airbag is to be deployed, and the other of the master core 52 and the checker core 54 generates conflicting output data indicating that the airbag is not to be deployed, the compare unit 58 can detect the conflict and generate a fault for the IC chip 50. The fault for the IC chip 50 can indicate that one of the master core 52 and the checker core 54 is malfunctioning.
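As a non-limiting illustration, the comparison performed in the given example can be sketched as follows. The names `Decision` and `compare_unit` are hypothetical and are introduced only for this sketch. The disclosure describes the compare unit 58 as a macroblock of the IC chip 50, not as software.

```python
from enum import Enum


class Decision(Enum):
    """Hypothetical encoding of the output data of one core."""
    NO_DEPLOY = 0
    DEPLOY = 1


def compare_unit(master_out: Decision, checker_out: Decision) -> bool:
    """Return True when the two outputs conflict, i.e. a fault is to be reported.

    The comparison alone does not identify which core is malfunctioning.
    """
    return master_out != checker_out


# "No crash detected": both cores agree, so no fault is reported.
agree = compare_unit(Decision.NO_DEPLOY, Decision.NO_DEPLOY)
# One core erroneously signals deployment: the conflict is reported as a fault.
conflict = compare_unit(Decision.DEPLOY, Decision.NO_DEPLOY)
```

Note that such a comparison reports a fault only when the outputs differ. Two cores producing the same erroneous output would pass the comparison.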
However, there are situations where both the master core 52 and the checker core 54 are malfunctioning concurrently due to the same disturbance, which can be referred to as a common cause failure (CCF). A CCF can occur due to thermal and/or mechanical stress in the IC chip 50. As used herein, the terms “thermal stress” and “mechanical stress” can include nearly any form of stress that adversely impacts silicon (or other material) in the IC chip 50. Moreover, in many situations, such thermal and/or mechanical stress can propagate and/or radiate from a particular point or area on the IC chip 50. The originating source of the thermal and/or mechanical stress could be external to the IC chip 50. In other examples, the source of the thermal and/or mechanical stress can be internal with respect to the IC chip 50.
Each of the master core 52 and the checker core 54 can have a particular (physical) orientation. As used herein, the orientation of each core of the IC chip 50 defines the angular position in three-dimensional space of the corresponding core. The orientation of the master core 52 can be represented by a symbol 60 illustrated in
As noted, the checker core 54 includes the same components as the master core 52. The master core 52 can be positioned somewhere on the IC chip 50 to define the orientation 60 of the master core 52. Relative to the orientation 60 of the master core 52, the checker core 54 can be flipped and rotated 180 degrees (or within 1 degree of 180 degrees). The flipping and rotating of the checker core 54 relative to the master core 52 can ensure the orientation 62 of the checker core 54 has an asymmetric orientation relative to the orientation 60 of the master core 52. As is illustrated in
Additionally, in some examples, the master core 52 and the checker core 54 can be aligned. In such a situation, an edge 68 of the master core 52 and an edge 70 of the checker core 54 that are perpendicular to the edge 64 of the master core 52 and the edge 66 of the checker core 54 extend along a common plane. Additionally or alternatively, an edge 72 of the master core 52 that opposes the edge 68 and an edge 74 of the checker core 54 that opposes the edge 70 of the checker core 54 can extend along another common plane.
As used herein, the term “asymmetric orientation” of a given core denotes an orientation relative to a replica core on a common IC chip, wherein electrical components of the given core and replica electrical components of the replica core are not equidistant from a point in space, including any point on the IC chip. For example, assuming the IC chip 50 is drawn to scale, there is no point on the IC chip 50 that would be equidistant to replicated electrical components on both the master core 52 and the checker core 54.
By flipping and rotating the checker core 54 180 degrees with respect to the master core 52, asymmetric orientation between the master core 52 and the checker core 54 can be achieved. Moreover, this specific arrangement of flipping and rotating the checker core 54 by 180 degrees with respect to the master core 52 avoids reliability issues related to poly orientation. Such reliability issues can arise in situations where the IC chip 50 is formed with microfabrication techniques on a scale of 45 nanometers (nm) or smaller.
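As a non-limiting illustration, the flipping and rotating can be expressed as coordinate transformations within the rectangular footprint of a core. The sketch assumes that "flipping" means mirroring about the vertical center line of the core. The disclosure does not state the flip axis, so this choice, the function names and the dimensions are all hypothetical.

```python
from typing import Tuple

Point = Tuple[float, float]


def flip(p: Point, width: float) -> Point:
    """Mirror a point about the vertical center line of the core (assumed flip axis)."""
    x, y = p
    return (width - x, y)


def rotate_180(p: Point, width: float, height: float) -> Point:
    """Rotate a point by 180 degrees about the center of the core."""
    x, y = p
    return (width - x, height - y)


def flip_and_rotate(p: Point, width: float, height: float) -> Point:
    """Apply the flip, then the 180 degree rotation, within the same footprint."""
    return rotate_180(flip(p, width), width, height)


# A component at local position (1, 2) in a hypothetical 10 x 6 core.
replica_position = flip_and_rotate((1.0, 2.0), 10.0, 6.0)
```

Under the stated assumption, the combined operation maps (x, y) to (x, height - y). Neither operation exchanges the x and y axes, so a feature that runs horizontally or vertically in the given core runs in the same direction in the replica.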
Thermal and/or mechanical stress may be propagated and/or radiated from the source of stress at a relatively low speed as compared to the operational speed of the master core 52 and the checker core 54. By ensuring that the master core 52 and the checker core 54 are asymmetrically oriented relative to each other, a thermal and/or mechanical stress emanating from a particular point or area on the IC chip 50 would propagate to the master core 52 and the checker core 54 at different times, thereby reducing the probability of a CCF. Stated differently, the stress from a stress source (e.g., a point or area of thermal and/or mechanical stress) that radiates and/or propagates a thermal or mechanical stress would reach a common region (a region formed of the same electrical components) of the master core 52 and of the checker core 54 at different times. Thus, the probability of both the master core 52 and the checker core 54 malfunctioning at substantially the same time is reduced. Instead, it is more likely that the thermal and/or mechanical stress would cause a malfunction in one of the master core 52 and the checker core 54 prior to causing the same malfunction in the other of the master core 52 and the checker core 54. Moreover, once such a malfunction occurs, the compare unit 58 can detect the malfunction and report a fault in the manner described herein prior to the occurrence of a CCF.
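As a non-limiting illustration, the timing argument above can be sketched with a simple model. The sketch assumes that stress propagates at a constant speed, so that arrival time equals distance divided by speed. This model, the function names and all numeric values are hypothetical and are not taken from the disclosure.

```python
def arrival_time_s(distance_um: float, speed_um_per_s: float) -> float:
    """Time for stress to reach a region, under a constant-speed assumption."""
    if speed_um_per_s <= 0:
        raise ValueError("propagation speed must be positive")
    return distance_um / speed_um_per_s


def detection_window_s(d_master_um: float, d_checker_um: float,
                       speed_um_per_s: float) -> float:
    """Gap between the arrival of stress at replica regions of the two cores."""
    return abs(arrival_time_s(d_master_um, speed_um_per_s)
               - arrival_time_s(d_checker_um, speed_um_per_s))


def fault_detectable_first(window_s: float, compare_latency_s: float) -> bool:
    """True when the compare unit has time to report a fault before both cores fail."""
    return window_s > compare_latency_s


# Hypothetical distances (micrometers) and propagation speed (micrometers/second).
window = detection_window_s(200.0, 600.0, 100.0)
detectable = fault_detectable_first(window, compare_latency_s=1e-6)
```

In this model, equal distances give a window of zero, which corresponds to the case the asymmetric orientation is intended to avoid.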
The orientation of the first core 102 can be represented by a symbol 106. The second core 104 can be flipped and rotated by 180 degrees relative to the first core 102 to ensure that the second core 104 and the first core 102 have an asymmetrical orientation relative to each other. Thus, no point in space is equidistant from the same region of replicated electrical components of the first core 102 and the second core 104. To illustrate this concept, the first core 102 and the second core 104 are separated by a distance along an edge of the first core 102 and an edge of the second core 104, D10. Moreover, similarly to the master core 52 and the checker core 54 of
A first stress source 110 can apply thermal and/or mechanical stress to the first core 102 and the second core 104. The first stress source 110 can be a point or area from which thermal and/or mechanical stress radiates and/or propagates. The first stress source 110 can have a particular distance, D12, from a specific region 112 of the first core 102. The specific region 112 of the first core can be an area, including, but not limited to, a side of the first core 102 with a specific set of electrical components. The second core 104 can also include a replica of the specific region 112 of the first core 102, namely a specific region 113 of the second core 104. However, since the first core 102 and the second core 104 are asymmetrically oriented with respect to each other, the specific region 112 in the first core 102 has a different location than the specific region 113 in the second core 104. The specific region 112 of the first core 102 and the specific region 113 can be referred to as replica regions of the cores, since the specific region 112 of the first core 102 includes the same circuit components as the specific region 113 of the second core 104. The first stress source 110 is a distance, D13, from the specific region 113 of the second core 104. As is illustrated in
As another example of asymmetric orientation, a second stress source 114 can be a point along a neutral symmetric axis 115. The neutral symmetric axis 115 can be equidistant from the first core 102 and the second core 104. The second stress source 114 can radiate and/or propagate a thermal and/or a mechanical stress to the first core 102 and the second core 104. A corner 116 of the first core 102 is separated from the second stress source 114 by a distance of D14. Moreover, a corner 118 of the second core 104 is separated from the second stress source 114 by a distance of D15. In this situation, it is presumed that D14 and D15 are equal (or nearly equal). However, due to the asymmetric orientation between the first core 102 and the second core 104, the electrical components at the corner 116 of the first core 102 and the electrical components at the corner 118 of the second core 104 are different. That is, the corner 118 of the second core 104 is not a replicated region of the corner 116 of the first core 102. Therefore, the stress emanated from the second stress source 114 does not affect the first core 102 in the same manner as the second core 104.
As explained, due to the asymmetric orientation of the second core 104 relative to the first core 102, the first stress source 110 and the second stress source 114 radiate and/or propagate thermal and/or mechanical stress to the first core 102 and the second core 104 at different times and/or in different areas. In this manner, the probability of a CCF occurring at the same time (or near the same time) is reduced. Instead, it is more likely that the stress applied by the first stress source 110 would cause a malfunction in the first core 102 prior to causing a malfunction in the second core 104. Additionally, since the second stress source 114 applies stress to different functional regions of the first core 102 and the second core 104 (at the corners 116 and 118, respectively), the type of malfunction that would occur at the first core 102 would likely be different than the type of malfunction that would occur at the second core 104. In either of these situations, the malfunction could be detected (e.g., by a compare unit) and a fault can be reported prior to a CCF occurring.
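As a non-limiting illustration, the two stress source examples can be reproduced numerically with a hypothetical layout. The sketch assumes two cores of equal size placed side by side, with the second core mirrored such that a local point (x, y) lands at (x, HEIGHT - y) within its footprint. This is one possible result of flipping and rotating by 180 degrees, and the actual layout may differ. All dimensions, coordinates and names are assumptions made for this sketch.

```python
import math
from typing import Tuple

Point = Tuple[float, float]

# Hypothetical core dimensions and spacing between the cores (arbitrary units).
WIDTH, HEIGHT, GAP = 10.0, 6.0, 2.0


def first_core_to_chip(local: Point) -> Point:
    """The first core sits at the chip origin in its original orientation."""
    return local


def second_core_to_chip(local: Point) -> Point:
    """The second core sits to the right, flipped and rotated by 180 degrees."""
    x, y = local
    return (WIDTH + GAP + x, HEIGHT - y)


def distance(a: Point, b: Point) -> float:
    return math.hypot(a[0] - b[0], a[1] - b[1])


# First example: one source, replica regions at the same local position.
source_a = (0.0, 0.0)
region_local = (1.0, 1.0)
d_first = distance(source_a, first_core_to_chip(region_local))
d_second = distance(source_a, second_core_to_chip(region_local))

# Second example: a source on the axis midway between the cores.
source_b = (WIDTH + GAP / 2.0, 0.0)
corner_first_local = (WIDTH, 0.0)   # local position nearest source_b in the first core
corner_second_local = (0.0, HEIGHT)  # local position nearest source_b in the second core
d_corner_first = distance(source_b, first_core_to_chip(corner_first_local))
d_corner_second = distance(source_b, second_core_to_chip(corner_second_local))
```

In this layout, the first source lies at different distances from the replica regions, while the second source lies at equal distances from two corners that hold different local positions and therefore different components.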
By implementing the asymmetric orientation in the manner described, the first core 102 and the second core 104 age at different rates. These different rates of aging can further reduce the probability of a CCF occurring, particularly a CCF caused by aging of the IC chip 100.
Due to the flipping and rotation by 180 degrees of the checker core 54 relative to the master core 52, the distance between specific regions, including edges of the checker core 54 and other macroblocks of the IC chip 150, may be different than the distance between the corresponding replica regions of the master core 52 and the same macroblocks. For instance, a first edge 152 of the master core 52 can correspond to a first edge 154 of the checker core 54. Similarly, a second edge 156 of the master core 52 can correspond to a second edge 158 of the checker core 54. In such a situation, connectors (e.g., conductive traces) between the compare unit 58 and the master core 52 can have a different length than the connectors between the compare unit 58 and the checker core 54. Similarly, connectors (e.g., conductive traces) between the memory 56 and the master core 52 can have a different length than the connectors between the memory 56 and the checker core 54.
In some situations, such as situations where the IC chip 150 is formed with microfabrication techniques on a scale of 45 nanometers (nm) or smaller, timing issues due to the varied distances may arise. That is, a signal output from the master core 52 may arrive at another macroblock prior to the arrival of a corresponding signal from the checker core 54. In some situations, the differing arrival time may cause timing issues. Thus, to rectify the situation, a pipeline can be added. The pipeline can be an instruction pipeline (e.g., a macroblock) configured to add a specific amount of delay to a signal.
More particularly, in the IC chip 150, an input pipeline 160 can be inserted between the memory 56 and the checker core 54. The input pipeline 160 can apply a predetermined amount of delay to signals transmitted as input from the memory 56 to the checker core 54. The predetermined amount of delay can be selected to substantially match (e.g., within 10 nanoseconds) the added transit time of signals between the memory 56 and the master core 52. Accordingly, the predetermined delay added by the input pipeline 160 can ensure that input signals transmitted from the memory 56 arrive at the master core 52 and the checker core 54 at substantially the same time (e.g., within 10 nanoseconds).
Additionally or alternatively, an output pipeline 162 can be inserted between the master core 52 and the checker core 54. The output pipeline 162 can apply a predetermined amount of delay to an output signal transmitted from the master core 52 to the compare unit 58. The predetermined amount of delay can be selected to substantially match (e.g., within 10 nanoseconds) the added transit time of signals transmitted from the checker core 54 to the compare unit 58. In this manner, the predetermined delay can ensure that output signals transmitted from the master core 52 and the checker core 54 arrive at the compare unit 58 at substantially the same time (e.g., within 10 nanoseconds).
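As a non-limiting illustration, the selection of the predetermined delay can be sketched as follows. The sketch assumes that a pipeline is built from whole clocked stages, each adding one clock period of delay. The disclosure does not specify this, and the function names, transit times and clock period are hypothetical. Times are integers in picoseconds to keep the arithmetic exact.

```python
TOLERANCE_PS = 10_000  # "within 10 nanoseconds", expressed in picoseconds


def required_delay_ps(long_path_ps: int, short_path_ps: int) -> int:
    """Delay to add on the shorter path so that both signals arrive together."""
    return max(0, long_path_ps - short_path_ps)


def pipeline_stages(delay_ps: int, clock_period_ps: int) -> int:
    """Nearest whole number of clocked stages approximating the delay (assumed model)."""
    if clock_period_ps <= 0:
        raise ValueError("clock period must be positive")
    return (delay_ps + clock_period_ps // 2) // clock_period_ps


def arrival_skew_ps(long_path_ps: int, short_path_ps: int,
                    stages: int, clock_period_ps: int) -> int:
    """Remaining difference in arrival time after the pipeline is inserted."""
    return abs(long_path_ps - (short_path_ps + stages * clock_period_ps))


# Hypothetical transit times for the two paths and a hypothetical clock period.
delay = required_delay_ps(7_300, 2_100)
stages = pipeline_stages(delay, 2_500)
skew = arrival_skew_ps(7_300, 2_100, stages, 2_500)
within_tolerance = skew <= TOLERANCE_PS
```

The same calculation applies to an input pipeline or an output pipeline. Only the pair of paths being matched changes.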
It is noted that the input pipeline 160 and the output pipeline 162 are only two examples of possible pipelines between macroblocks of the IC chip 150. For instance, in some examples, the same core can be coupled to multiple pipelines, and a corresponding replica core (e.g., that is flipped and rotated 180 degrees) may not be coupled to any pipelines. Accordingly, the number and placement of the pipelines can vary based on the architecture of the IC chip 150.
By implementing the IC chip 150 in the manner illustrated and described, the probability of a CCF occurring can be reduced. Additionally, as noted, pipelines between macroblocks (including the input pipeline 160 and/or the output pipeline 162) can be included to account for timing issues that may otherwise arise from flipping and rotating a replica core (e.g., the checker core 54) by 180 degrees.
Furthermore, the design of the IC chip 150 avoids the need for more complicated fabrication techniques. For example, the IC chip 150 can avoid the need for asymmetry for timing critical paths in the master core 52 and/or the checker core 54. Additionally, the design of the IC chip 150 avoids the need to harden the master core 52 and the checker core 54 separately by compensating for asymmetry at a signal level inside the master core 52 and the checker core 54.
In view of the foregoing structural and functional features described above, an example method will be better appreciated with reference to
At 210, a master core (e.g., the master core 52 of
What have been described above are examples. It is, of course, not possible to describe every conceivable combination of components or methodologies, but one of ordinary skill in the art will recognize that many further combinations and permutations are possible. Accordingly, the disclosure is intended to embrace all such alterations, modifications, and variations that fall within the scope of this application, including the appended claims. As used herein, the term “includes” means includes but not limited to, the term “including” means including but not limited to. The term “based on” means based at least in part on. Additionally, where the disclosure or claims recite “a,” “an,” “a first,” or “another” element, or the equivalent thereof, it should be interpreted to include one or more than one such element, neither requiring nor excluding two or more such elements.
This application is a continuation of application Ser. No. 14/854,900, filed Sep. 15, 2015, now U.S. Pat. No. 10,002,056.
Number | Date | Country |
---|---|---|
20180285218 A1 | Oct 2018 | US |

Relation | Number | Date | Country |
---|---|---|---|
Parent | 14854900 | Sep 2015 | US |
Child | 15991127 | | US |