The present invention is directed to integrated circuit testing technology. Various aspects of the invention may be particularly useful for low power scan-based testing.
As devices grow in gate count, data volume and application time for scan-based test grow significantly. Test compression techniques have been developed to reduce test data volume and test application time. A majority of the existing test compression techniques take advantage of low test cube fill rates to compress test patterns. In a paper authored by G. Mrugalski, et al., “Compression based on deterministic vector clustering of incompatible test cubes”, in Proc. IEEE. International Test Conference (ITC), November 2009 (referred to as Mrugalski et al. hereafter), which is incorporated herein by reference, a test data compression scheme is proposed to merge test cubes with a large number of identical specified bits despite the existence of a few conflicting specified bits. Each resulting test vector cluster comprises a parent pattern, a control pattern and a number of incremental patterns. In order to load scan chains with patterns that feature the original test cubes, a test controller combines the parent pattern with each of the incremental patterns separately based on the control pattern. While the compression ratio is increased due to the merging of incompatible test cubes, this scheme may limit the number of cores that can be tested in parallel due to high power consumption. The high power consumption is attributed to high switching activity in scan chains and other parts of the circuit-under-test. The excessive switching activity may cause overheating or supply voltage noise, either of which can in turn lead to device malfunction, increased chip reliability degradation, shortened circuit lifetime, or even a permanent circuit damage.
Various techniques have been proposed to reduce power consumption during a scan test. One group of the techniques is called X-filling. The X-filling techniques can be employed in a post-processing step in an ATPG (automated test pattern generation) flow and require no change to the ATPG procedures. Certain non-random values are assigned to unspecified positions of test cubes such that the number of state transitions of circuit nodes during shift or capture cycles is minimized. Similar concepts have been proposed for designs with test compression. However, these approaches only attempt to reduce the power consumption during the test pattern loading process. Power consumption during the test response unloading process and in the clock distribution network are not considered.
Another group of the techniques for reducing power consumption is based on disabling a number of the scan chains during certain time periods. These techniques divide test patterns in a given test set into different blocks and activate only a subset of scan chains at a given time. This may be achieved by enabling/disabling the clock control circuitry for each of the scan chains. Because not all of the scan chains are enabled, the power of the clock distribution and the combinational logic can both be reduced. However, these techniques may still result in high peak shifting power from time to time. When a new block of test data is shifted into the scan chains, most of the scan chains need be active at the same time. To resolve this problem, scan chains may be partitioned to selectively deactivate some scan chains in any test cycle. This, however, can lead to a significant increase in test time. Thus, challenges still remain to develop techniques that can reducing both switching activity in various test phases and test data volume without significantly increasing test time.
Aspects of the invention relate to low power scan-based testing. During testing a circuit, while first test response data for a first test pattern are being unloaded from a first group of scan chains, a portion of a second test pattern is loaded to the first group of scan chains. The first group of scan chains, a subset of scan chains in the circuit, comprises one or more conflicting scan chains for the first test pattern and the second test pattern and one or more observation scan chains for the first test pattern.
The loaded second test pattern is then applied to the circuit. Second test response data for the second test pattern is captured in a second group of scan chains. The second group of scan chains, also a subset of scan chains in the circuit, comprises one or more conflicting scan chains for the second test pattern and a third test pattern and one or more observation scan chains for the second test pattern.
While the second test response data for the second test pattern are being unloaded from the second group of scan chains, a portion of the third test pattern is loaded to the second group of scan chains.
The above process may be repeated until all available test patterns are applied to the circuit. It should be noted that if a test pattern needs to be loaded to all of the scan chains (e.g., the very first test pattern), the loading process may be performed in multiple steps to reduce switching activity.
To generate test patterns for low power scan-based testing, the test cube generation procedure may set two constraints for test cube merging: a maximum number for the number of conflicting scan chains for consecutive test patterns and a maximum number for the number of observation scan chains for each test pattern. In some embodiments of the invention, the former may be set to be greater than the latter. For a test employing test compression techniques, compression (encoding) feasibility may also be a constraint for test cube merging.
Low power scan-based testing methods may be implemented using test circuitry that comprises a control circuit. The control circuit supplies control signals to clock gating circuitry of the scan chains to control the loading/unloading and capturing operations. The control circuit may comprise a control register and an XOR network. The control circuit may further comprise a biasing circuit.
Various aspects of the present invention relate to low power scan-based testing techniques. In the following description, numerous details are set forth for the purpose of explanation. However, one of ordinary skill in the art will realize that the invention may be practiced without the use of these specific details. In other instances, well-known features have not been described in detail to avoid obscuring the present invention.
Some of the techniques described herein can be implemented by a computer-readable medium having software instructions stored thereon, a programmable computer system executing software instructions, or some combination of both. Some of the disclosed techniques, for example, can be implemented as part of an electronic design automation (EDA) tool. Such methods can be executed on a single computer or on networked computers.
Although the operations of the disclosed methods are described in a particular sequential order for convenient presentation, it should be understood that this manner of description encompasses rearrangements, unless a particular ordering is required by specific language set forth below. For example, operations described sequentially may in some cases be rearranged or performed concurrently. Moreover, for the sake of simplicity, the disclosed flow charts and block diagrams typically do not show the various ways in which particular methods can be used in conjunction with other methods. Additionally, the detailed description sometimes uses terms such as “apply,” “capture,” “load,” and “unload” to describe the disclosed methods. Such terms are high-level abstractions of the actual operations that are performed. The actual operations that correspond to these terms will vary depending on the particular implementation and are readily discernible by one of ordinary skill in the art.
Also, as used herein, the term “design” is intended to encompass data describing an entire integrated circuit device. This term also is intended to encompass a smaller group of data describing one or more components of an entire device, however, such as a portion of an integrated circuit device. Still further, the term “design” also is intended to encompass data describing more than one integrated circuit device, such as data to be used to form multiple integrated circuit devices on a single wafer.
Test cubes tend to be correlated. Faults that are structurally related require similar input bits for detection. Thus, many test cubes in a test set share identical values for many specified bits and differ in values for only a few specified bits. Here, the specified bits are bits in a test cube for which values must be specified for detecting one or more defects. Correspondingly, values for the rest of the bits do not affect the detection of the one or more defects and thus can be either “1” or “0.” These unspecified bits are often referred to as don't care bits.
Typically, ATPG tools or test data compression techniques apply test cube merging techniques to reduce pattern counts. In a test cube merging process, a test pattern is gradually developed by merging compatible test cubes with appropriate values assigned to unspecified bits. Two test cubes are compatible if for each specified bit in one test cube, the other test cube features either the same specified bit value or a don't care bit value. In another word, two test cubes are compatible if they do not have conflicting specified bits.
As observed in Mrugalski et al., by continuing the test cube merging process despite a few conflicting specified bits, a higher test compression ratio can be achieved. The method in Mrugalski et al. partitions test cubes into different clusters each containing one parent pattern that is compatible with all the other incremental test cubes in the cluster. For loading the test cubes of each cluster to the scan chains, it repeatedly applies the same parent pattern, every time using a different incremental test cube. As noted previously, however, this approach may increase switching activity.
Instead of shifting the parent pattern every time for all the test cubes in the cluster, scan chains including conflicting bits and observation bits may be updated for new test cube while the remaining scan chains hold current values. This can reduce both the amount of switching activity during the loading/unloading phase and the volume of test data needed to be delivered.
To facilitate the disclosure of various embodiments of the invention, three terms, compatible scan chains, conflicting scan chains and observation scan chains, are introduced here. A compatible scan chain for two test cubes corresponds to a portion of the two test cubes that do not have conflicting specified bits, while a conflicting scan chain corresponds to a portion of the two test cubes that have conflicting specified bits. An observation scan chain for a test cube is a scan chain in which one or more scan cells are used as an observation point for the test cube. An observation scan chain may be a compatible scan chain or a conflicting scan chain.
To reduce switching activity and test data for delivery, portions of the test cubes T1 and T2 corresponding to the three compatible scan chains A, C and D could be merged. However, to observe any fault effect of T1, the observation scan chain A should be used to capture test response data. The captured test response data need to be unloaded as well. Therefore, the observation scan chain A cannot hold the same values as it is loaded with. Accordingly, only portions of the test cubes T1 and T2 corresponding to the two compatible scan chains C and D should be merged to form a new test cube T1*. During a test, the test cube T1* is loaded to the scan chains first. After test response data for T1*(T1) are captured, they are unloaded while a portion of T2*(T2) corresponding to scan chains A and B is being loaded. This approach eliminates switching activity in scan chains C and D during this loading/unloading process.
In the above procedure, the test cube T1 is initially generated by merging all compatible test cubes in the buffer such that both the number of observation scan chains (Cunload) does not exceed the given threshold δ and compression does not fail.
The next test cube T2 is generated based on the test cube T1. In this step, Cunload of the test cube T1 should be loaded with new values during shifting test vector T2 into scan chains as Cload of T2 (conflicting scan chains for T1 and T2) is set to include the scan chains in Cunload of T1. For every remaining test cube in the buffer (candidate test cubes), the number of specified bits that each candidate test cube has in common with test vector T1 is determined. It should be noted that the values of scan chains that are not included in Cload are compared with test cubes in buffer. Also determined is whether the number of conflicting scan chains for each test cube with T1 is greater than Δ. If it is, that candidate test cube will be deleted from the list of candidate test cubes that could be merged with T2.
After all the candidate test cubes are scanned, the one with the highest degree of similarity with T1 (the largest number of common specified bits) is selected as a base test cube for T2. T2 is then generated by merging the base test cube with compatible candidate test cubes under the constraints imposed by factors δ and Δ and by compression feasibility. If the total number of scan chains that should be loaded for T2 (Cload) is less than Δ, (Δ−Cload) scan chains may be randomly selected and placed in Cload. Those scan chains that have not been loaded with a new value for a long time may be selected. As for compression of the test cube T2, only the specified bits that are included in Cload need to be encoded and the specified bits in the remaining scan chains are merged to the test cube T1.
After the test cube T2 is generated, the generation of the next test cube T3 starts. Scan chains in Cunload of T2 are placed in Cload of T3. Then the same procedure as mentioned above except the compression part is executed. As for compression of the test cube T3, only the specified positions are included in Cload are encoded while the remaining specified bits are moved to the nearest test vector in which the scan chains are reloaded with the new value. In
By the time the test cube T5 is generated in the example illustrated in
With the generated test cubes, a low power testing process may be performed according to a flow chart 300 illustrated in
In operation 320, the loaded second test pattern (test cube Ti) is applied to the circuit. Next, in operation 330, the test response data for Ti are captured in a second group of scan chains. One or more scan chains (including all scan chains) in the second group of scan chains may be used in the capturing operation. In
In operation 340, a portion of a third test pattern (test cube Ti+1) is loaded to the second group of scan chains while the captured test response data for Ti are being unloaded. Using the same example in
The low power testing system in
The fraction of scan chains driven directly by the decompressor 410 is determined by the biasing circuit 470. In
The number of scan chains that can be driven by the decompressor 410 may depend on the encoding capabilities of the XOR network 460. Because the encoding process is equivalent to solving a set of linear equations, setting a gating signal to a pre-specified value requires one or two equations. For instance, allowing the decompressor 410 to drive a given scan chain requires solving two equations xi=1 (one for each input of the AND gate). For scan chains that need to hold their value, one equation xi=0 is enough to set the output of AND gate to 0 (any input of AND gate can be chosen to set to 0).
While the invention has been described with respect to specific examples including presently preferred modes of carrying out the invention, those skilled in the art will appreciate that there are numerous variations and permutations of the above described systems and techniques that fall within the spirit and scope of the invention as set forth in the appended claims.
This application claims priority to U.S. Provisional Patent Application No. 61/438,795, entitled “Low Power Compression Utilizing Clock-Gating,” filed on Feb. 2, 2011, and naming Janusz Rajski et al. as inventors, which application is incorporated entirely herein by reference.
Number | Date | Country | |
---|---|---|---|
61438795 | Feb 2011 | US |