Various features relate to integrated circuit (IC) design and testing, and in particular to test architectures for three-dimensional (3D) stacked ICs.
Computational needs in integrated circuits are constantly increasing, especially with the growth in artificial intelligence (AI) and machine learning (ML) applications. Three-dimensional (3D) system integration has emerged as a key enabling technology to continue the scaling trajectory that Moore's Law predicted for future integrated circuit (IC) generations. More particularly, with 3D integration technology, components in a stacked IC can be placed on different dies, which can substantially reduce both the average and maximum distance between the components in the stacked IC and translate into significant savings in delay, power, and area footprint. Furthermore, 3D integration technology can enable the integration of heterogeneous devices, thereby making the entire system more compact and more efficient. Nevertheless, the success of 3D stacked ICs is predicated on the final post-bond yield.
Historically, focus on pre-bond testing was expected to improve the overall yield of 3D ICs, which involves testing each individual die in a 3D stacked IC prior to the bonding process, because manufacturers can avoid stacking defective dies with good dies. However, post-stacking testing discovered that defects still occur due to assembly, stacking of dies, and packaging. There is accordingly an ongoing need to provide efficient post-stacking testing features for 3D stacked circuits.
The following presents a summary of one or more aspects of the present disclosure, in order to provide a basic understanding of such aspects. This summary is not an extensive overview of all contemplated features of the disclosure, and is intended neither to identify key or critical elements of all aspects of the disclosure nor to delineate the scope of any or all aspects of the disclosure. Its sole purpose is to present some concepts of one or more aspects of the disclosure in a form as a prelude to the more detailed description that is presented later.
Various features relate to stacked circuits (e.g., 3D stacked ICs) that facilitate post-stacking testing. One example provides a stacked circuit that includes a first die electrically coupled to a second die through a plurality of interconnects. The first die may include a test input interface configured to receive test data signals and a source test clock signal, a test output interface configured to convey test responses, a first test signal path configured to transition the test data signals from the source test clock to a first balanced clock tree, to test the first die using the test data signals transitioned to the first balanced clock tree, to transition the test data signals and resulting test responses from the first balanced clock tree to a low-latency clock, and to convey resulting test responses to the test output interface, at least one first die-to-die output interface configured to convey to the second die the test data signals and a low-latency clock signal received from a low-latency clock path between the test input interface and the at least one first die-to-die output interface, and at least one first die-to-die input interface configured to receive test responses and the clock signal from the second die.
Another example provides an apparatus that includes a first die electrically coupled to a second die in a 3D stacked circuit configuration. The first die may be configured to convey test data signals and a clock signal to the second die, where the first die includes a first tunable die-to-die output interface configured to tune at least one of the test data signals or the clock signal prior to being conveyed to the second die, and a first tunable die-to-die input interface configured to tune at least one of test response signals or the clock signal received from the second die. The second die may include a second tunable die-to-die input interface configured to tune the at least one of the test data signals or the clock signal received from the first die, and a second tunable die-to-die output interface configured to tune the at least one of the test response signals or the clock signal prior to being conveyed to the first die.
Another example provides a method for fabricating a stacked circuit. The method provides a first die and a second die, and electrically coupling the first die and the second die together to form a 3D stacked circuit. The first die may include a test input interface configured to receive test data signals and a source test clock signal, a test output interface configured to convey test responses, a first test signal path configured to transition the test data signals from the source test clock to a first balanced clock tree, to test the first die using the test data signals transitioned to the first balanced clock tree, to transition the test data signals and resulting test responses from the first balanced clock tree to a low-latency clock, and to convey resulting test responses to the test output interface, at least one first die-to-die output interface configured to convey to the second die the test data signals and a low-latency clock signal received from a low-latency clock path between the test input interface and the at least one first die-to-die output interface, and at least one first die-to-die input interface configured to receive test responses and the clock signal from the second die.
Yet another example provides a method operational on a stacked circuit. The method includes receiving test data signals and a source test clock in a first die, transitioning the test data signals from the source test clock to a first balanced clock tree, employing the test data signals transitioned to the first balanced clock tree to test the first die, where the test of the first die results in first die test responses, transitioning the test data signals to a low-latency clock, conveying the low-latency clock and the test data signals transitioned to the low-latency clock to a second die stacked on the first die and electrically coupled to the first die, transitioning the test data signals in the second die from the test clock to a second balanced clock tree, employing the test data signals transitioned to the second balanced clock tree to test the second die, wherein the test of the second die results in second die test responses, transitioning the second die test responses to the low-latency clock, and conveying the second die test responses and low-latency clock from the second die to the first die.
These and other aspects of the disclosure will become more fully understood upon a review of the detailed description, which follows. Other aspects, features, and examples of the present invention will become apparent to those of ordinary skill in the art, upon reviewing the following description of specific, exemplary examples in conjunction with the accompanying figures. While features may be discussed relative to certain examples and figures below, all examples can include one or more of the advantageous features discussed herein. In other words, while one or more examples may be discussed as having certain advantageous features, one or more of such features may also be used in accordance with the various examples of the invention discussed herein. In similar fashion, while exemplary examples may be discussed below as device, system, or method examples it should be understood that such exemplary examples can be implemented in various devices, systems, and methods.
In the following description, specific details are given to provide a thorough understanding of the various aspects of the disclosure. However, it will be understood by one of ordinary skill in the art that the aspects may be practiced without these specific details. For example, circuits may be shown in block diagrams in order to avoid obscuring the aspects in unnecessary detail. In other instances, well-known circuits, structures and techniques may not be shown in detail in order not to obscure the aspects of the disclosure.
The illustrations presented herein are, in some instances, not actual views of any particular stacked IC, but are merely idealized representations which are employed to describe the present disclosure. Additionally, elements common between figures may retain the same numerical designation.
The present disclosure describes 3D stacked ICs utilizing a common scan interface in the first die to scan both the first die and one or more dies stacked on the first die.
The stacked circuit 100 in
Although only two dies 102, 104 are shown in
Referring to
The first die 102 may further include a first test signal path 214. In at least one example, the first test signal path 214 may be configured to transition the test data signals from a source test clock (e.g., the source test clock received at the test clock input interconnect 208 of the test input interface 202) to a first balanced clock tree 224 having a higher latency compared to the source test clock. For example, the first die 102 may include a first transition circuit 215 configured to receive the test data signals from the test input interface 202, and to transition the test data signals from the source test clock to the first balanced clock tree 224 having a relatively higher latency. In the example depicted, the first transition circuit 215 includes a plurality of buffers 216 and a synchronous FIFO (first-in-first-out) 218 with clock inputs from both the source test clock and the first balanced clock tree 224. It will be apparent that various configurations of a transition circuit capable of transitioning the test data signals to a first balanced clock tree 224 may be utilized. In any such examples, the first balanced clock tree 224 will have a relatively higher latency compared to the source test clock. As used herein, a “transition” from one clock (e.g., the source test clock) to a different clock (e.g., the first balanced clock tree 224) refers to test data signals and/or test responses transitioning from circuitry employing the first clock signal to circuitry employing the second clock signal.
With the test data signals transitioned to the first balanced clock tree 224, the test data signals may be received along the first test signal path 214 by a first testing circuit 220. As shown, the first testing circuit 220 may include positive edge flops P, one or more cores, negative edge flops R, as well as multiplexors and other relevant components. It should be understood that the specific circuitry for the first testing circuit 220 will depend on the specific application and will vary according to such differing applications.
The first die 102 further includes a second transition circuit 225 configured to transition the test data signals from the first balanced clock tree 224 to a low-latency test clock signal on a first low-latency clock path 222. The first low-latency clock path 222 may be configured to convey the source test clock signal from the test clock input interconnect 208 along a low-latency clock path to the second die 104. The second transition circuit 225 may further transition the test responses from the first testing circuit 220 from the first balanced clock tree 224 to the low-latency test clock signal on the first low-latency clock path 222. In the depicted example, the second transition circuit 225 includes a positive edge flop 226 utilizing the first balanced clock tree 224, a FIFO 228 receiving clock inputs from both the first balanced clock tree 224 and the low-latency test clock signal from the first low-latency clock path 222, and a positive edge flop 230 utilizing the low-latency test clock signal. It should again be apparent that various configurations of a transition circuit capable of transitioning the test data signals and the test responses from the first balanced clock tree 224 to the low-latency clock signal may be utilized.
According to further aspects of the present disclosure, the first die 102 may include at least one first die-to-die output interface 232 configured to convey the test data signals and the low-latency clock received from the first low-latency clock path 222 to the second die 104. These test data signals may be similar to those received at the test data signal interconnects 206, or could be modified in the first die 102, according to different examples. The first die-to-die output interface 232 includes a plurality of output tunable delay circuits 234. For example, the plurality of output tunable delay circuits 234 may include at least one output tunable delay circuit 234a for the test data signals and an output tunable delay circuit 234b for the clock signal received from the first low-latency clock path 222. Each output tunable delay circuit 234a and 234b may be any suitable tunable delay circuit according to various aspects.
Referring again to
Turning to
The second die 104 further includes at least one second die-to-die output interface 504 configured to convey test responses and a clock signal to the first die 102. The second die-to-die output interface 504 includes a plurality of output tunable delay circuits 234. For example, the plurality of output tunable delay circuits 234 may include at least one output tunable delay circuit 234 for test responses generated at the second die 104 and an output tunable delay circuit 234 for the clock signal from the second die 104. The output tunable delay circuit 234 may be any suitable tunable delay circuit according to various embodiments, including the output tunable delay circuit 234 according to the example in
The second die 104 also includes a second test signal path 506. In at least one aspect, the second test signal path 506 may be configured to transition the test data signals from the low-latency clock signal received at the second die-to-die input interface 502 to a second balanced clock tree 508 having a higher latency compared to the received low-latency clock signal. For example, the second die 104 may include a third transition circuit 509 configured to receive the test data signals from the second die-to-die input interface 502, and to transition the test data signals from the low-latency clock signal received from the second die-to-die input interface 502 to the second balanced clock tree 508. In the example depicted, the third transition circuit 509 includes a plurality of buffers 510 and a synchronous FIFO 512 with clock inputs from both the low-latency clock and the second balanced clock tree 508. It will be apparent that various configurations of a transition circuit capable of transitioning the test data signals to the second balanced clock tree 508 may be utilized. In any such embodiments, the second balanced clock tree 508 will have a relatively higher latency compared to the low-latency clock signal received from the second die-to-die input interface 502.
With the test data signals transitioned to the second balanced clock tree 508, the test data signals may be received along the second test signal path 506 by a second testing circuit 514. Like the first testing circuit 220 in the first die 102 in
The second die 104 also includes a second low-latency clock path 516 configured to convey the low-latency clock signal from the second die-to-die input interface 502 to a fourth transition circuit 517. The fourth transition circuit 517 is configured to transition test data signals (if conveying to a third die, not shown) from the second balanced clock tree 508 to the low-latency clock signal on the second low-latency clock path 516. The fourth transition circuit 517 may further transition the test responses from the second testing circuit 514 from the second balanced clock tree 508 to the low latency clock signal on the second low-latency clock path 516. In the depicted example, the fourth transition circuit 517 includes a positive edge flop 518 utilizing the second balanced clock tree 508, a FIFO 520 receiving clock inputs from both the second balanced clock tree 508 and the low-latency clock signal on the second low-latency clock path 516, and a positive edge flop 522 utilizing the low-latency clock signal. It should again be apparent that various configurations of a transition circuit capable of transitioning test data signals and test responses from the second balanced clock tree 508 to the low-latency clock signal may be utilized.
Although the examples in
On the return path, from the second die 104 to the first die 102, each test data bit signal path on the bus can include a respective synchronous FIFO 520, a respective positive edge flop 522, and a respective output tunable delay circuit 234. Following each respective output tunable delay circuit 234 on the second die 104, each test data bit signal is conveyed to the first die 102 including a respective TSV, input tunable delay circuit 238, negative edge flop R, and synchronous FIFO.
In
The tunable delay circuits for each test data bit signal path in the bus can help to maintain test data signal coherency when the test data signals are transferred from die to die. Additionally, these tunable delay circuits provide granular delay control on both clock and data transfers between dies. The tunable delay circuits provide clock path delay control for coarse delay tuning, and data path delay control for fine delay tuning per bit. Further, the tunable delay circuits enable the dies to be tuned on the fly.
Additional features of the present disclosure include 3D stacked ICs including die-to-die interfaces configured to debug, diagnose, and repair the test data signal paths between dies. For instance, the first die-to-die output interface 232 and the second die-to-die input interface 502 may be configured to debug, diagnose, and repair the test data signal paths between the first die 102 and the second die 104. Similarly, the first die-to-die input interface 236 and the second die-to-die output interface 504 may be configured to debug, diagnose, and repair the test data signal paths between the first die 102 and the second die 104. For example, the last stage of the pipeline flops, such as the positive edge flop 230 in
As shown, the positive edge flops P and negative edge flops R are stitched together in a scan chain for each of the first die 102 and the second die 104, along with an on-chip clock controller (OCC) 702 driving both the first die 102 and the second die 104. With the depicted embodiment, a test signal can be launched from one die and captured at the other die at each flop location. The test transitions can be launched at high speeds or low speeds, as desired. Based on the scan shift out data, an accurate determination can be made to identify which bits of scan data has data coherency issues.
As a specific example, a value can be conveyed from a first positive edge flop 704 of the first die 102 to be captured on a first negative edge flop 706 of the second die 104. In the depicted example, the value conveyed from the first positive edge flop 704 may be accurately captured by the first negative edge flop 706, but the capture is delayed from what it should be. When the test results are shifted out through the scan chain, a determination can be made that the connection between the first positive edge flop 704 of the first die 102 and the first negative edge flop 706 of the second die 104 is experiencing a delay defect. By knowing where the defect is occurring and that the defect is a delay defect, the first die 102 can employ the output tunable delay circuit 234 (shown in
According to another example in the depicted embodiment, a value is also conveyed from a second positive edge flop 708 of the second die 104 to a second negative edge flop 710 of the first die 102. In this example, the value is not captured at the second negative edge flop 710 because of some defect inhibiting the value from being successfully conveyed from the second die 104 to the first die 102. The results will be shifted out through the scan chain, and a determination can be made that a static defect exists at the specific data path.
In some embodiments of the present disclosure, spare TSVs may be employed to facilitate repairing the static defect between the second positive edge flop 708 and the second negative edge flop 710. For example, referring to
In operation, multiple stacked dies of a 3D stacked integrated circuit can be tested, and even repaired.
At 804, the test data signals may be transitioned to a first balanced clock tree. For example, the test clock signal may be converted through a plurality of buffers 216 to generate the first balanced clock tree 224. Additionally, the test clock signal may be split along a first low-latency clock path 222, resulting in a first low-latency clock. The test data signals are conveyed along the first test signal path 214, where the test data signals are transitioned to the first balanced clock tree 224 in the first transition circuit 215. In the example in
At 806, the test data signals transitioned to the first balanced clock tree 224 may be employed to test the first die 102. For example, the first testing circuit 220 may utilize the test data signals transitioned to the first balanced clock tree 224 to test the first die 102.
At 808, the test data signals are then transitioned from the first balanced clock tree 224 to the low-latency clock signal conveyed along the first low-latency clock path 222. More specifically, the test data signals may be transitioned to the low-latency clock utilizing the positive edge flop 226, the synchronous FIFO 228, and the positive edge flop 230. Similarly, the resulting test responses from the first testing circuit 220 may also be transitioned from the first balanced clock tree 224 to the low-latency clock. The resulting first die test responses can be conveyed to the test response signal interconnects 210.
At 810, the low-latency clock, together with the test data signals converted to the low-latency clock are conveyed to the second die 104. For example, low-latency clock and the test data signals converted to the low-latency clock can be conveyed the first die-to-die output interface 232. At the first die-to-die output interface 232, the test data signals and the low-latency clock signal may be tuned in the respective output tunable delay circuits 234, and conveyed to the second die-to-die input interface 502 of the second die 104.
At the second die-to-die input interface 502 of the second die 104, the received test data signals and the low-latency clock signal are tuned in the input tunable delay circuit 238. The low-latency clock signal is converted to a second balanced clock tree 508 utilizing the plurality of buffers 510, and also maintained as a low-latency clock signal conveyed along the second low-latency clock path 516.
At 812, the test data signals may be transitioned from the low-latency clock to the second balanced clock tree 508. More specifically, the test data signals may be conveyed along the second test signal path 506, where the test data signals are transitioned to the second balanced clock tree 508 in the first transition circuit 215.
At 814, the test data signals transitioned to the second balanced clock tree 508 may be employed to test the second die 104. For example, the test data signals transitioned to the second balanced clock tree 508 may be utilized by the second testing circuit 514 to test the second die 104, resulting in second die test responses.
At 816, the second die test responses from the second testing circuit 514 may be transitioned from the second balanced clock tree 508 to the low-latency clock signal. For example, the second die test responses, and any test data signals that may be conveyed to a third die stacked on the second die 104, may be transitioned from the second balanced clock tree 508 to the low-latency clock signal conveyed along the second low-latency clock path 516 utilizing, for example, the positive edge flop 518, the synchronous FIFO 520, and the positive edge flop 522.
At 818, the resulting test responses, together with the low-latency clock signal, can be conveyed to the first die 102. For example, the resulting test responses and the low-latency clock signal can be conveyed to the second die-to-die output interface 504. At the second die-to-die output interface 504, the resulting test responses and the low-latency clock signal may be tuned in the output tunable delay circuit 234, and conveyed to the first die-to-die input interface 236 of the first die 102. At the first die-to-die input interface 236 of the first die 102, the resulting test responses received from the second die 104 and the low-latency clock signal may be tuned in the input tunable delay circuit 238. The resulting test responses received from the second die 104 and resulting test responses from the first die 102 can be conveyed to the test response signal interconnects 210 of the test output interface 204 of the first die 102.
Additional aspects of the present disclosure include methods of making a stacked circuit, such as one or more examples of 3D stacked IC described herein.
More specifically, the first test signal path 214 may include a first transition circuit 215, a first testing circuit 220, and a second transition circuit 225. The first transition circuit 215 is configured to receive the test data signals from the test input interface 202 and to transition the test data signals from the source test clock to the first balanced clock tree 224, where the first balanced clock tree 224 has a higher latency compared to the source test clock. The first testing circuit 220 is configured to receive the test data signals from the first transition circuit 215 and to test the first die 102 utilizing the test data signals. The second transition circuit 225 is configured to receive test data signals and resulting test responses from the first testing circuit 220 to transition the test data signals and resulting test responses from the first balanced clock tree 224 to the low-latency clock prior to conveying the resulting test responses to the test output interface 204 and the test data signals to the at least one first die-to-die output interface 232.
The provided first die 102 may further include the at least one first die-to-die output interface 232 configured to convey to the second die 104 the test data signals and a low-latency clock signal received from a first low-latency clock path 222 between the test input interface 202 and the at least one first die-to-die output interface 232, and at least one first die-to-die input interface 236 configured to receive test responses and the clock signal from the second die 104. According to various examples, the first die-to-die output interface 232 may include at least one output tunable delay circuit 234 for the test data signals, and an output tunable delay circuit for the low-latency clock signal. Similarly, the at least one first die-to-die input interface 236 may include at least one input tunable delay circuit 238 for the received test responses from the second die 104, and an input tunable delay circuit 238 for the low-latency clock signal received from the second die 104.
At 904, a second die 104 may be provided. The second die 104 may include at least one second die-to-die input interface 502, a second die-to-die output interface 504, and a second test signal path 506. The at least one second die-to-die input interface 502 may be electrically coupled to the at least one first die-to-die output interface 232 of the first die 102, and configured to receive the test data signals and the low-latency clock signal from the first die 102. The second die-to-die output interface 504 may be configured to convey test responses and the low-latency clock signal to the first die 102. The second test signal path 506 may be configured to transition the test data signals from the low-latency clock to a second balanced clock tree 508, to test the second die 104 using the test data signals transitioned to the second balanced clock tree 508, to transition the resulting test responses from the second balanced clock tree 508 back to the low-latency clock, and to convey resulting test responses to the second die-to-die output interface 504.
The second test signal path 506 may include a third transition circuit 509, a second testing circuit 514, and a fourth transition circuit 517. The third transition circuit 509 may be configured to receive the test data signals from the at least one second die-to-die input interface 502, and to transition the test data signals from the low-latency clock to the second balanced clock tree 508. The second testing circuit 514 may be configured to receive the test data signals from the third transition circuit 509 and to test the second die 104 utilizing the test data signals. The fourth transition circuit 517 may be configured to receive resulting test responses from the second testing circuit 514, and to transition the resulting test responses from the second balanced clock tree 508 to the low-latency clock prior to conveying the resulting test responses to the second die-to-die output interface 504.
According to various examples, the at least one second die-to-die input interface 502 may include at least one input tunable delay circuit 238 for the test data signals, and an input tunable delay circuit 238 for the low-latency clock signal. Similarly, the at least one second die-to-die output interface 504 may include at least one output tunable delay circuit 234 for the resulting test responses from the second die 104, and an output tunable delay circuit 234 for the low-latency clock signal.
In at least some implementations, the first die-to-die output interface 232 and the second die-to-die input interface 502 may be coupled together to form a stitched scan chain to test signal connections for each test data signal path from the first die-to-die output interface 232 to the second die-to-die input interface 502. Similarly, the first die-to-die input interface 236 and the second die-to-die output interface 504 may be coupled together to form a stitched scan chain to test signal connections for each test data signal path from the second die-to-die output interface 504 to the first die-to-die input interface 236.
At 906 in
Various aspects of the disclosure may be incorporated into electronic devices.
The various features described herein may include additional aspects, such as any single aspect or any combination of aspects described below and/or in connection with one or more other processes described elsewhere herein.
In a first aspect, a stacked circuit may include a first die electrically coupled to a second die through a plurality of interconnects, the first die including a test input interface configured to receive test data signals and a source test clock signal, a test output interface configured to convey test responses, a first test signal path configured to transition the test data signals from the source test clock to a first balanced clock tree, to test the first die using the test data signals transitioned to the first balanced clock tree, to transition the test data signals and resulting test responses from the first balanced clock tree to a low-latency clock, and to convey resulting test responses to the test output interface, at least one first die-to-die output interface configured to convey to the second die the test data signals and a low-latency clock signal received from a low-latency clock path between the test input interface and the at least one first die-to-die output interface, and at least one first die-to-die input interface configured to receive test responses and the clock signal from the second die.
In a second aspect, alone or in combination with the first aspect, the first test signal path may comprise a first transition circuit configured to receive the test data signals from the test input interface and to transition the test data signals from the source test clock to the first balanced clock tree, where the first balanced clock tree has a higher latency compared to the source test clock, a first testing circuit configured to receive the test data signals from the first transition circuit and to test the first die utilizing the test data signals, and a second transition circuit configured to receive test data signals and resulting test responses from the first testing circuit to transition the test data signals and resulting test responses from the first balanced clock tree to the low-latency clock prior to conveying the resulting test responses to the test output interface and the test data signals to the at least one first die-to-die output interface.
In a third aspect, alone or in combination with one or more of the first and second aspects, the at least one first die-to-die output interface may include at least one first tunable delay circuit for the test data signals and a first tunable delay circuit for the low-latency clock signal, and the at least one first die-to-die input interface may include at least one second tunable delay circuit for the received resulting test responses and a second tunable delay circuit for the received low-latency clock signal.
In a fourth aspect, alone or in combination with one or more of the first through third aspects, the stacked circuit may further include a plurality of TSVs and at least one spare TSV, and at least one respective fuse associated with each TSV to facilitate selection or deselection of each respective TSV.
In a fifth aspect, alone or in combination with one or more of the first through fourth aspects, the second die may include at least one second die-to-die input interface electrically coupled to the at least one first die-to-die output interface of the first die, the at least one second die-to-die input interface configured to receive the test data signals and the low-latency clock signal from the first die, a second die-to-die output interface configured to convey test responses and the low-latency clock signal to the first die, and a second test signal path configured to transition the test data signals from the low-latency clock to a second balanced clock tree, to test the second die using the test data signals transitioned to the second balanced clock tree, to transition the resulting test responses from the second balanced clock tree back to the low-latency clock, and to convey resulting test responses to the second die-to-die output interface.
In a sixth aspect, alone or in combination with one or more of the first through fifth aspects, the second test signal path may include a third transition circuit configured to receive the test data signals from the at least one second die-to-die input interface and to transition the test data signals from the low-latency clock to the second balanced clock tree, a second testing circuit configured to receive the test data signals from the third transition circuit and to test the second die utilizing the test data signals, and a fourth transition circuit configured to receive resulting test responses from the second testing circuit to transition the resulting test responses from the second balanced clock tree to the low-latency clock prior to conveying the resulting test responses to the second die-to-die output interface.
In a seventh aspect, alone or in combination with one or more of the first through sixth aspects, the at least one second die-to-die input interface may include at least one third tunable delay circuit for the received test data signals and a third tunable delay circuit for the received low-latency clock signal, and the at least one second die-to-die output interface may include at least one fourth tunable delay circuit for the resulting test responses from the second die, and a fourth tunable delay circuit for the low-latency clock signal.
In an eighth aspect, alone or in combination with one or more of the first through seventh aspects, the first die-to-die output interface and the second die-to-die input interface may include a stitched scan chain to test signal connections for each test data signal path from the first die-to-die output interface to the second die-to-die input interface.
In a ninth aspect, alone or in combination with one or more of the first through eighth aspects, the first die-to-die input interface and the second die-to-die output interface may include a stitched scan chain to test signal connections for each test data signal path from the second die-to-die output interface to the first die-to-die input interface.
In a tenth aspect, alone or in combination with one or more of the first through ninth aspects, the stacked circuit may be incorporated into a device selected from a group consisting of a music player, a video player, an entertainment unit, a navigation device, a communications device, a mobile device, a mobile phone, a smartphone, a personal digital assistant, a fixed location terminal, a tablet computer, a computer, a wearable device, a laptop computer, a server, an internet of things (IoT) device, and a device in an automotive vehicle.
In an eleventh aspect, an apparatus may include a first die electrically coupled to a second die in a 3D stacked circuit configuration, the first die configured to convey test data signals and a clock signal to the second die, wherein the first die includes a first tunable die-to-die output interface configured to tune at least one of the test data signals or the clock signal prior to being conveyed to the second die, and a first tunable die-to-die input interface configured to tune at least one of test response signals or the clock signal received from the second die, and the second die including a second tunable die-to-die input interface configured to tune the at least one of the test data signals or the clock signal received from the first die, and a second tunable die-to-die output interface configured to tune the at least one of the test response signals or the clock signal prior to being conveyed to the first die.
In a twelfth aspect, alone or in combination with the eleventh aspect, the first die may include a test input interface configured to receive the test data signals and a source test clock signal, a test output interface configured to convey test responses, a first test signal path configured to transition the test data signals from the source test clock to a first balanced clock tree, to test the first die using the test data signals transitioned to the first balanced clock tree, to transition the test data signals and resulting test responses from the first balanced clock tree to a low-latency clock, and to convey resulting test responses to the first tunable die-to-die output interface, and a low-latency clock path to convey the low-latency clock from the test input interface to the first tunable die-to-die output interface.
In a thirteenth aspect, alone or in combination with one or more of the eleventh or the twelfth aspects, the first test signal path may include a first transition circuit configured to receive the test data signals from the test input interface and to transition the test data signals from the source test clock to the first balanced clock tree, where the first balanced clock tree has a higher latency compared to the source test clock, a first testing circuit configured to receive the test data signals from the first transition circuit and to test the first die utilizing the test data signals, and a second transition circuit configured to receive test data signals from the testing circuit, and to transition the test data signals from the first balanced clock tree to the low-latency clock prior to conveying the test data signals to the first tunable die-to-die output interface.
In a fourteenth aspect, alone or in combination with one or more of the eleventh through thirteenth aspects, the second die may further include a second test signal path configured to transition the test data signals from the clock signal received from the first die to a second balanced clock tree, to test the second die using the test data signals transitioned to the second balanced clock tree, to transition resulting test responses from the second balanced clock tree to the received clock signal, and to convey resulting test responses to the second tunable die-to-die output interface.
In a fifteenth aspect, alone or in combination with one or more of the eleventh through fourteenth aspects, the second test signal path may include a third transition circuit configured to receive the test data signals from the second tunable die-to-die input interface and to transition the test data signals from a low-latency clock to the second balanced clock tree, a second testing circuit configured to receive the test data signals from the third transition circuit and to test the second die utilizing the test data signals, and a fourth transition circuit configured to receive resulting test responses from the second testing circuit, and to transition the resulting test responses from the second balanced clock tree to the low-latency clock prior to conveying the resulting test responses to the second tunable die-to-die output interface.
In a sixteenth aspect, alone or in combination with one or more of the eleventh through fifteenth aspects, the first tunable die-to-die output interface and the second tunable die-to-die input interface may include a stitched scan chain to test signal connections for each test data signal path from the first tunable die-to-die output interface to the second tunable die-to-die input interface, and the first tunable die-to-die input interface and the second tunable die-to-die output interface may include a stitched scan chain to test signal connections for each test data signal path from the second tunable die-to-die output interface to the first tunable die-to-die input interface.
In a seventeenth aspect, alone or in combination with one or more of the eleventh through sixteenth aspects, the apparatus may further include a plurality of TSVs and at least one spare TSV, and at least one respective fuse associated with each TSV to facilitate selection or deselection of each respective TSV.
In an eighteenth aspect, alone or in combination with one or more of the eleventh through seventeenth aspects, the apparatus may be selected from a group consisting of a music player, a video player, an entertainment unit, a navigation device, a communications device, a mobile device, a mobile phone, a smartphone, a personal digital assistant, a fixed location terminal, a tablet computer, a computer, a wearable device, a laptop computer, a server, an internet of things (IoT) device, and a device in an automotive vehicle.
In a nineteenth aspect, a method for fabricating a stacked circuit may include providing a first die including a test input interface configured to receive test data signals and a source test clock signal, a test output interface configured to convey test responses, a first test signal path configured to transition the test data signals from the source test clock to a first balanced clock tree, to test the first die using the test data signals transitioned to the first balanced clock tree, to transition the test data signals and resulting test responses from the first balanced clock tree to a low-latency clock, and to convey resulting test responses to the test output interface, at least one first die-to-die output interface configured to convey to the second die the test data signals and a low-latency clock signal received from a low-latency clock path between the test input interface and the at least one first die-to-die output interface, and at least one first die-to-die input interface configured to receive test responses and the clock signal from the second die, providing the second die, and electrically coupling the first die and the second die together to form a 3D stacked circuit.
In a twentieth aspect, alone or in combination with the nineteenth aspect, providing the first die including the first test signal path may include providing the first die including a first test signal path comprising a first transition circuit configured to receive the test data signals from the test input interface and to transition the test data signals from the source test clock to the first balanced clock tree, where the first balanced clock tree has a higher latency compared to the source test clock, a first testing circuit configured to receive the test data signals from the first transition circuit and to test the first die utilizing the test data signals, and a second transition circuit configured to receive test data signals and resulting test responses from the first testing circuit to transition the test data signals and resulting test responses from the first balanced clock tree to the low-latency clock prior to conveying the resulting test responses to the test output interface and the test data signals to the at least one first die-to-die output interface.
In a twenty-first aspect, alone or in combination with one or more of the nineteenth or twentieth aspects, the at least one first die-to-die output interface may include at least one first tunable delay circuit for the test data signals, and a first tunable delay circuit for the low-latency clock signal, and the at least one first die-to-die input interface may include at least one second tunable delay circuit for the received resulting test responses, and a second tunable delay circuit for the received low-latency clock signal.
In a twenty-second aspect, alone or in combination with one or more of the nineteenth through twenty-first aspects, providing the second die may include providing the second die comprising at least one second die-to-die input interface electrically coupled to the at least one first die-to-die output interface of the first die, the at least one second die-to-die input interface configured to receive the test data signals and the low-latency clock signal from the first die, a second die-to-die output interface configured to convey test responses and the low-latency clock signal to the first die, and a second test signal path configured to transition the test data signals from the low-latency clock to a second balanced clock tree, to test the second die using the test data signals transitioned to the second balanced clock tree, to transition the resulting test responses from the second balanced clock tree back to the low-latency clock, and to convey resulting test responses to the second die-to-die output interface.
In a twenty-third aspect, alone or in combination with one or more of the nineteenth through twenty-second aspects, the second test signal path may include a third transition circuit configured to receive the test data signals from the at least one second die-to-die input interface and to transition the test data signals from the low-latency clock to the second balanced clock tree, a second testing circuit configured to receive the test data signals from the third transition circuit and to test the second die utilizing the test data signals, and a fourth transition circuit configured to receive resulting test responses from the second testing circuit to transition the resulting test responses from the second balanced clock tree to the low-latency clock prior to conveying the resulting test responses to the second die-to-die output interface.
In a twenty-fourth aspect, alone or in combination with one or more of the nineteenth through twenty-third aspects, the at least one second die-to-die input interface may include at least one third tunable delay circuit for the received test data signals, and a third tunable delay circuit for the received low-latency clock signal, and the at least one second die-to-die output interface may include at least one fourth tunable delay circuit for the resulting test responses from the second die, and a fourth tunable delay circuit for the low-latency clock signal.
In a twenty-fifth aspect, alone or in combination with one or more of the nineteenth through twenty-fourth aspects, the first die-to-die output interface and the second die-to-die input interface may include a stitched scan chain to test signal connections for each test data signal path from the first die-to-die output interface to the second die-to-die input interface, and the first die-to-die input interface and the second die-to-die output interface may include a stitched scan chain to test signal connections for each test data signal path from the second die-to-die output interface to the first die-to-die input interface.
In a twenty-sixth aspect, a method operational on a stacked circuit may include receiving test data signals and a source test clock in a first die, transitioning the test data signals from the source test clock to a first balanced clock tree, employing the test data signals transitioned to the first balanced clock tree to test the first die, wherein the test of the first die results in first die test responses, transitioning the test data signals to a low-latency clock, conveying the low-latency clock and the test data signals transitioned to the low-latency clock to a second die stacked on the first die and electrically coupled to the first die, transitioning the test data signals in the second die from the test clock to a second balanced clock tree, employing the test data signals transitioned to the second balanced clock tree to test the second die, wherein the test of the second die results in second die test responses, transitioning the second die test responses to the low-latency clock, and conveying the second die test responses and low-latency clock from the second die to the first die.
In a twenty-seventh aspect, alone or in combination with the twenty-sixth aspect, the method may further include tuning at least one of the low-latency clock or the test data signals when conveying the low-latency clock and test data signals to the second die, and tuning at least one of the low-latency clock or the second die test responses when conveying the low-latency clock and the second die test responses to the first die.
In a twenty-eighth aspect, alone or in combination with one or more of the twenty-sixth or twenty-seventh aspects, the method may further include conveying transition values from the first die to the second die along at least one test data signal path, shifting the results of the transition values through a scan chain, and detecting a connection defect along the at least one test data signal path based on the results on the scan chain.
In a twenty-ninth aspect, alone or in combination with one or more of the twenty-sixth through twenty-eighth aspects, conveying the test data signals transitioned to the low-latency clock to the second die may include conveying the test data signals transitioned to the low-latency clock to the second die on a plurality of test data signal paths forming a data bus.
In a thirtieth aspect, alone or in combination with one or more of the twenty-sixth through twenty-ninth aspects, conveying the second die test responses to the first die may include conveying the second die test responses to the first die on a plurality of test data signal paths forming a data bus.
One or more of the components, processes, features, and/or functions illustrated in
It is noted that the figures in the disclosure may represent actual representations and/or conceptual representations of various parts, components, objects, devices, packages, integrated devices, integrated circuits, and/or transistors. In some instances, the figures may not be to scale. In some instances, for purpose of clarity, not all components and/or parts may be shown. In some instances, the position, the location, the sizes, and/or the shapes of various parts and/or components in the figures may be exemplary. In some implementations, various components and/or parts in the figures may be optional.
The word “exemplary” is used herein to mean “serving as an example, instance, or illustration.” Any implementation or aspect described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other aspects of the disclosure. Likewise, the term “aspects” does not require that all aspects of the disclosure include the discussed feature, advantage or mode of operation. The term “coupled” is used herein to refer to the direct or indirect coupling between two objects. For example, if object A physically touches object B, and object B touches object C, then objects A and C may still be considered coupled to one another—even if they do not directly physically touch each other. The term “electrically coupled” may mean that two objects are directly or indirectly coupled together such that an electrical current (e.g., signal, power, ground) may travel between the two objects. Two objects that are electrically coupled may or may not have an electrical current traveling between the two objects. The term “encapsulating” means that the object may partially encapsulate or completely encapsulate another object. It is further noted that the term “over” as used in the present application in the context of one component located over another component, may be used to mean a component that is on another component and/or in another component (e.g., on a surface of a component or embedded in a component). Thus, for example, a first component that is over the second component may mean that (1) the first component is over the second component, but not directly touching the second component, (2) the first component is on (e.g., on a surface of) the second component, and/or (3) the first component is in (e.g., embedded in) the second component. The term “about ‘value X’”, or “approximately value X”, as used in the disclosure means within 10 percent of the ‘value X’. For example, a value of about 1 or approximately 1, would mean a value in a range of 0.9-1.1.
In some implementations, an interconnect is an element or component of a device or package that allows or facilitates an electrical connection between two points, elements and/or components. In some implementations, an interconnect may include a trace, a via, a pad, a pillar, a redistribution metal layer, and/or an under bump metallization (UBM) layer. An interconnect may include one or more metal components (e.g., seed layer+metal layer). In some implementations, an interconnect is an electrically conductive material that may be configured to provide an electrical path for a current (e.g., a data signal, ground or power). An interconnect may be part of a circuit. An interconnect may include more than one element or component. An interconnect may be defined by one or more interconnects. Different implementations may use similar or different processes to form the interconnects. In some implementations, a chemical vapor deposition (CVD) process and/or a physical vapor deposition (PVD) process for forming the interconnects. For example, a sputtering process, a spray coating, and/or a plating process may be used to form the interconnects.
Also, it is noted that various disclosures contained herein may be described as a process that is depicted as a flowchart, a flow diagram, a structure diagram, or a block diagram. Although a flowchart may describe the operations as a sequential process, many of the operations can be performed in parallel or concurrently. In addition, the order of the operations may be re-arranged. A process is terminated when its operations are completed.
The various features associate with the examples described herein and shown in the accompanying drawings can be implemented in different examples and implementations without departing from the scope of the present disclosure. Therefore, although certain specific constructions and arrangements have been described and shown in the accompanying drawings, such embodiments are merely illustrative and not restrictive of the scope of the disclosure, since various other additions and modifications to, and deletions from, the described embodiments will be apparent to one of ordinary skill in the art. Thus, the scope of the disclosure is only determined by the literal language, and legal equivalents, of the claims which follow.