This application is a priority based on prior application No. JP 2004-341600, filed Nov. 26, 2004, in Japan.
1. Field of the Invention
The present invention relates to a processor forming a plurality of CPU cores on a piece of chip, its error analytical method and a program, and in particular, it relates to a processor for scanning out a scan chain which connects a plurality of error holding latches built into a plurality of CPU cores and collecting and analyzing error information, its error analytical method and program.
2. Description of the Related Arts
Although, in general, in a conventional processor, as shown in
According to such a chip multi processor, the LSI enhanced more in a degree of integration is effectively used, and further, an upgrade for every LSI chip is realized. Particularly, in case the chip multi processor is constructed with a pin compatibility of the LSI chip remained as it is, there is no change in the whole system, and by only replacing the LSI chip, which is mounted with the multi chip processor, a sharp upgrade can be realized, and therefore, great hopes are entertained of it in view of the cost performance.
Now, even in case the LSI chip is constructed to be the chip multi processor, the collection and analysis of an error by using the scan chain of an error holding latch, which is used in a conventional chip single processor, is required.
The JTAG circuit with the chip single processor of
an error collecting scan chain 116 connected and constituted by error holding latches 114.
That is, the error information collecting circuit of
As one of the scan chains designatable by the command resistor 106, an error collecting scan chain 116 connected and constituted by the error holding latches 114 is allotted for such a scan chain so as to allow the scan chain to hold factors which have caused various errors.
In case an error occurs, the error collecting scan chain 116 is scanned out, and error information is collected by an external error analytical instrument, and an bit showing the error occurrence within that information is extracted, and the most extreme source of the error occurrence is sought out, and adequate automatic degeneration and part replacement instructions are issued by a firmware program.
In the error analysis, the processing of searching the most extreme source of the error occurrence is realized by writing a dependency relation in a database. To be specific, when some errors occur, an error capable of propagation is written, and in case a plurality of error factors exist, by comparing them to the database, an attempt is made to search out a still more upstream error factor.
[Patent Document 1] Japanese Patent Application Laid-Open No. 2002-169787
However, in case error information is collected by providing a JTAG circuit for the chip multi processor shown in
Further, the chip multiple processor intends to improve the system performance by interchange with the conventional chip single processor, and therefore, requires a pin compatible with the conventional chip single processor, but, according to the technique of mounting the JTAG circuit two times, there arises a problem that this pin compatibility ends up collapsing. The pin compatibility may be performed not on a LSI level, but on a card module level.
Further, in case the chip multi processor degenerates either of the CPU cores 102-1 and 102-2 because of a production yield ratio, so is degenerated either one on condition that it is used as a chip single processor, and hence, it requires the pin compatible with the conventional chip single processor, but according to the technique of mounting the JTAG circuit two times, there is a problem that, even in this case, the pin compatibility ends up collapsing.
According to the present invention, there are provided a processor of a chip multiple constitution, its analytical method and program, which realize an error information collection to expand a boundary scan without impairing the pin compatible with the chip single processor.
The present invention provides a processor, and is characterized by comprising:
a plurality of CPU cores formed on a piece of chip;
a scan chain circuit (error collecting scan chain) connecting and constituting a plurality of error holding latches built in a plurality of CPU cores into a line of scan chain;
a plurality of mask circuits dividing an interior of the scan chain into error holding latch groups (CPU latch groups) corresponding to a plurality of CPUs and allowing latch content of the error holding latch group corresponding to a degenerated CPU core within a plurality of CPU cores to be masked at the test operation time; and
a scan control circuit (test access port controller) for scanning out the scan chain circuit at the error occurrence time so as to output and collect error information.
Here, the scan control circuit scans out the error information based on the designation of the scan chain by a command resistor.
Further, the processor of the present invention further forms a secondary cache on a chip, and in this case, the scan chain circuit connects and constitutes a plurality of error holding latches built in a plurality of CPU cores and the secondary cache into a line of scan chain,
wherein a plurality of masks circuits are provided for every error holding latch group corresponding to a plurality of CPU cores and the secondary cache in the scan chain, and allows the latch content of the error holding latch group corresponding to a degenerate portion within a plurality of CPU cores or the secondary cache to be masked at the test operating time.
Further, as a processor of the present invention, the scan chain circuit divides a plurality of error holding latches built in a plurality of CPU cores for every error level at the error information collecting time and connects and constitutes them into a line of scan chain,
wherein a plurality of mask circuits are provided for every error holding latch group corresponding to a plurality of CPU cores in the scan chains which are divided into error levels and connected and constituted, and allow the latch content of the error holding latch group of each error level corresponding to the degenerated CPU core within a plurality of CPU cores to be masked.
For example, a plurality of error holding latches are divided into a high level error and a low level error, and a scan chain circuit and a mask circuit are provided for every two error levels.
Further, the error holding latch comprises a data input terminal, a reset input terminal, a clock terminal, a data output terminal, a shift input terminal, a shift output terminal, and a shift clock terminal, and the mask circuit prohibits a clock input to the clock terminal by the input of a core separating signal, and at the same time, fix-inputs a rest signal to the reset terminal so as to mask the latch content.
Further, the error holding latch comprises a data input terminal, a reset input terminal, a clock terminal, a data output terminal, a shit input terminal, a shit output terminal and a shift clock terminal, and the mask circuit prohibits the data input for the data input terminal by the input of the core separating signal, so that the latch content may be masked.
The present invention provides an error analytical method of the processor, which forms a plurality of CPU cores on a piece of chip. This error analytical method of the processor according to the present invention is characterized by comprising:
a scan chain constituting step of connecting and constituting a plurality of error holding latches built in a plurality of CPU cores into a line of scan chain;
a masking step of dividing the interior of the scan chain into the error holding latch group corresponding to a plurality of CPU cores and allowing the latch content of the error holding latch group corresponding to the degenerated CPU core within a plurality of CPU cores to be masked at the test operating time; and
an error information collecting step of scanning out the scan chain at the error occurrence time and collecting error information.
The present invention provides a program executed by a computer, which constitutes an error analytical instrument of the processor forming a plurality of CPU cores on a piece of chip. The program according to this invention is characterized by allowing the computer to execute:
a scan chain constituting step of connecting and constituting a plurality of error holding latches built in a plurality of CPU cores into a line of scan chain;
a masking step of dividing the interior of the scan chain into the error holding latch group corresponding to a plurality of CPU cores and allowing the latch content of the error holding latch group corresponding to the degenerated CPU core within a plurality of CPU cores to be masked at the test operating time; and
an error information collecting step of scanning out the scan chain at the error occurrence time and collecting error information.
The details of the error analytical method and program according to the present invention are basically the same as the case of the processor according to the present invention.
According to the present invention, even when a processor is formed on a LSI chip in which a plurality of CPU cores are formed, by connecting and constituting a line of scan chain for the error holding latches within a plurality of CPU cores so as to perform a scanning-out at the error occurrence time, the pin compatible with the conventional chip single processor can be maintained even when an error collecting function by the JTAG is mounted, and by the replacement of the processor only, the upgrade of the whole system can be achieved without changing the system, and moreover, the error analysis of a plurality of CPU cores can be performed with the same resolution as conventional.
In case a portion of a plurality of CPU cores is separated due to degeneration, the mask circuit provided in the portion of the error collecting scan chain corresponding to the degenerated CPU core is allowed to be operated by a CPU core separating signal, so that the error holding latch of the degenerated portion is masked so as not to become a bite 1 indicating an error content, and even when the latch of the degenerated portion is contained in the error collecting scan chain, the bite of the generated portion is all taken as a normal bite by the masking process, and there is no particular processing required such as removing the error of the degenerated portion for the error information collected by performing the scanning-out of the scan chain, and the conventional error analysis can be applied as it is.
Further, in the cases where the CPU core is degenerated and used and where it is not degenerated but used, when a scan length of the scan chain for error collection is changed by using a bypass resistor, though a system change for the scanning-out at the error occurrence time is required, in the present invention, since the scan length is not changed for both of the cases where the CPU core is degenerated and used and where it is not degenerated but used, there is no need to change the system for collecting and analyzing the error information by the scanning-out.
The above and other objects, features, and advantages of the present invention will become more apparent from the following detailed description with reference to the drawings.
The LSI chip 10 is provided with pins 11-1 to 11-8 for subjecting the internal circuit 12 to an external connection, and the pins 11-1 to 11-8 are connected to the internal circuit 12. Although the pins 11-1 to 11-8 are shown as comprising eight pins for ease of explanation, an actual LIS chip 10 is adequately provided with more than that number of pins.
This LSI chip is mounted with a boundary scan test function corresponding to a JTAG by an IEEE 1149.1. A circuit unit corresponding to the JTAG in the LSI chip 10 is constituted by a boundary scan resistor 18, a command resistor 20, a bypass resistor 22, and a test access port controller (TAPC) 24.
Further, as the pins for the boundary scan test by the JTAG, a test data input pin (TD1 pin) 25, a test data output pin (TDO pin) 26, a test mode select pin (TMS pin) 28, a test lock pin (TCK pin) 30, and a test reset pin (TRST pin) 32 are provided. The boundary scan resistor 18 connects latches 18-1 to 18-8 provided between connection lines with the internal circuit 12 and the pins 11-1 to 11-8 to a line of chain (daisy chain) so as to be connected between the test data input pin 25 and the test data output pin 26, and the boundary scan resister 18 operates as a shift resistor constituted by the latches 18-1 to 18-8.
By inputting an adequate data to the latches 18-1 to 18-8 of the boundary scan resistor 18, a data can be outputted from any output pin of the corresponding pins 11-1 to 11-8, and a state of the input pin can be monitored.
The command resistor 20 reads the number of the command resistor and decodes it, and can allow the internal circuit 12 to perform various functions. Further, the command resistor 20 is allotted with the command resistor number usable by the user, and by allotting this command resistor number to the scan chain connecting a plurality of latches provided in the internal circuit 12 into a line, a specific scan chain is designated so that the scanning-in or the scanning-out can be performed. In the present invention, as to be clarified in the following explanation, a specific command resistor number usable by the user is allotted to the error collecting scan chain which is constituted by connecting the error holding latches provided in the internal circuit 12 to a line of scan chain in the command resistor 20, and the error information of the internal circuit 12 can be scanned out and collected by error information collection instructions from the outside.
The bypass resistor 22 provides a route for by-passing a data inputted from the test data input pin 25 to the test data output pin 26 in the shortest possible route. As for how to use this bypass resistor 22, in case a plurality of circuits are mounted in the internal circuit 12 of the LSI chip 10, since the boundary scan resistor 18 connects the latches 18-1 to 18-8 as a line of chain for a plurality of circuits, the latch portion of the boundary scan resistor 18 of a circuit portion requiring no test is used when making a bypass and the like.
The test access port controller 24 constitutes a state machine, which controls each resistor by a signal from the test mode selector pin 28 and a signal from the test clock pin 30.
Here, the signals for each pin for use of the JTAG will be described as follows. The test data input signal for the data input pin 25 is a signal which serial-inputs a command and a data to the internal circuit 12 as a test object, and is sampled by a rising edge of the test clock for the test clock pin 30.
The test data output signal from the test data output pin 26 is a signal which serial-outputs a data from the internal circuit 12 as a test object, and a change of this output value is performed by a falling edge of the test clock signal for the test clock pin 30. The test clock pin 30 supplies a clock to the internal circuit 12 which becomes a test object, and becomes an exclusive input capable of being used independent of the system clock peculiar to the internal circuit 12.
The signal for the test mode select pin 28 is a signal for controlling a test operation, and is sampled by a rising edge of the test clock, and this signal is decoded by the test access port controller 24.
The signal for the test reset pin 32 is a negative logic signal, which asynchronously initializes the test access port controller 24, and is used as an option.
A signal line from five pins from the JTAG aiming at the boundary scan test which is mounted on such LSI chip 10 is connected to a host 34, and executes a test operation, an error collection at the time of error occurrence, and an error analysis by the instruction from the host 34. The host 34 is provided with a test processing unit 36, an error information collecting unit 38, an error information analyzing unit 40, and an error analyzing data base 42.
The test processing unit 36 executes a preset test processing with the internal circuit 12 of the LSI chip 10 as a target. The error information collecting unit 38, in case an error occurs at the time of the test operation of the internal circuit 12, designates the scanning-out operation of the error collecting scan chain to be clarified by the following explanation by the output of the command resister number preallotted to the command resister 20, and by the scanning-out operation of the error collecting scan chain, the bit information held in each error holding latch which constitutes the chain is collected as error information.
The error information analyzing unit 40 extracts a bit in which an error occurs from the error information collected by the error information collecting unit 38, and searches an factor which is the most extreme source of the error occurrence with reference to the error analyzing data base 42, and performs an adequate automatic degeneration processing and instructs the replacement of parts or the like.
In
The command resistor number usable by the user is allotted to the general control scan chains 44-1 to 44-3 and the error collecting scan chain 48, and by setting the command resistor number peculiar to each scan chain to the command resistor 20, the scan chain corresponding to the command resistor number designated by the decoding of the command resistor 20 is selected, thereby performing the scanning-in or the scanning-out. For example, assuming that there exist command resistor numbers IR01 to IR04 as the command resistor number usable by the user, the command resistor number IR01 to IR03 are allotted to the general control scan chains 44-1 to 44-3, respectively, and, the command resistor number IR04 is allotted to the error collecting scan chain 48.
Each of the general control scan chains 44-1 to 44-3 connects latches for test or verification purpose provided in the CPU cores 14-1 and 14-2 of the internal circuit 12 and the secondary cache 16 to a line of chain, thereby constituting a shift resistor.
In the meantime, the error collecting scan chain 48 provides error holding latches 50-1 to 50-n in the CPU core 14-1 of
Hence, the error collecting scan chain 48 is divided into a CPU latch group 56-1 corresponding to the CPU core 14-1, a CPU latch group 56-2 corresponding to the CPU core 14-2, and a secondary cache latch group 56-3 corresponding to the secondary cache 16.
Further, for the error collecting scan chain 48, mask circuits 58-1, 58-2, and 58-3 are provided. For the mask circuits 58-1 to 58-3, core separating signals E1, E2 and E3 are supplied when becoming a degeneration object.
For example, in case the CPU core 14-1 of
As a specific example of the masking process for the error holding latches 50-1 to 50-n, the embodiment of the present invention performs either one of the following:
(1) a clock off control and a reset control of the latches, and
(2) a mask control of an error input.
Even in the case of the mask circuit 58-2 for the CPU latch group 56-2 including the error holding latches 52-1 to 52-n provided by corresponding to the CPU core 14-2 of
Further, similarly in the case of the mask circuit 58-3 of the secondary cache group 56-3 including the error holding latches 54-1 to 54-n provided in the second cache 16, the core separating signal E3 becomes effective when the secondary cache 16 becomes an degeneration object, so that the masking process for releasing the error holding function by the error holding latches 54-1 to 54-n is performed.
Here, as the memory element 64 used in the resistor file 62, a flip flop of a master slave constitution to mount a scan circuit is usually used. The scan circuit mounted on the memory element 64 connects the memory elements 64 which constitutes the resister file 62 in series in a route different from the normal operation, thereby constituting a shift resister which becomes the scan chain and operates only at a testing time.
In the case of the resister file 62, though the error collecting scan chain is constituted by taking all the memories 64 as the error holding latches, as the occasion demands, the error collecting scan chain may be constituted with any memory element taken as the error holding latch. In this way, by arranging the error holding latch at any given position in the interior of the CPU cores 14-1 and 14-2 and the secondary cache 16, and connecting and constituting a line of chain by connecting in order shift inputs and shift outputs for all the error holding latches at the error detection time, the error collecting scan chain 48 as shown in
In
the error holding latch 50 comprises a shift input SI and a shift output SO for the scanning-out of the error detection time, and further comprises a shift clock SCK.
In this way, for the error holding latch 50, a mask circuit 82 is provided for a supply line of the clock signal E2 for the clock CK. The mask circuit 82 is constituted by an AND gate 84 and an OR gate 85, the one input side of the AND gate 84 is added with a clock signal E2, and the other inversing input side is added with a core separating signal E1. Further, the one side of the OR gates 85 is inputted with a reset signal E3, and the other side of the OR gate 85 is inputted with the core separating signal E1.
By providing such a mask circuit 82, when an object core is degenerated, so that the core separating signal E1 becomes a bit [1], the AND gate 84 of the mask circuit 82 is put into a prohibited state, the supply of the clock signal E2 to the clock input CK of the error holding latch 50 is stopped. At the same time, the core separating signal E1 takes a reset signal for the reset input RS as [1] through the OR gate 85, thereby resetting the error holding latch 50.
By such a clock off control and a reset control, even when an error input signal (error bit [1]) E4 is given to a data input terminal D for the error holding latch 50, the error input signal E4 is not latched by the error holding latch 50, and the latch content is fixed to a normal bit [0] on a steady base.
Hence, the data input D and the data output Q are separated so as to be converted into a shift input SI and a shift output SO, and a holding bit of the error holding latch 50, which is read by the scanning-out operation performed by the supply of the shift clock input by the shift clock signal E6, is a normal bit [0], and can be prevented from being scanned out as an error bit [1].
In the embodiment of
In
By providing such a mask circuit 86, when the core separating signal E1 becomes a bit [1] due to degeneration of the core, and the AND gate 88 becomes a bit [0] by its inversing input and is put into a prohibited state, and the input of the error signal E4 to the error holding latch 50 is prohibited, and as a result, at the scanning out time in which the data input D and the data output Q in the error holding latch 50 are separated so as to be converted into the shift input SI and the shift output SO, a normal bit [0] is scanned out from the error holding latch 50 on a steady base.
In this way, any one of the separating signals E1 to E3 becomes effective, and the masking process of the error holding latch in the core which becomes the degeneration object of any one of CPU lath groups 56-1 and 56-2 or a scan cache group 56-3 is performed. In the meantime, at step S3, in case no degenerated core exists, the process at step S4 is skipped.
Next, at step S5, the command resister number of the error collecting scan chain 48 is designated, and the scanning-out thereof is instructed. This command resister number is transmitted to the command resister 20 of
Subsequently, at step S6, collection of the error information by the scanning-out is performed, and at step S7, the completion of the collection is determined, and a series of processes are completed. After the completion of this error information collecting process, the collected error information is delivered to the error information analyzing unit 40 provided in the host 34 of FEB 2, and an error bit is extracted from the error information, and with reference to the error analyzing data base 42, a cause of the most extreme source of the error occurrence is sought out, thereby performing an adequate automatic degeneration or issuing part replacement instruction.
In
In this case also, mask circuits 58-1 and 58-2 are provided by corresponding to the CPU latch groups 56-1 and 56-2, and by making either of the core separating signals E1 and E2 for the cores, which becomes the degeneration object, effective, a mask process is performed for fixing and holding a normal bit [0] on a steady base without allowing the error holding latch, which becomes the degeneration object, to hold an error bit [1]. A specific constitution of these mask circuits 58-1 and 58-2 takes either one of those illustrated in
In the embodiment of
Further, in the embodiment of
Further, the present invention provides a program for collecting error information by driving the error collecting scan chain with the mount circuit of the JTAG of the LSI chip 10 as an object by the host 34 of
The present invention includes adequate modifications without impairing its object and advantage, and moreover, is not subjected to the limit by the numerical values shown by the above described embodiments.
Here, summing up the features of the present invention allows the following claims to be appended.
Number | Date | Country | Kind |
---|---|---|---|
2004-341600 | Nov 2004 | JP | national |