The present disclosure relates to the field of data processing. More particularly it relates to self-testing in a processor core to identify faults.
A processor core may be provided with the capability to identify faults which develop in the processor core over time. One approach to self-testing is to provide two processor cores which operate in lock-step, so that differences in the output of one core can be identified by reference to the output of the other. Another approach is software-based, which may be preferred over the dual-core lock-step approach because of the reduced area and power savings associated therewith. However, the extent and complexity of contemporary processor cores can mean that such software itself may need to be extensive and complex. Furthermore, a balance generally needs to be stuck between frequently executing the self-test software and thus quickly identifying faults in the core, and accepting greater disruption that the execution of this self-test software causes. The self-test software not only temporarily puts the “real” data processing of the core on hold, but further the result of executing the self-test software is then typically written out to memory. Depending on the context in which the processor core is implemented, a relatively high “fault grading” capability may be required of the self-test functionality. For example, it may be required that a fault grading capability of greater than 90% is provided (i.e. the self-testing is capable of identifying at least 90% of the possible faults which could occur). Indeed in some safety critical implementations the fault grading capability may be required to be greater than 99%. In order to provide such fault grading capability using self-test software, a great deal of effort may need to be invested in order to write and iteratively optimize such software, and for this reason when a very high fault grading capability is required the dual-core lock-step approach may be preferred.
In at least some embodiments the present techniques provide an apparatus comprising processor core circuitry to perform data processing operations by executing data processing instructions; self-test control circuitry to cause the processor core circuitry to temporarily switch from a first state of executing the data processing instructions to a second state of executing a self-test sequence of instructions, before returning to the first state of executing the data processing instructions without a reboot of the processor core circuitry being required; and self-test support circuitry, wherein the processor core circuitry is responsive to the self-test sequence of instructions to cause an export of at least one self-test data item via the self-test support circuitry to the self-test control circuitry.
In at least some embodiments the present techniques provide a method of operating a data processing apparatus, comprising the steps of performing data processing operations in processor core circuitry in a first state by executing data processing instructions; signalling from self-test control circuitry to cause the processor core circuitry to temporarily switch from the first state to a second state of executing a self-test sequence of instructions, before returning to the first state of executing the data processing instructions without a reboot of the processor core circuitry being required; and causing an export of at least one self-test data item via self-test support circuitry in the processor core circuitry to the self-test control circuitry.
In at least some embodiments the present techniques provide an apparatus comprising means for performing data processing operations by executing data processing instructions; means for controlling a self-test process comprising causing the means for performing data processing operations to temporarily switch from a first state of executing the data processing instructions to a second state of executing a self-test sequence of instructions, before returning to the first state of executing the data processing instructions without a reboot of the means for performing data processing operations being required; and means for supporting the self-test process, wherein the means for performing data processing operations is responsive to the self-test sequence of instructions to cause an export of at least one self-test data item via the means for supporting the self-test process to the means for controlling the self-test process.
The present techniques will be described further, by way of example only, with reference to embodiments thereof as illustrated in the accompanying drawings, in which:
At least some embodiments provide an apparatus comprising processor core circuitry to perform data processing operations by executing data processing instructions; self-test control circuitry to cause the processor core circuitry to temporarily switch from a first state of executing the data processing instructions to a second state of executing a self-test sequence of instructions, before returning to the first state of executing the data processing instructions without a reboot of the processor core circuitry being required; and self-test support circuitry, wherein the processor core circuitry is responsive to the self-test sequence of instructions to cause an export of at least one self-test data item via the self-test support circuitry to the self-test control circuitry.
The present techniques provide an approach to self-testing which combines both software and hardware aspects. A self-test controller (hardware monitor) is provided externally to the processor core which administers the self-testing. This hardware monitor could be provided in a number of different ways, but may for example be provided as a peripheral device to core (CPU) to which commands are issued with respect to the self-testing that needs to be carried out. Additionally, self-test support circuitry is provided via which self-test data items can be exported to the self-test control circuitry. This means that the self-test data items are provided to the self-test control circuitry in a very direct manner. This is to be contrasted to prior art software-based approaches to self-testing in which the exporting of self-test data items typically involves those self-test data items being written out to memory (for example in the form of a cyclical redundancy check (CRC) value) which requires the self-test software to control many aspects of the operation of the core, for example involving additional instructions (such as store instructions) to propagate this data out through the memory system to the memory. In consequence, the present techniques provide that the self-test data items are more directly observable, despite coming from potentially “deep” within the core. Additionally, the present techniques do not require that the processor core is put into a special self-testing state in order to perform its self-testing and therefore the processor core can readily switch between its usual data processing and the self-testing by means of its normal context switching mechanisms. Moreover following a period of self-testing, the processor core can return quickly to executing (regular) data processing instructions without needing to reboot (reset) itself. Fast context switching between regular data processing and self-testing is therefore supported.
In some embodiments the processor core circuitry is capable of completing the export of the at least one self-test data item after returning to the first state of executing the data processing instructions. The provision of the self-test support circuitry, via which the self-test data items are exported to the self-test control circuitry, provides an independent path, in particular independent to the memory interface/system of the processor core and thus there is no need after a period of self-test for any “clearance” of the self-test data items through a particular stage of the memory system before usual execution of data processing instructions (in the first state) can resume.
In dependence on the particular self-test data items which are to be exported from the processor core to the self-test control circuitry, and therefore where in the processor core the self-test data items originate, the self-test support circuitry may take a variety of forms. In one embodiment the self-test support circuitry comprises a dedicated self-test data path from the processor core circuitry to the self-test control circuitry. Efficient and independent exporting of the self-test data items to the self-test support circuitry is thus supported.
In some embodiments the self-test test support circuitry comprises instruction type indication circuitry and the at least one self-test data item comprises at least one instruction type indication provided by the instruction type indication circuitry, wherein the self-test control circuitry is responsive to reception of the at least one instruction type indication to maintain an instruction type count, and wherein the self-test control circuitry is capable of comparing the instruction type count to an expected instruction type count. Accordingly, the self-test control circuitry can thus be configured to monitor instruction types of the data processing instructions being executed by the processor core. Further, the instruction type represents a signal within the processor core which is generally already present (in the sense that a processor core without self-test capability would typically have this signal) and therefore the additional self-test support circuitry required to enable the export of an instruction type indication is modest. This self-test control circuitry can maintain an instruction type count, for example counting instances of one particular instruction type (although counts of more than one instruction type are also contemplated) and then compares this count to an expected instruction type count (on the basis of the known self-test sequence of instructions which was executed). Where a difference in the instruction type count and the expectation occurs this is indicative of a fault in the core prior to the point at which the instruction type indication has been “picked out of the core. Note that there is no need for there to be any knowledge of how the processor core is structured for this technique to work, meaning that these monitoring techniques can be applied to a great range of processor core circuitry and indeed that laborious bespoke configuration of the self-testing capability for the specific processor core is not needed). For example, where a system designer has flexibility in the particular configuration and layout of a processor core, there is no need for the self-test techniques to be reconfigured, as could otherwise often be the case for a purely software-based approach.
The instruction type indication circuitry, providing the instruction type indication, may be provided at a range of locations within the processor core circuitry. For example, in some embodiments the processor core circuitry comprises instruction fetch circuitry and the instruction type indication circuitry is comprised within the instruction fetch circuitry. For example, the instruction type may be used in the instruction fetch circuitry to determine how a given instruction is handled in parts of the processor core circuitry which follow the instruction fetch circuitry. Just one example of this is the situation where the processor core circuitry comprises multiple execution pipelines and issuance of an instruction to one of those execution pipelines is dependent on the instruction type, since each execution pipeline may have its own specialisms.
Indeed, the present techniques recognise that, for the purposes of self-testing, the instruction type indication may be maintained to a later stage in the pipeline (even though it may not explicitly be needed by those later parts of a pipeline), such that it can be used as an indicator within a self-test context. The instruction type indication can be exported from the processor core circuitry at a number of points along the execution pipeline, thus providing a mechanism by which a finer granularity of identification of the location of a fault is supported.
As mentioned above, the instruction type indication may be exported from a range of positions within the processor core circuitry, and in some embodiments the processor core circuitry comprises instruction retirement circuitry and the instruction type indication circuitry is comprised within the instruction retirement circuitry. Accordingly, although the instruction type indication may not be required to be maintained as far as the instruction retirement circuitry (i.e. essentially the end of execution pipeline), the maintenance of this information to this point within the execution pipeline means that the self-testing can identify a fault all the way up to this point in the execution pipeline.
The self-test support circuitry can also be implemented in other ways within the processor core circuitry. For example in some embodiments the processor core circuitry comprises memory transaction buffering circuitry temporarily to hold memory transaction requests before the memory transaction requests are issued to a memory system, wherein the self-test support circuitry comprises memory transaction request indication circuitry and the at least one self-test data item comprises at least one memory transaction request indication provided by the memory transaction request indication circuitry, wherein the self-test control circuitry is responsive to reception of the at least one memory transaction request indication to maintain a memory transaction request count, and wherein the self-test control circuitry is capable of comparing the memory transaction request count to an expected memory transaction request count. Indications of memory transactions requests thus represent another example of the self-test data items which may be exported to the self-test control circuitry, and the self-test support circuitry in such examples comprises memory transaction requests indication circuitry, for example forming part of a bus interface unit and/or a store buffer. The self-test control circuitry can identify faults in the processor core circuitry related to memory transactions by comparing the number of memory transaction requests seen (counted) to an expected number.
In some examples the self-test support circuitry comprises memory transaction request indication circuitry before the memory transaction buffering circuitry and after the memory transaction buffering circuitry, and the at least one self-test data item comprises memory transaction requests indications from before and after the memory transaction buffering circuitry. Accordingly, not only can the number of memory transaction requests be checked to be correct both before and after the buffering circuitry, but for example where the buffer circuitry has a configuration in which memory transaction requests are merged into the same entry in the buffer when they relate to the same or similar (small offset) addresses, monitoring the number of memory transaction requests before and after the buffer can identify if this process is happening correctly or if a fault has developed.
Memory transaction buffering circuitry, for example a store buffer, may be configured to continually drain its entries as soon as this is possible (i.e. to pass those requests out through the memory system to an external memory). Indeed the memory system itself may apply back pressure to cause the store buffer to drain its entries as soon as possible. This may cause difficulties in terms of monitoring numbers of memory transaction requests within the memory transaction buffering circuitry. To address this the present techniques in some examples provide that the self-test support circuitry comprises memory transaction blocking circuitry to block issuance of the memory transaction requests to the memory system in dependence on a blocking signal from the self-test control circuitry. This ability to block issuance of the buffered memory transaction request thus enables a stable state of the memory transaction buffering circuitry temporarily to be established, to facilitate memory transaction request counting.
Criteria for asserting and releasing the blocking signal may be variously configured but in some examples the self-test control circuitry is capable of asserting the blocking signal until an indication is received from the memory transaction buffering circuitry that the memory transaction buffering circuitry is full. This not only allows a more thorough testing of the memory transaction buffering circuitry, but also presents an efficient mechanism for determining when to release the blocking signal.
There are various ways in which the memory transaction buffering circuitry may be determined to be full, but in some examples this is done by monitoring the number of memory transaction requests which enter the memory transaction buffering circuitry, where it is known how many memory transaction requests the buffering circuitry can hold.
Nevertheless the present techniques recognise, as mentioned above, that the memory transaction buffering circuitry may merge memory transaction requests into a single entry and accordingly there may not be a simple correspondence between the number of memory transaction requests which the buffering circuitry receives and the number of entries which are occupied as a result. Hence in some embodiments the indication that the memory transaction buffering circuitry is full comprises a full buffer signal generated by the memory transaction buffering circuitry. Thus, when all entries in the buffer are occupied the buffer generates the full buffer signal.
The present techniques further recognise however, that when seeking to fill up the memory transaction buffering circuitry deadlock situations may occur which can prevent the memory transaction buffering circuits from reaching that full status. For example where a fault in an address register has caused a change of address, the buffering circuitry may be unable to add a transaction request to the required entry because the entry to be used for that address is (erroneously) fully occupied and the system then hangs, waiting for that entry to clear (which it can't do due to the assertion of the blocking signal). To address this the present techniques provides some examples of the self-test control circuitry with timer circuitry and the self-test control circuitry is responsive to an elapsed period signal from the timer circuitry to assert a time out error signal. For example, when the blocking signal is asserted and the memory transaction buffer circuitry begins filling the timer may be started and once the buffer and circuitry is full the timer is cleared before the elapsed period. If the elapsed period is reached this is an indication that a deadlock may have occurred and an error can be notified in this situation.
The self-test control circuitry may cause the processor circuitry to switch between the first and second states in a variety of ways. In some examples the self-test control circuitry is capable of causing the processor core circuitry to operate in the second state of executing the self-test sequence of instructions for a predetermined time period before returning to the first state. The predetermined time period can thus be set such that period of running of the self-test software does not significantly affect the usual data processing operation of the processor core circuitry. When in the second state (executing the self-test sequence of instructions) interrupts to the processor core may be disabled and if the processor core is to nevertheless respond quickly to an interrupt (as may be essential in some implementations) then a limited run time of the self-test software is necessary.
Another mechanism which may be provided to support this limited disruption of the self-test functionality to the normal operation of the core is by applying priorities to the software executed. For example, in some embodiments the processor core circuitry is capable of switching between data processing operations for more than one application defined by corresponding sets of data processing instructions, wherein each application has a predefined priority and the processor core circuitry is responsive to an execution request for an application having a higher predefined priority than a predefined priority of a currently executing application to suspend the currently executing application and to execute the application having the higher predefined priority, and wherein the self-test sequence of instructions is defined to have a predefined priority lower than the highest priority available in the apparatus. This configuration then ensures that the self-test sequence of instructions cannot dominate and may always be ended by a higher priority application, thus providing the system designer, in particular the software author, with a mechanism to ensure that critical applications can always be assured of execution, prioritised over the self-test application.
The self-test control circuitry itself may be controlled in a variety of ways, but in some examples one or more commands are issued by the processor core to configure the self-test control circuitry and/or cause it to perform particular tasks. For example, in some embodiments the self-test control circuitry is responsive to at least one received command by performing at least one of configuring itself and the processor core circuitry for operation in the second state; causing the processor core circuitry to switch to the second state; and causing the processor core circuitry to switch back to the first state and determining a self-test result from the at least one self-test data item received from the processor core circuitry.
The result of carrying out the self-testing, in particular when it is identified that a difference in the self-test data items received by the self-test control circuitry to an expectation has occurred, may be handled in a variety of ways. In some examples the apparatus further comprises error management control circuitry, wherein the self-test control circuitry is capable of signalling a self-test result after the processor core circuitry has completed executing the self-test sequence of instructions, and the error management control circuitry is responsive to the self-test result to initiate an error response. The error management circuitry may respond to such fault identification in a number of ways, but in some examples the error response comprises disabling at least one component of the apparatus. The error management circuitry may also generate a fault grading on the basis of the self-test result.
Some examples provide a method of operating a data processing apparatus, comprising the steps of performing data processing operations in processor core circuitry in a first state by executing data processing instructions; signalling from self-test control circuitry to cause the processor core circuitry to temporarily switch from the first state to a second state of executing a self-test sequence of instructions, before returning to the first state of executing the data processing instructions without a reboot of the processor core circuitry being required; and causing an export of at least one self-test data item via self-test support circuitry in the processor core circuitry to the self-test control circuitry.
Some examples provide an apparatus comprising means for performing data processing operations by executing data processing instructions; means for controlling a self-test process comprising causing the means for performing data processing operations to temporarily switch from a first state of executing the data processing instructions to a second state of executing a self-test sequence of instructions, before returning to the first state of executing the data processing instructions without a reboot of the means for performing data processing operations being required; and means for supporting the self-test process, wherein the means for performing data processing operations is responsive to the self-test sequence of instructions to cause an export of at least one self-test data item via the means for supporting the self-test process to the means for controlling the self-test process.
Some particular example embodiments are now described with reference to the figures.
Under normal usage a load issued by the load/store unit 128 passes via the BIU 142 to the L2 memory (and possibly beyond), whilst a store issued by the load/store unit 128 is temporarily buffered in the store unit (STU) 144 before passing via the BIU 142 to the L2 memory. A store memory transaction request in the store buffer 144 comprises an address, and the relevant data to be stored at that address, and the address is one self-test data item which can be exported to the SBIST controller. In the example shown in
In brief overall summary an apparatus and method for processor core self-testing are disclosed. The apparatus comprises processor core circuitry to perform data processing operations by executing data processing instructions. Separate self-test control circuitry causes the processor core circuitry to temporarily switch from a first state of executing the data processing instructions to a second state of executing a self-test sequence of instructions, before returning to the first state of executing the data processing instructions without a reboot of the processor core circuitry being required. There is also self-test support circuitry, wherein the processor core circuitry is responsive to the self-test sequence of instructions to cause an export of at least one self-test data item via the self-test support circuitry to the self-test control circuitry.
In the present application, the words “configured to . . . ” or “arranged to” are used to mean that an element of an apparatus has a configuration able to carry out the defined operation. In this context, a “configuration” means an arrangement or manner of interconnection of hardware or software. For example, the apparatus may have dedicated hardware which provides the defined operation, or a processor or other processing device may be programmed to perform the function. “Configured to” or “arranged to” does not imply that the apparatus element needs to be changed in any way in order to provide the defined operation.
Although illustrative embodiments have been described in detail herein with reference to the accompanying drawings, it is to be understood that the invention is not limited to those precise embodiments, and that various changes, additions and modifications can be effected therein by one skilled in the art without departing from the scope of the invention as defined by the appended claims. For example, various combinations of the features of the dependent claims could be made with the features of the independent claims without departing from the scope of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
1606107.9 | Apr 2016 | GB | national |