Claims
- 1. A method for preserving error data in a computing platform, the method comprising:
detecting an error on the platform; collecting error data; storing the error data in volatile storage; and transferring the error data from the volatile storage to a target component with access to non-volatile storage.
- 2. The method of claim 1, further comprising:
preventing access to portions of the volatile storage used to store the platform error data until the platform error data has been stored in non-volatile storage; receiving a signal indicating the target component has stored the platform error data in non-volatile storage; and in response to receiving the signal, allowing access to the portions of volatile storage used to store the platform error data.
- 3. The method of claim 1, wherein transferring the error data to a target component with access to non-volatile storage comprises transferring the error data to a hardware management console.
- 4. The method of claim 1, wherein transferring the error data to a target component with access to non-volatile storage comprises transferring the error data to a logical partition.
- 5. The method of claim 4, further comprising rebooting the logical partition, in response to determining the logical partition is not operating properly.
- 6. The method of claim 5, further comprising rebooting a system firmware component that manages the logical partition, in response to determining the system firmware component is not operating properly.
- 7. The method of claim 1, further comprising selecting the target component.
- 8. The method of claim 7, wherein selecting the target component comprises:
determining if an out-band component with access to non-volatile storage is available, the out-band component capable of receiving the platform error data in the absence of a functioning operating system; and if so, selecting the out-band component as the target component.
- 9. The method of claim 1, wherein collecting error data comprises collecting hardware and software state information.
- 10. A computer-readable medium containing a program for preserving platform data in a computing platform which, when executed by a processor, performs operations comprising:
collecting platform data; storing the platform data in volatile storage; and transferring the platform data from the volatile storage to a target component with access to non-volatile storage.
- 11. The computer-readable medium of claim 10, wherein the operations further comprise:
detecting an error on the platform; and collecting the platform data in response to detecting the error.
- 12. The computer-readable medium of claim 10, wherein the operations further comprise:
determining if an out-band component with access to non-volatile storage is available, the out-band component capable of receiving the platform error data in the absence of a functioning operating system; and if so, selecting the out-band component as the target component.
- 13. The computer-readable medium of claim 12, wherein the operations further comprise:
selecting a logical partition as the target component, if an out-band component is not available.
- 14. The computer-readable medium of claim 13, wherein the operations further comprise:
determining if the logical partition is properly operating; and if not, rebooting the logical partition.
- 15. A computing platform, comprising:
at least one system processor; volatile storage; and a service processor configured to, in response to detecting the occurrence of one or more platform errors, collect platform error data, store the platform error data to the volatile storage, and transfer the platform error data to a target component with access to non-volatile storage.
- 16. The computing platform of claim 15, wherein the service processor is capable of starting and stopping the at least one system processor.
- 17. The computing platform of claim 15, wherein the service processor is further configured to prevent access to portions of the volatile storage used for storing the platform error data until receiving a signal from the target component indicating the platform error data has been stored to the non-volatile storage.
- 18. A logically partitioned computer system, comprising:
one or more logical partitions, each having an associated operating system with access to non-volatile storage; a partition manager configured to create and manage the one or more logical partitions; volatile storage; and a service processor configured to, in response to detecting the occurrence of one or more platform errors, collect platform error data, store the platform error data to the volatile storage, and transfer the platform error data from the volatile storage to a target component with access to non-volatile storage.
- 19. The logically partitioned computer system of claim 18, wherein the target component is configured to:
receive the platform error data from the service processor; store the platform error data to non-volatile storage; and generate a signal to indicate to the service processor the platform error data has been stored in non-volatile storage.
- 20. The logically partitioned computer system of claim 18, wherein the target component is a hardware management console in communication with the service processor.
- 21. The logically partitioned computer system of claim 18, wherein:
the target component is a logical partition; and the service processor is configured to transfer the platform error data to the logical partition via the partition manager.
- 22. The logically partitioned computer system of claim 18, wherein the service processor is configured to:
determine if an out-band component with access to non-volatile storage is available, the out-band component capable of receiving the platform error data from the service processor whether or not the partition manager is not operating properly; and if so, selecting the out-band component as the target component.
- 23. The logically partitioned computer system of claim 22, wherein:
the service processor is configured to select one of the logical partitions as the target component if the out-band component is not available; and transfer the platform error data to the selected logical partition via the partition manager.
- 24. The logically partitioned computer system of claim 23, wherein the partition manager is configured to, if a logical partition is selected as the target component:
determine if the selected logical partition is operating properly; and if not, reboot the selected logical partition.
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] The present application is related to U.S. patent application Ser. No. 09/672,043, filed Sep. 29, 2000, entitled “Technique for Configuring Processors in System With Logical Partitions”; U.S. patent application Ser. No. 09/346,206, filed Jul. 1, 1999, entitled “apparatus for Supporting a Logically Partitioned Computer System”; U.S. Pat. No. 6,467,007, entitled “Processor Reset Generated Via Memory Access Interrupt”; U.S. Pat. No. 5,659,786, entitled “System and Method for Dynamically Performing Resource Reconfiguration in a Logically Partitioned Data Processing System” and U.S. Pat. No. 4,843,541, entitled “Logical Resource Partitioning of a Data Processing”, which are herein incorporated by reference.