Claims
- 1. A computer system comprising:
a first controller for implementing a first function, wherein said first function comprises part of a checkpoint operation and wherein said checkpoint operation comprises a series of contiguous checkpoint cycles; and a second controller coupled to said first controller, said second controller for implementing a second function, wherein said second function comprises a write back operation from a first memory location to a second memory location and wherein said write back operation occurs before a checkpoint cycle ends; wherein information already at said second memory location is selectively written back to a third memory location.
- 2. The computer system of claim 1 comprising a processor node coupled to said first controller, wherein said first controller has access to state information in said processor node.
- 3. The computer system of claim 1 comprising multiple processor nodes coupled to said first controller, wherein said first controller has access to state information in each of said processor nodes.
- 4. The computer system of claim 1 wherein said first controller causes a processor to write back dirty cache entries to said second memory location and to store state information for said processor in a fourth memory location.
- 5. The computer system of claim 1 wherein said first controller causes said third memory location to be emptied upon completion of a checkpoint cycle.
- 6. The computer system of claim 1 wherein said first controller monitors free space in a memory comprising said third memory location and initiates a new checkpoint cycle when said free space is below a specified threshold.
- 7. The computer system of claim 1 wherein said first controller monitors a condition and initiates a new checkpoint cycle when said condition is satisfied, wherein said condition is based on number of processor cycles, number of bus cycles, number of cycles performed by a component of said computer system, number of transactions performed by a component of said computer system, number of instructions executed by a component of said computer system, number of write back operations, an internal signal, or an external signal.
- 8. The computer system of claim 1 wherein said second controller identifies candidates for said write back operation.
- 9. The computer system of claim 1 wherein said second controller maintains a list comprising a plurality of pointers, each pointer representing a candidate for said write back operation, wherein said write back operation is initiated by said second controller selecting one of said pointers.
- 10. The computer system of claim 9 wherein a flag is associated with said list, said flag for indicating whether said list includes a pointer for each candidate for said write back operation.
- 11. The computer system of claim 1 wherein said write back operation occurs before a computational phase of said checkpoint cycle ends.
- 12. A checkpointing method comprising:
performing a first function comprising a checkpoint operation, wherein said checkpoint operation comprises a series of contiguous checkpoint cycles; performing a second function comprising a write back operation, wherein said write back operation comprises a write back of information from a first memory location to a second memory location and wherein said write back operation is performed before a checkpoint cycle ends; and writing information already at said second memory location to a third memory location.
- 13. The method of claim 12 comprising:
accessing state information for a processor node.
- 14. The method of claim 12 wherein said first function comprises:
storing state information for a processor node in a fourth memory location.
- 15. The method of claim 12 wherein said first function comprises:
writing back dirty cache entries at a processor node to said second memory location.
- 16. The method of claim 12 wherein said first function comprises:
emptying said third memory location upon completion of a checkpoint cycle.
- 17. The method of claim 12 wherein said write back operation occurs before a computational phase of said checkpoint cycle ends.
- 18. The method of claim 12 wherein said second function comprises:
identifying candidates for said write back operation.
- 19. The method of claim 12 wherein said second function comprises:
maintaining a list comprising a plurality of pointers, wherein each pointer represents a candidate for said write back operation; and initiating said write back operation by selecting one of said pointers.
- 20. The method of claim 19 comprising:
setting a flag associated with said list, said flag for indicating whether said list includes a pointer for each candidate for said write back operation.
- 21. The method of claim 12 wherein said first function is executed using a first controller and said second function is executed using a second controller.
- 22. The method of claim 12 wherein said first function and said second function are executed using a single controller.
- 23. The method of claim 12 comprising:
monitoring free space in a memory comprising said third memory location; and initiating a new checkpoint cycle when said free space is below a specified threshold.
- 24. The method of claim 12 comprising:
monitoring a condition; and initiating a new checkpoint cycle when said condition is satisfied, wherein said condition is based on number of processor cycles, number of bus cycles, number of cycles performed by a component of said computer system, number of transactions performed by a component of said computer system, number of instructions executed by a component of said computer system, number of write back operations, an internal signal, or an external signal.
- 25. A computer-readable medium having computer-readable code stored thereon for causing a computer system to perform a checkpointing method comprising:
executing a first function comprising a checkpoint operation, wherein said checkpoint operation comprises a series of contiguous checkpoint cycles; executing a second function comprising a write back operation, wherein said write back operation comprises a write back of information from a first memory location to a second memory location and wherein said write back operation is performed before a checkpoint cycle ends; and recording information already at said second memory location in a third memory location.
- 26. The computer-usable medium of claim 25 wherein said computer-readable program code embodied therein causes a computer system to perform a checkpointing method comprising:
reading state information for a processor node.
- 27. The computer-usable medium of claim 25 wherein said computer-readable program code embodied therein causes a computer system to perform a checkpointing method comprising:
writing back dirty cache entries at a processor node to said second memory location.
- 28. The computer-usable medium of claim 25 wherein said computer-readable program code embodied therein causes a computer system to perform a checkpointing method comprising:
recording state information for a processor node in a fourth memory location.
- 29. The computer-usable medium of claim 25 wherein said computer-readable program code embodied therein causes a computer system to perform a checkpointing method comprising:
emptying said third memory location upon completion of a checkpoint cycle.
- 30. The computer-usable medium of claim 25 wherein said write back operation occurs before a computational phase of said checkpoint cycle ends.
- 31. The computer-usable medium of claim 25 wherein said computer-readable program code embodied therein causes a computer system to perform a checkpointing method comprising:
identifying candidates for said write back operation.
- 32. The computer-usable medium of claim 25 wherein said computer-readable program code embodied therein causes a computer system to perform a checkpointing method comprising:
generating a list comprising a plurality of pointers, wherein each pointer represents a candidate for said write back operation; and selecting one of said pointers to initiate said write back operation.
- 33. The computer-usable medium of claim 31 wherein said computer-readable program code embodied therein causes a computer system to perform a checkpointing method comprising:
associating a flag with said list; and setting said flag to a value to indicate whether said list includes a pointer for each candidate for said write back operation.
- 34. The computer-usable medium of claim 25 wherein said computer system comprises multiple processor nodes.
- 35. The computer-usable medium of claim 25 wherein said first function is executed using a first controller and said second function is executed using a second controller.
- 36. The computer-usable medium of claim 25 wherein said first function and said second function are executed using a single controller.
- 37. The computer-usable medium of claim 25 wherein said computer-readable program code embodied therein causes a computer system to perform a checkpointing method comprising:
completing a checkpoint cycle when free space in a memory comprising said third memory location is below a specified threshold.
- 38. The computer-usable medium of claim 25 wherein said computer-readable program code embodied therein causes a computer system to perform a checkpointing method comprising:
completing a checkpoint cycle when a condition is satisfied, wherein said condition is based on number of processor cycles, number of bus cycles, number of cycles performed by a component of said computer system, number of transactions performed by a component of said computer system, number of instructions executed by a component of said computer system, number of write back operations, an internal signal, or an external signal.
CROSS REFERENCE TO RELATED APPLICATION
[0001] This Application is a Continuation-in-Part of the co-pending, commonly owned US Patent Application, Attorney Docket No. HP-10015426, Ser. No. 09/952,994, filed Sep. 14, 2001, by Manohar K. Prabhu, and entitled “Preemptive Write Back Controller.”
Continuation in Parts (1)
|
Number |
Date |
Country |
Parent |
09952994 |
Sep 2001 |
US |
Child |
10106723 |
Mar 2002 |
US |