Bianchini, R., Jr., Buskens, R., "An Adaptive Distributed System-Level Diagnosis Algorithm and Its Implementation," Proc. of 21st IEEE Conf. on Fault Tolerant Computing Systems (FTCS), pp. 222-229, Jul. 1991. |
Huang, Y., Kintala, C., "Software Implemented Fault Tolerance: Technologies and Experience," Proc. of 23d IEEE Conf. on Fault Tolerant Computing Systems (FTCS), pp. 2-9, Jun. 22, 1993. |
Koo, R., Toueg, S., "Checkpointing and Rollback-Recovery for Distributed Systems," IEEE Trans. Software Eng., vol. SE-13, No. 1, pp. 23-31, Jan. 1987. |
Wang, Y.-M., Fuchs, W.K., "Lazy Checkpoint Coordination for Bounding Rollback Propagation," Proc. IEEE Symposium Reliable Distributed Systems, pp. 78-85, Oct. 1993. |
Wang, Y.-M., et al., "Progressive Retry for Software Error Recovery in Distributed Systems," Proc. of 23d IEEE Conf. on Fault-Tolerant Systems (FTCS), pp. 138-144, Jun. 22, 1993. |