Luk & Mowry, “Cooperative Prefetching: Compiler and Hardware Support for Effective Instruction Prefetching in Modern Processors,” Proceedings of the 31st Annual ACM/IEEE Int'l Symposium on Microarchitecture, Dallas, Texas, USA, 1998, pp. 182-194.* |
Rajiv Gupta, “Code Optimization as a Side Effect of Instruction Scheduling,” IEEE 1997, pp. 370-377.* |
Kennedy & Roth, “Context Optimization for SIMD Execution,” IEEE Aug. 1994, pp. 445-453.* |
T. Ball et al., “Efficient Path Profiling,” Proceedings of the 29th Annual IEEE/ACM International Symposium on Microarchitecture—MICRO-29, Dec. 2-4, 1996, Paris, France, pp. 46-57 (Dec. 1996). |
G. Ammons et al., “Exploiting Hardware Performance Counters with Flow and Context Sensitive Profiling,” Proceedings of the 1997 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), vol. 32, No. 5, pp. 85-96 (Jun. 1997). |