Waters, "Automatic transformation of series expressions into loops", ACM Trans. Lang. & Syst. vol. 13, No. 1, pp. 52-96, Jan. 1991. |
Carr et al., "Compiler optimizations for improving data locality", ASPLOS-ACM, pp. 252-262, Oct. 1994. |
Carr et al., "Improving the ratio of memory operations to floating point operations in loops", ACM Trans. Prog. Lang. & Syst., vol. 16, No. 6, pp. 1768-1810, Nov. 1994. |
Wei Li, Compiler cache optimizations for banded matrix problems, ICS 95, ACM, pp. 21-30, 1995. |
Li et al., Exploiting cache affinity in software cache coherence, ICS 94, ACM, Jul. 1997, pp. 264-273, 1994. |
Bhattacharya et al., Performance analysis and optimization of schedules for conditional and loop intensive specifications, DAC 94, Proceedings of 31st Annual Conf., Design Automation, pp. 491-496, 1994. |
Ohta et al., Optimal tile size adjustment in compiling general DOACROSS loop nests, ICS 95, ACM Jun. 1995, pp. 270-279. |
Debray, Saumya, Abstract interpretation and low level code optimization, PEPM 95, ACM, pp. 111-121, 1995. |
Wang Ko, Precise compile time performance prediction for superscalar based computers, SIGPLAN 94, ACM Jun. 1994, pp. 73-84. |