| Nakamura et al., “Effectiveness of Register Preloading on CP-PACS Node Processor”, 1998, IEEE, p 83-90.* |
| Jegou et al., “Speculative Prefetching”, 1993, ACM, p 57-66.* |
| Chen et al., “An Efficient Architecture for Loop Based Data Preloading”, 1992, IEEE, p 92-101.* |
| “Design and Evaluation of Compiler Algorithm for Prefetching, Architectural Support for Programming Languages and Operating Systems”, 1992, p 62-73.* |
| “Data Bus Technology RISC-based Massively Parallel Supercomputer”, Journal of IPSJ, vol. 38 No. 6, 1997, p. 485-492.* |
| “A Data Locality Optimizing Algorithm, Programming Language Design and Implementation”, 1991, p 30-40.* |
| “Cross-loop Reuse Analysis and its Application to Cache Optimizations, Workshop on Languages and Compilers for Parallel Computing”, 1996, p 1-15.* |
| “Software Pipelining: An Effective Scheduling Technique for VLIW Machines”, 1988, p 318-328.* |
| “Tolerating Data Access Latency with Register Preloading”, Proceedings of the 1992 International Conference on Supercomputing, 1992, p 1-11. |