Nakamura et al., “Effectiveness of Register Preloading on CP-PACS Node Processor”, 1998, IEEE, p 83-90.* |
Jegou et al., “Speculative Prefetching”, 1993, ACM, p 57-66.* |
Chen et al., “An Efficient Architecture for Loop Based Data Preloading”, 1992, IEEE, p 92-101.* |
“Design and Evaluation of Compiler Algorithm for Prefetching, Architectural Support for Programming Languages and Operating Systems”, 1992, p 62-73.* |
“Data Bus Technology RISC-based Massively Parallel Supercomputer”, Journal of IPSJ, vol. 38 No. 6, 1997, p. 485-492.* |
“A Data Locality Optimizing Algorithm, Programming Language Design and Implementation”, 1991, p 30-40.* |
“Cross-loop Reuse Analysis and its Application to Cache Optimizations, Workshop on Languages and Compilers for Parallel Computing”, 1996, p 1-15.* |
“Software Pipelining: An Effective Scheduling Technique for VLIW Machines”, 1988, p 318-328.* |
“Tolerating Data Access Latency with Register Preloading”, Proceedings of the 1992 International Conference on Supercomputing, 1992, p 1-11. |