| Bailey, David H., "Vector Computer Memory Bank Contention", IEEE Transactions on Computers, vol. C-36, No. 3, Mar. 1987, pp. 293-298. |
| Budnik, Paul and David J. Kuck, "The Organization and Use of Parallel Memories", IEEE Transactions on Computers, Dec. 1971, pp. 1566-1569. |
| Fu, John W. C., and Janak H. Patel, "Data Prefetching in Multiprocessor Vector Cache Memories", Center for Reliable and High-Performance Computing, University of Illinois at Urbana-Champaign, 1991, pp. 54-63. |
| Gannon, Dennis, William Jalby, and Kyle Gallivan, "Strategies for Cache and Local Memory Management by Global Program Transformation", International Conference on Supercomputing, 1987, pp. 229-254. |
| Harper, III, David T., "Block, Multistride Vector, and FFT Accesses in Parallel Memory System", IEEE Transactions on Parallel and Distribution Systems, vol. 2, No. 1, Jan. 1991, pp. 43-51. |
| Hill, Mark D., "A Case for Direct-Mapped Caches", IEEE Computer, Dec. 1988, pp. 25-40. |
| Lam, Monica S., Edward E. Rothberg, and Michael E. Wolf, "The Cache Performance and Optimizations of Blocked Algorithms", 4th International Conference on ASPLOS, 1991, pp. 63-74. |
| Lawrie, Duncan H. and Chandra R. Vora, "The Prime Memory System for Array Access", IEEE Transactions on Computers, vol. C-31, No. 5, May 1982, pp. 435-442. |
| Oed, Wilfred, and Otto Lange, "On the Effective Bandwidth of Interleaved Memories in Vector Processor Systems", IEEE Transactions on Computers, vol. C-34, No. 10, Oct. 1985, pp. 949-957. |
| Raghavan, Ram and John P. Hayes, "On Randomly Inerleaved Memories", Proceedings of Supercoputing, 1990, pp. 49-57. |
| Smith, Alan Jay, "Cache Memories", Computing Surveys, vol. 14, No. 3, Sep. 1982, pp. 473-523. |
| So, Kimming, and Vittorio Zecca, "Cache Performance of Vector Processors", International Symposium on Computer Architecture, 1988, pp. 261-268. |
| Yang, Qing, Laxmi N. Bhuyan, and Bao-Chyn Liu, "Analysis and Comparison of Cache Coherence Protocols for a Packet-Switched Multiprocessor", IEEE Transactions on Computers, vol. 38, No. 8, Aug. 1989, pp. 1143-1153. |
| Dongarra, J. et al., "A Set of Level 3 Basic Linear Algebra Subprograms", ACM Transactions on Mathematics Software, vol. 16-1, Mar. 1990, pp. 1-17. |
| Abu-Sufah, W., and A. D. Malony, "Vector Processing on the ALLIANT FX/8 Microprocessor", International Conference on Parallel Processing, Aug. 1986, pp. 559-566. |
| Bandarkar, D. and R. Brunner, "VAX Vector Architecture", Proc. 17th International Symposium on Computer Architecture, 1990, pp. 204-215. |
| Hennessy, J. L. and D. A. Patterson, Computer Architecture, A Quantitative Approach, Morgan Kaufmann, 1990. |
| Stone, H. S., High Performance Computer Architecture, Addison-Wesley, 1990. |
| Pettofrezzo, A. J., and D. R. Byrkit, Elements of Number Theory, Prentice-Hall, 1970. |
| Cooley, J. W., The Structure of FFT and Convolution Algorithms, Research Report, IBM T. J. Watson Research Center, 1990. |