Agarwal, Anant, et al., “An Evaluation of Directory Schemes for Cache Coherence”, Proceedings of 15th International Symposium on Computer Architecture (“ISCA”) (May 1988) pp. 280-289. |
Barroso, Luiz Andre, et al., “Impact of Chip-Level Integration on Performance of OLTP Workloads”, High-Performance Computer Architecture (“HPCA”) (Jan. 2000). |
Barroso, Luiz Andre, et al., “Memory System Characterization of Commercial Workloads”, ISCA (Jun. 1998). |
Eggers, Susan J., et al., “Simultaneous Multithreading: A Platform for Next-generation Processors”, University of Washington, DEC Western Research Laboratory ({eggers,levy,jlo}@cs.washington.edu) ({emer,stamm}@vssad.enet.dec.com) pp. 1-15. |
Eickemeyer, Richard J., et al., “Evaluation of Multithreaded Uniprocessors for Commercial Application Environments”, ACM (1996) (0-89791-786-3) pp. 203-212. |
Gupta, Anoop, et al., “Reducing Memory and Traffic Requirements for Scalable Directory-Based Cache Coherence Schemes”, Stanford University Computer Systems Laboratory pp. 1-10. |
Hammond, Lance, et al., “A Single-Chip Multiprocessor”, IEEE (Sep. 1997) (0018-9162). |
Hammond, Lance, et al., “Data Speculation Support for a Chip Multiprocessor”, Stanford University, Computer Systems Laboratory (http://www-hydra.stanford.edu/). |
Jouppi, Norman P., et al., “Tradeoffs in Two-Level On-Chip Caching”, WRL Research Report 93/3, Western Research Laboratory (WRL-Techreports@decwrl.dec.com) (Dec. 1993) pp. 1-31. |
Krishnan, Venkata, et al., “Hardware and Software Support for Speculative Execution of Sequential Binaries on a Chip-Multiprocessor”, University of Illinois at Urbana-Champaign (http://iacoma.cs.uiuc.edu.). |
Kuskin, Jeffrey, et al., “The Stanford FLASH Multiprocessor”, Stanford University, Computer Systems Laboratory. |
Laudon, James, et al., “The SGI Origin: A ccNUMA Highly Scalable Server”, Silicon Graphics, Inc. (laudon@sgi.com). |
Lenoski, Daniel, et al., “The Directory-Based Cache Coherence Protocol for the DASH Multiprocessor”, IEEE (1990) (CH2887-8) pp. 148-159. |
Nayfeh, Basem A., et al., “Evaluation of Design Alternatives for a Multiprocessor Microprocessor”, ACM (1996) (0-89791-786-3) pp. 67-77. |
Nowatzyk, Andreas G., et al., “S-Connect: from Networks of Workstations to Supercomputer Performance”, 22nd Annual International Symposium on Computer Architecture (“ISCA”) (Jun. 1995). |
Nowatzyk, Andreas, et al., “Exploiting Parallelism in Cache Coherency Protocol Engines”, Sun Microsystems Computer Corporation. |
Olukotun, Kunle, et al., “The Case for a Single-Chip Multiprocessor”, Proceedings Seventh International Symposium Architectural Support for Programming Languages and Operating Systems (“ASPLOS VII”) (Oct. 1996). |
Steffan, J. Gregory, et al., “The Potential for Using Thread-Level Data Speculation to Facilitate Automatic Parallelization”, HPCA-4 (Feb. 1998) pp. 1-12. |
Tremblay, Marc, “MAJC™-5200: A VLIW Convergent MPSOC”, Sun Microsystems, Inc., Microprocessor Forum (1999). |
Kunkel, Steven, et al., “System Optimization for OLTP Workloads”, IEEE (1999) (0272-1732) pp. 56-64. |
Diefendorff, Keith, “Power4 Focuses on Memory Bandwidth”, Microdesign Resources, Microprocessor Report vol. 13 No. 13 (Oct. 1999). |
Hammond, Lance, et al., “The Stanford Hydra CMP”, Stanford University, Computer Systems Laboratory (http://www-hydra.stanford.edu). |