"Instruction Match Function for Processor Performance Monitoring," IBM Technical Disclosure Bulletin, 39:12, pp. 119-121 (Dec. 1996). |
Instruction Set Execution Frequency Estimation by Means of Program Counter Sampling, IBM Technical Disclosure Bulletin, 33:10a, P. 203 (Mar. 1991). |
Anderson, J.M. et al., "Continuous Profiling: Where Have All the Cycles Gone?", Digital Systems Research Center, URL:ftp://ftp.digital.com/pub/DEC/SRC/techical-notes/SRC-1077-016.pdf (downloaded Nov. 16, 1999) by EPO. |
Abraham et al., Predicting Load Latencies Using Cache Profiling, HPL-94-110, Nov. 1994, copyright Hewlett-Packard Co. |
Anderson et al., Continuous Profiling: Where have all the Cycles Gone? To be published in The Proceedings of the 16.sup.th ACM Symposium on Operating Systems Principles, copyright 1997 by the Assoc. for Computing Machinery. |
Ball et al., Efficient Path Profiling, Published in Proceedings of MICRO-29, Dec. 2-4, 1996, in Paris, France, pp. 46-57. Copyright 1996 IEEE. |
Bershad et al., Avoiding Conflict Misses Dynamically in Large Direct-Mapped Caches, Proceedings of the 6.sup.th International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 158-170, Oct. 4-7, 1994. |
Cohn et al., Hot Cold Optimization of Large Windows/NT Applications, Proceedings of the 29.sup.th Annual International Symposium on Microarchitecture, pp. 80-89, Dec. 1996. Copyright 1996 IEEE. |
Conte et al., Using Branch Handling Hardware to Support Profile-Driven Optimization, Proceedings of the 1994 27.sup.th Annual International Symposium on Microarchitecture, Nov. 30--Dec. 2, 1994, San Jose, Calif. |
Conte et al., Accurate and Practical Profile-Driven Compilation Using the Profile Buffer, Proceedings of the 29.sup.th Annual International Symposium on Microarchitecture, pp. 36-45, Dec. 2-4, 1996. |
Fisher, J.A., Global Code Generation for Instruction-Level Parallelism: Trace Scheduling-2, Hewlett-Packard Technical Report No. HPL-93-43, Jun., 1993. To be published by Springer-Verlag, London, UK. |
Horowitz et al., Informing Memory Operations: Providing Memory Performance Feedback in Modern Processors, Proceedings of the 23.sup.rd Annual International Symposium on Computer Architecture, pp. 260-270, May 22-24, 1996. |
Hwu et al., The Superblock: An Effective Technique for VLIW and Superscalar Compilation, Center for Reliable and High-Performance Computing, Univ. of Illinois, Urbana-Champaign, Illinois, 61801. |
Romer et al., Dynamic Page Mapping Policies for Cache Conflict Resolution on Standard Hardware, Proceedings of the First Symposium for Operating Systems Design and Implementation, pp. 255-266, 1994. |
Romer et al., Reducing TLB and Memory Overhead Using Online Superpage Promotion, Proceedings of the 22.sup.nd Annual International Symposium on Computer Architecture, pp. 176-187, Jun. 1995. |
Tullsen et al., Exploiting Choice: Instruction Fetch and Issue on an Implementable Simultaneous Multithreading Processor, Proceedings of the 23.sup.rd Annual International Symposium on Computer Architecture, Philadelphia, PA, May, 1996. |
Tullsen et al., Simultaneous Multithreading: Maximizing On-Chip Parallelism, Proceedings of the 22.sup.nd Annual International Symposium on Computer Architecture, Santa Margherita Ligure, Italy, Jun. 1995. |
Verghese et al., Operating System Support for Improving Data Locality on CC-Numa Compute Servers, Proceedings of the 7.sup.th International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 279-289, Oct. 1-5, 1996. |
Young et al., Improving the Accuracy of Static Branch Prediction using Branch Correlation, Proceedings of the 6.sup.th International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 232-241, Oct. 4-7, 1994. |