Yale N. Patt

Computer, April, 2024

Timely, Efficient, and Accurate Branch Precomputation.

[BibT_eX]

[DOI]

Aniket Anand Deshmukh

Lingzhe Chester Cai

Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024

Alternate Path Fetch.

[BibT_eX]

[DOI]

Aniket Anand Deshmukh

Lingzhe Chester Cai

Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

Hetero: Where we've been, Where we are, and What Next?

[BibT_eX]

[DOI]

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

HCW 2024 Keynote: Hetero: Where we've been, Where we are, and What Next?

[BibT_eX]

[DOI]

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2022

SafeGuard: Reducing the Security Risk from Row-Hammer via Low-Cost Integrity Protection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

2021

Branch Runahead: An Alternative to Branch Prediction for Impossible to Predict Branches.

[BibT_eX]

[DOI]

Stephen Pruett

Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021

Criticality Driven Fetch.

[BibT_eX]

[DOI]

Aniket Anand Deshmukh

Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021

2020

Dynamic Merge Point Prediction.

[BibT_eX]

[DOI]

Stephen Pruett

CoRR, 2020

BranchNet: A Convolutional Neural Network to Predict Hard-To-Predict Branches.

[BibT_eX]

[DOI]

Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

Tailored Page Sizes.

[BibT_eX]

[DOI]

Faruk Guvenilir

Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

2018

Duplicon Cache: Mitigating Off-Chip Memory Bank and Bank Group Conflicts Via Data Duplication.

[BibT_eX]

[DOI]

Ben Lin

Michael B. Healy

Philip G. Emma

Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, 2018

2016

Common Bonds: MIPS, HPS, Two-Level Branch Prediction, and Compressed Code RISC Processor.

[BibT_eX]

[DOI]

IEEE Micro, 2016

Efficient Execution of Bursty Applications.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2016

Continuous runahead: Transparent hardware acceleration for memory intensive workloads.

[BibT_eX]

[DOI]

Milad Hashemi

Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Accelerating Dependent Cache Misses with an Enhanced Memory Controller.

[BibT_eX]

[DOI]

Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

Greater Performance and Better Efficiency: Predicated Execution has shown us the way.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016

2015

Filtered runahead execution with a runahead buffer.

[BibT_eX]

[DOI]

Milad Hashemi

Proceedings of the 48th International Symposium on Microarchitecture, 2015

2014

Author retrospective for increasing the instruction fetch rate via multiple branch prediction and a branch address cache.

[BibT_eX]

[DOI]

Deborah T. Marr

Proceedings of the ACM International Conference on Supercomputing 25th Anniversary Volume, 2014

2013

Keynote 1 - VLSI 2.0: R&D Post Moore.

[BibT_eX]

[DOI]

Sani R. Nassif

Philippe Olivier Alexandre Navaux

Magdy S. Abadir

Proceedings of the 21st IEEE/IFIP International Conference on VLSI and System-on-Chip, 2013

Utility-based acceleration of multithreaded applications on asymmetric CMPs.

[BibT_eX]

[DOI]

Proceedings of the 40th Annual International Symposium on Computer Architecture, 2013

2012

Fairness via Source Throttling: A Configurable and High-Performance Fairness Substrate for Multicore Memory Systems.

[BibT_eX]

[DOI]

ACM Trans. Comput. Syst., 2012

Energy Savings via Dead Sub-Block Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE 24th International Symposium on Computer Architecture and High Performance Computing, 2012

Predicting Performance Impact of DVFS for Realistic Memory Systems.

[BibT_eX]

[DOI]

Eiman Ebrahimi

Proceedings of the 45th Annual IEEE/ACM International Symposium on Microarchitecture, 2012

MorphCore: An Energy-Efficient Microarchitecture for High Performance ILP and High Throughput TLP.

[BibT_eX]

[DOI]

Proceedings of the 45th Annual IEEE/ACM International Symposium on Microarchitecture, 2012

High performance supercomputers: should the individual processor be more than a brick?

[BibT_eX]

[DOI]

Proceedings of the International Conference on Supercomputing, 2012

Bottleneck identification and scheduling in multithreaded applications.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems, 2012

2011

HPS Microarchitecture.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Parallel Computing, 2011

Prefetch-Aware Memory Controllers.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2011

Data Marshaling for Multicore Systems.

[BibT_eX]

[DOI]

IEEE Micro, 2011

Top Picks [Guest editors' introduction].

[BibT_eX]

[DOI]

IEEE Micro, 2011

Improving GPU performance via large warps and two-level warp scheduling.

[BibT_eX]

[DOI]

Veynu Narasiman

Michael Shebanow

Chang Joo Lee

Proceedings of the 44rd Annual IEEE/ACM International Symposium on Microarchitecture, 2011

Parallel application memory scheduling.

[BibT_eX]

[DOI]

Eiman Ebrahimi

Proceedings of the 44rd Annual IEEE/ACM International Symposium on Microarchitecture, 2011

Prefetch-aware shared resource management for multi-core systems.

[BibT_eX]

[DOI]

Proceedings of the 38th International Symposium on Computer Architecture (ISCA 2011), 2011

2010

Accelerating Critical Section Execution with Asymmetric Multicore Architectures.

[BibT_eX]

[DOI]

IEEE Micro, 2010

Data marshaling for multi-core architectures.

[BibT_eX]

[DOI]

Proceedings of the 37th International Symposium on Computer Architecture (ISCA 2010), 2010

Fairness via source throttling: a configurable and high-performance fairness substrate for multi-core memory systems.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on Architectural Support for Programming Languages and Operating Systems, 2010

Feedback-directed pipeline parallelism.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, 2010

2009

Virtual Program Counter (VPC) Prediction: Very Low Cost Indirect Branch Prediction Using Conditional Branch Prediction Hardware.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2009

What Else Is Broken? Can We Fix It?

[BibT_eX]

[DOI]

Proceedings of the Embedded Computer Systems: Architectures, 2009

Improving memory bank-level parallelism in the presence of prefetching.

[BibT_eX]

[DOI]

Proceedings of the 42st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-42 2009), 2009

Coordinated control of multiple prefetchers in multi-core systems.

[BibT_eX]

[DOI]

Proceedings of the 42st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-42 2009), 2009

Flexible reference-counting-based hardware acceleration for garbage collection.

[BibT_eX]

[DOI]

José A. Joao

Proceedings of the 36th International Symposium on Computer Architecture (ISCA 2009), 2009

Multi-core demands multi-interfaces.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on High-Performance Computer Architecture (HPCA-15 2009), 2009

Techniques for bandwidth-efficient prefetching of linked data structures in hybrid prefetching systems.

[BibT_eX]

[DOI]

Eiman Ebrahimi

Proceedings of the 15th International Conference on High-Performance Computer Architecture (HPCA-15 2009), 2009

Accelerating critical section execution with asymmetric multi-core architectures.

[BibT_eX]

[DOI]

Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems, 2009

The Challenges of Multicore: Information and Mis-Information.

[BibT_eX]

[DOI]

Proceedings of the Architecture of Computing Systems, 2009

2008

Set-Dueling-Controlled Adaptive Insertion for High-Performance Caching.

[BibT_eX]

[DOI]

IEEE Micro, 2008

Dynamic Predication of Indirect Jumps.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2008

Can They Be Fixed: Some Thoughts After 40 Years in the Business.

[BibT_eX]

[DOI]

Proceedings of the Embedded Computer Systems: Architectures, 2008

Prefetch-Aware DRAM Controllers.

[BibT_eX]

[DOI]

Proceedings of the 41st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-41 2008), 2008

Achieving Out-of-Order Performance with Almost In-Order Complexity.

[BibT_eX]

[DOI]

Francis Tseng

Proceedings of the 35th International Symposium on Computer Architecture (ISCA 2008), 2008

Performance-aware speculation control using wrong path usefulness prediction.

[BibT_eX]

[DOI]

Proceedings of the 14th International Conference on High-Performance Computer Architecture (HPCA-14 2008), 2008

Feedback-driven threading: power-efficient and high-performance execution of multi-threaded workloads on CMPs.

[BibT_eX]

[DOI]

M. Aater Suleman

Proceedings of the 13th International Conference on Architectural Support for Programming Languages and Operating Systems, 2008

Improving the performance of object-oriented languages with dynamic predication of indirect jumps.

[BibT_eX]

[DOI]

Proceedings of the 13th International Conference on Architectural Support for Programming Languages and Operating Systems, 2008

2007

Diverge-Merge Processor: Generalized and Energy-Efficient Dynamic Predication.

[BibT_eX]

[DOI]

IEEE Micro, 2007

Single-Threaded vs. Multithreaded: Where Should We Focus?

[BibT_eX]

[DOI]

IEEE Micro, 2007

Adaptive insertion policies for high performance caching.

[BibT_eX]

[DOI]

Proceedings of the 34th International Symposium on Computer Architecture (ISCA 2007), 2007

VPC prediction: reducing the cost of indirect branches via hardware-based dynamic devirtualization.

[BibT_eX]

[DOI]

Proceedings of the 34th International Symposium on Computer Architecture (ISCA 2007), 2007

Feedback Directed Prefetching: Improving the Performance and Bandwidth-Efficiency of Hardware Prefetchers.

[BibT_eX]

[DOI]

Proceedings of the 13st International Conference on High-Performance Computer Architecture (HPCA-13 2007), 2007

Line Distillation: Increasing Cache Capacity by Filtering Unused Words in Cache Lines.

[BibT_eX]

[DOI]

M. Aater Suleman

Proceedings of the 13st International Conference on High-Performance Computer Architecture (HPCA-13 2007), 2007

The Transformation Hierarchy in the Era of Multi-core.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing, 2007

Profile-assisted Compiler Support for Dynamic Predication in Diverge-Merge Processors.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Symposium on Code Generation and Optimization (CGO 2007), 2007

2006

Address-Value Delta (AVD) Prediction: A Hardware Technique for Efficiently Parallelizing Dependent Cache Misses.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2006

Efficient Runahead Execution: Power-Efficient Memory Latency Tolerance.

[BibT_eX]

[DOI]

IEEE Micro, 2006

Wish Branches: Enabling Adaptive and Aggressive Predicated Execution.

[BibT_eX]

[DOI]

IEEE Micro, 2006

Foreword.

[BibT_eX]

[DOI]

Jean-Luc Gaudiot

Kevin Skadron

IEEE Comput. Archit. Lett., 2006

Utility-Based Cache Partitioning: A Low-Overhead, High-Performance, Runtime Mechanism to Partition Shared Caches.

[BibT_eX]

[DOI]

Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-39 2006), 2006

Diverge-Merge Processor (DMP): Dynamic Predicated Execution of Complex Control-Flow Graphs Based on Frequently Executed Paths.

[BibT_eX]

[DOI]

Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-39 2006), 2006

A Case for MLP-Aware Cache Replacement.

[BibT_eX]

[DOI]

Proceedings of the 33rd International Symposium on Computer Architecture (ISCA 2006), 2006

Computer Architecture Research and Future Microprocessors: Where Do We Go from Here?

[BibT_eX]

[DOI]

Proceedings of the 33rd International Symposium on Computer Architecture (ISCA 2006), 2006

2D-Profiling: Detecting Input-Dependent Branches with a Single Input Data Set.

[BibT_eX]

[DOI]

Proceedings of the Fourth IEEE/ACM International Symposium on Code Generation and Optimization (CGO 2006), 2006

2005

An Analysis of the Performance Impact of Wrong-Path Memory References on Out-of-Order and Runahead Execution Processors.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2005

Using the First-Level Caches as Filters to Reduce the Pollution Caused by Speculative Memory References.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 2005

On Reusing the Results of Pre-Executed Instructions in a Runahead Execution Processor.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2005

Address-Value Delta (AVD) Prediction: Increasing the Effectiveness of Runahead Execution by Exploiting Regular Memory Allocation Patterns.

[BibT_eX]

[DOI]

Proceedings of the 38th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-38 2005), 2005

Wish Branches: Combining Conditional Branching and Predication for Adaptive Predicated Execution.

[BibT_eX]

[DOI]

Proceedings of the 38th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-38 2005), 2005

The V-Way Cache: Demand Based Associativity via Global Replacement.

[BibT_eX]

[DOI]

David Thompson

Proceedings of the 32st International Symposium on Computer Architecture (ISCA 2005), 2005

Techniques for Efficient Processing in Runahead Execution Engines.

[BibT_eX]

[DOI]

Proceedings of the 32st International Symposium on Computer Architecture (ISCA 2005), 2005

A Unifying Theory of Distributed Processing (Or, The Chutzpah One Should Expect When You Invite a Microarchitect into Your Sandbox).

[BibT_eX]

[DOI]

Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

Microarchitecture-Based Introspection: A Technique for Transient-Fault Tolerance in Microprocessors.

[BibT_eX]

[DOI]

Proceedings of the 2005 International Conference on Dependable Systems and Networks (DSN 2005), 28 June, 2005

The microprocessor of the year 2014: do Pentium 4, Pentium M, and Power 5 provide any hints?

[BibT_eX]

[DOI]

Proceedings of the 2005 ACS / IEEE International Conference on Computer Systems and Applications (AICCSA 2005), 2005

2004

Understanding the effects of wrong-path memory references on processor performance.

[BibT_eX]

[DOI]

Proceedings of the 3rd Workshop on Memory Performance Issues, 2004

Cache Filtering Techniques to Reduce the Negative Impact of Useless Speculative Memory References on Processor Performance.

[BibT_eX]

[DOI]

Proceedings of the 16th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2004), 2004

Wrong Path Events: Exploiting Unusual and Illegal Program Behavior for Early Misprediction Detection and Recovery.

[BibT_eX]

[DOI]

Proceedings of the 37th Annual International Symposium on Microarchitecture (MICRO-37 2004), 2004

Opening and keynote 1.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Symposium on Performance Analysis of Systems and Software, 2004

The future of simulation: A field of dreams.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Symposium on Performance Analysis of Systems and Software, 2004

Introduction to computing systems - from bits and gates to C and beyond (2. ed.).

[BibT_eX]

McGraw-Hill, ISBN: 978-0-07-246750-5, 2004

2003

Runahead Execution: An Effective Alternative to Large Instruction Windows.

[BibT_eX]

[DOI]

IEEE Micro, 2003

Teaching and teaching computer architecture: two very different topics: (some opinions about each).

[BibT_eX]

[DOI]

Proceedings of the 2003 workshop on Computer architecture education, 2003

Partitioned first-level cache design for clustered microarchitectures.

[BibT_eX]

[DOI]

Paul Racunas

Proceedings of the 17th Annual International Conference on Supercomputing, 2003

Runahead Execution: An Alternative to Very Large Instruction Windows for Out-of-Order Processors.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Symposium on High-Performance Computer Architecture (HPCA'03), 2003

The High Performance Microprocessor in the Year 2013: What Will It Look Like? What It Won't Look Like?

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - HiPC 2003, 10th International Conference, 2003

2002

Microarchitectural support for precomputation microthreads.

[BibT_eX]

[DOI]

Proceedings of the 35th Annual International Symposium on Microarchitecture, 2002

Difficult-Path Branch Prediction Using Subordinate Microthreads.

[BibT_eX]

[DOI]

Proceedings of the 29th International Symposium on Computer Architecture (ISCA 2002), 2002

Using Internal Redundant Representations and Limited Bypass to Support Pipelined Adders and Register Files.

[BibT_eX]

[DOI]

Mary D. Brown

Proceedings of the Eighth International Symposium on High-Performance Computer Architecture (HPCA'02), 2002

Handling of packet dependencies: a critical issue for highly parallel network processors.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Compilers, 2002

2001

Requirements, bottlenecks, and good fortune: agents for microprocessor evolution.

[BibT_eX]

[DOI]

Proc. IEEE, 2001

Programming early considered harmful.

[BibT_eX]

[DOI]

Proceedings of the 32rd SIGCSE Technical Symposium on Computer Science Education, 2001

Select-free instruction scheduling logic.

[BibT_eX]

[DOI]

Mary D. Brown

Proceedings of the 34th Annual International Symposium on Microarchitecture, 2001

2000

Soft updates: a solution to the metadata update problem in file systems.

[BibT_eX]

[DOI]

ACM Trans. Comput. Syst., 2000

On pipelining dynamic instruction scheduling logic.

[BibT_eX]

[DOI]

Mary D. Brown

Proceedings of the 33rd Annual IEEE/ACM International Symposium on Microarchitecture, 2000

Higher and Higher Performance Microprocessors: Are The Problems Just Too Hard To Solve?

[BibT_eX]

[DOI]

Proceedings of the 26th EUROMICRO 2000 Conference, 2000

1999

Evaluation of Design Options for the Trace Cache Fetch Mechanism.

[BibT_eX]

[DOI]

Daniel H. Friendly

IEEE Trans. Computers, 1999

Computer architecture education: mechanical engineers need it too.

[BibT_eX]

[DOI]

Proceedings of the 1999 workshop on Computer architecture education, 1999

Simultaneous Subordinate Microthreading (SSMT).

[BibT_eX]

[DOI]

Proceedings of the 26th Annual International Symposium on Computer Architecture, 1999

1998

Using System-Level Models to Evaluate I/O Subsystem Designs.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 1998

Increasing the Instruction Fetch Rate via Block-Structured Instruction Set Architectures.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 1998

Putting the Fill Unit to Work: Dynamic Optimizations for Trace Cache Microprocessors.

[BibT_eX]

[DOI]

Daniel H. Friendly

Proceedings of the 31st Annual IEEE/ACM International Symposium on Microarchitecture, 1998

Alternative Implementations of Two-Level Adaptive Branch Prediction.

[BibT_eX]

[DOI]

Proceedings of the 25 Years of the International Symposia on Computer Architecture (Selected Papers)., 1998

Retrospective: Alternative Implementations of Two-Level Adaptive Training Branch Prediction.

[BibT_eX]

[DOI]

Proceedings of the 25 Years of the International Symposia on Computer Architecture (Selected Papers)., 1998

Improving Trace Cache Effectiveness with Branch Promotion and Trace Packing.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual International Symposium on Computer Architecture, 1998

Retrospective: HPSm, a High Performance Restricted Data Flow Architecture Having Minimal Functionality.

[BibT_eX]

[DOI]

Proceedings of the 25 Years of the International Symposia on Computer Architecture (Selected Papers)., 1998

An Analysis of Correlation and Predictability: What Makes Two-Level Branch Predictors Work.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual International Symposium on Computer Architecture, 1998

Variable Length Path Branch Prediction.

[BibT_eX]

[DOI]

Proceedings of the ASPLOS-VIII Proceedings of the 8th International Conference on Architectural Support for Programming Languages and Operating Systems, 1998

1997

Recovery requirements of branch prediction storage structures in the presence of mispredicted-path execution.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 1997

Improving branch prediction accuracy by reducing pattern history table interference.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 1997

One Billion Transistors, One Uniprocessor, One Chip.

[BibT_eX]

[DOI]

Computer, 1997

Identifiying Obstacles in the Path to More.

[BibT_eX]

[DOI]

Computer, 1997

Reducing the Performance Impact of Instruction Cache Misses by Writing Instructions into the Reservation Stations Out-of-Order.

[BibT_eX]

[DOI]

Paul Racunas

Proceedings of the Thirtieth Annual IEEE/ACM International Symposium on Microarchitecture, 1997

Alternative Fetch and Issue Policies for the Trace Cache Fetch Mechanism.

[BibT_eX]

[DOI]

Daniel H. Friendly

Proceedings of the Thirtieth Annual IEEE/ACM International Symposium on Microarchitecture, 1997

The Agree Predictor: A Mechanism for Reducing Negative Branch History Interference.

[BibT_eX]

[DOI]

Proceedings of the 24th International Symposium on Computer Architecture, 1997

Target Prediction for Indirect Jumps.

[BibT_eX]

[DOI]

Eric Hao

Proceedings of the 24th International Symposium on Computer Architecture, 1997

Using Non-Volatile Storage to Improve the Reliability of RAID5 Disk Arrays.

[BibT_eX]

[DOI]

Proceedings of the Digest of Papers: FTCS-27, 1997

1996

Branch Classification: New Mechanism for Improving Branch Predictor Performance.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 1996

Using Predicated Execution to Improve the Performance of a Dynamically Scheduled Machine with Speculative Execution.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 1996

First Courses and Fundamentals.

[BibT_eX]

[DOI]

ACM Comput. Surv., 1996

Microarchitecture, Compilers and Algorithms.

[BibT_eX]

[DOI]

ACM Comput. Surv., 1996

Education in computer science and computer engineering starts with computer architecture.

[BibT_eX]

[DOI]

Proceedings of the 1996 workshop on Computer architecture education, 1996

Using Hybrid Branch Predictors to Improve Branch Prediction Accuracy in the Presence of Context Switches.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual International Symposium on Computer Architecture, 1996

The effects of mispredicted-path execution on branch prediction structures.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Parallel Architectures and Compilation Techniques, 1996

1995

Scanning the Issue - Special Issue on Microprocessors.

[BibT_eX]

[DOI]

Proc. IEEE, 1995

Enhancing instruction scheduling with a block-structured ISA.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 1995

Components of a computer architecture education: optimal and suboptimal.

[BibT_eX]

[DOI]

Proceedings of the 1995 Workshop on Computer Architecture Education, 1995

Components of a computer architecture education.

[BibT_eX]

[DOI]

Proceedings of the 1995 Workshop on Computer Architecture Education, 1995

On-Line Extraction of SCSI Disk Drive Parameters.

[BibT_eX]

[DOI]

Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, 1995

Alternative implementations of hybrid branch predictors.

[BibT_eX]

[DOI]

Eric Hao

Proceedings of the 28th Annual International Symposium on Microarchitecture, Ann Arbor, Michigan, USA, November 29, 1995

Track Piggybacking: An Improved Rebuild Algorithm for RAID5 Disk Arrays.

[BibT_eX]

Proceedings of the 1995 International Conference on Parallel Processing, 1995

1994

The I/O Subsystem - A Candidate for Improvement: Guest Editor's Introduction.

[BibT_eX]

[DOI]

Computer, 1994

Disk Arrays: High-Performance, High-Reliability Storage Subsystems.

[BibT_eX]

[DOI]

Computer, 1994

Scheduling Algorithms for Modern Disk Drives.

[BibT_eX]

[DOI]

Bruce L. Worthington

Proceedings of the 1994 ACM SIGMETRICS conference on Measurement and modeling of computer systems, 1994

Metadata Update Performance in File Systems.

[BibT_eX]

[DOI]

Proceedings of the First USENIX Symposium on Operating Systems Design and Implementation (OSDI), 1994

Facilitating superscalar processing via a combined static/dynamic register renaming scheme.

[BibT_eX]

[DOI]

Eric Sprangle

Proceedings of the 27th Annual International Symposium on Microarchitecture, San Jose, California, USA, November 30, 1994

The effect of speculatively updating branch history on branch prediction accuracy, revisited.

[BibT_eX]

[DOI]

Eric Hao

Proceedings of the 27th Annual International Symposium on Microarchitecture, San Jose, California, USA, November 30, 1994

1993

Comparing Rebuild Algorithms for Mirrored and RAID5 Disk Arrays.

[BibT_eX]

[DOI]

Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, 1993

The Process-Flow Model: Examining I/O Performance from the System's Point of View.

[BibT_eX]

[DOI]

Proceedings of the 1993 ACM SIGMETRICS conference on Measurement and modeling of computer systems, 1993

Branch history table indexing to prevent pipeline bubbles in wide-issue superscalar processors.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual International Symposium on Microarchitecture, 1993

A comparative performance evaluation of various state maintenance mechanisms.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual International Symposium on Microarchitecture, 1993

A Comparison of Dynamic Branch Predictors that Use Two Levels of Branch History.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual International Symposium on Computer Architecture, 1993

Increasing the Instruction Fetch Rate via Multiple Branch Prediction and a Branch Address Cache.

[BibT_eX]

[DOI]

Deborah T. Marr

Proceedings of the 7th international conference on Supercomputing, 1993

Trading Disk Capacity for Performance.

[BibT_eX]

[DOI]

Proceedings of the Second International Symposium on High Performance Distributed Computing, 1993

1992

An experimental single-chip data flow CPU.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, January, 1992

Highest performance computing machines.

[BibT_eX]

[DOI]

Microprocess. Microprogramming, 1992

Report of the Purdue Workshop on Grand Challenges in Computer Architecture for the Support of High Performance Computing.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 1992

A comprehensive instruction fetch mechanism for a processor supporting speculative execution.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual International Symposium on Microarchitecture, 1992

An investigation of the performance of various dynamic scheduling techniques.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual International Symposium on Microarchitecture, 1992

1991

Experimental Research in Computer Architecture - Guest Editor's Introduction to the Special Issue.

[BibT_eX]

[DOI]

Computer, 1991

Two-Level Adaptive Training Branch Prediction.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual IEEE/ACM International Symposium on Microarchitecture, 1991

The Effect of Real Data Cache Behavior on the Performance of a Microarchitecture that Supports Dynamic Scheduling.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual IEEE/ACM International Symposium on Microarchitecture, 1991

Exploiting Fine-Grained Parallelism Through a Combination of Hardware and Software Techniques.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual International Symposium on Computer Architecture. Toronto, 1991

Single Instruction Stream Parallelism is Greater Than Two.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual International Symposium on Computer Architecture. Toronto, 1991

1990

An Area-Efficient Register Alias Table for Implementing HPS.

[BibT_eX]

Proceedings of the 1990 International Conference on Parallel Processing, 1990

1989

Alternative implementations of Prolog: the microarchitecture perspective.

[BibT_eX]

[DOI]

IEEE Trans. Syst. Man Cybern., 1989

Real Machines: Design Choices / Engineering Trade-Offs - Guest Editor's Introduction.

[BibT_eX]

[DOI]

Computer, 1989

Unification Parallelism: How Much Can We Exploit?

[BibT_eX]

Ashok Singhal

Proceedings of the Logic Programming, 1989

Microarchitecture choices (implementation of the VAX).

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Workshop and Symposium on Microprogramming and Microarchitecture, 1989

A High Performance Prolog Processor with Multiple Function Units.

[BibT_eX]

[DOI]

Ashok Singhal

Proceedings of the 16th Annual International Symposium on Computer Architecture. Jerusalem, 1989

Performance benefits of large execution atomic units in dynamically scheduled machines.

[BibT_eX]

[DOI]

Proceedings of the 3rd international conference on Supercomputing, 1989

1988

The Use of Microcode Instrumentation for Development, Debugging and Tuning of Operating System Kernels.

[BibT_eX]

[DOI]

Proceedings of the 1988 ACM SIGMETRICS conference on Measurement and modeling of computer systems, 1988

Implementing a Prolog machine with multiple functional units.

[BibT_eX]

[DOI]

Ashok Singhal

Proceedings of the 21st Annual Workshop and Symposium on Microprogramming and Microarchitecture, 1988, San Diego, California, USA, November 28, 1988

Hardware support for large atomic units in dynamically scheduled machines.

[BibT_eX]

[DOI]

Michael Shebanow

Proceedings of the 21st Annual Workshop and Symposium on Microprogramming and Microarchitecture, 1988, San Diego, California, USA, November 28, 1988

Hierarchical registers for scientific computers.

[BibT_eX]

[DOI]

John A. Swensen

Proceedings of the 2nd international conference on Supercomputing, 1988

1987

Checkpoint Repair for High-Performance Out-of-Order Execution Machines.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 1987

Aquarius.

[BibT_eX]

[DOI]

SIGARCH Comput. Archit. News, 1987

On tuning the microarchitecture of an HPS implementation of the VAX.

[BibT_eX]

[DOI]

Proceedings of the 20st Annual Workshop and Symposium on Microprogramming and Microarchitecture, 1987

SPAM: a microcode based tool for tracing operating system events.

[BibT_eX]

[DOI]

Proceedings of the 20st Annual Workshop and Symposium on Microprogramming and Microarchitecture, 1987

Exploiting horizontal and vertical concurrency via the HPSm microprocessor.

[BibT_eX]

[DOI]

Proceedings of the 20st Annual Workshop and Symposium on Microprogramming and Microarchitecture, 1987

Fast Temporary Storage for Serial and Parallel Execution.

[BibT_eX]

[DOI]

John A. Swensen

Proceedings of the 14th Annual International Symposium on Computer Architecture. Pittsburgh, 1987

Checkpoint Repair for Out-of-order Execution Machines.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual International Symposium on Computer Architecture. Pittsburgh, 1987

Advantages of Implementing PROLOG by Microprogramming a Host General Purpose Computer.

[BibT_eX]

Jeffrey D. Gee

Proceedings of the Logic Programming, 1987

1986

Run-time generation of HPS microinstructions from a VAX instruction stream.

[BibT_eX]

[DOI]

Proceedings of the 19th annual workshop on Microprogramming, 1986

A microcode-based environment for noninvasive performance analysis.

[BibT_eX]

[DOI]

Proceedings of the 19th annual workshop on Microprogramming, 1986

The implementation of Prolog via VAX 8600 microcode.

[BibT_eX]

[DOI]

Jeffrey D. Gee

Proceedings of the 19th annual workshop on Microprogramming, 1986

HPSm, a High Performance Restricted Data Flow Architecture Having Minimal Functionality.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Symposium on Computer Architecture, Tokyo, Japan, June 1986, 1986

Experiments with HPS, a Restricted Data Flow Microarchitecture for High Performance Computers.

[BibT_eX]

Proceedings of the Spring COMPCON'86, 1986

High Performance Prolog, The Multiplicative Effect of Several Levels of Implementation.

[BibT_eX]

Proceedings of the Spring COMPCON'86, 1986

1985

Retrofitting the VAX-11/780 Microarchitecture for IEEE Floating Point Arithmetic - Implementation Issues, Measurements, and Analysis.

[BibT_eX]

[DOI]

David B. Aspinwall

IEEE Trans. Computers, 1985

Critical issues regarding HPS, a high performance microarchitecture.

[BibT_eX]

[DOI]

Proceedings of the 18th annual workshop on Microprogramming, 1985

HPS, a new microarchitecture: rationale and introduction.

[BibT_eX]

[DOI]

Michael Shebanow

Proceedings of the 18th annual workshop on Microprogramming, 1985

Microcode and the protection of intellectual effort.

[BibT_eX]

[DOI]

John K. Ahlstrom

Proceedings of the 18th annual workshop on Microprogramming, 1985

Compiling Prolog into microcode: a case study using the NCR/32-000.

[BibT_eX]

[DOI]

Proceedings of the 18th annual workshop on Microprogramming, 1985

Performance Studies of a Prolog Machine Architecture.

[BibT_eX]

[DOI]

Tep P. Dobry

Proceedings of the 12th Annual Symposium on Computer Architecture, 1985

Aquarius - A High Performance Computing System for Symbolic/Numeric Applications.

[BibT_eX]

Proceedings of the Spring COMPCON'85, 1985

1984

Alternative proposals for implementing Prolog concurrently and implications regarding their respective microarchitectures.

[BibT_eX]

[DOI]

Carl Ponder

Proceedings of the 17th annual workshop on Microprogramming, 1984

Design decisions influencing the microarchitecture for a Prolog machine.

[BibT_eX]

[DOI]

Tep P. Dobry

Proceedings of the 17th annual workshop on Microprogramming, 1984

The Aquarius Project.

[BibT_eX]

Proceedings of the COMPCON'84, Digest of Papers, Twenty-Eighth IEEE Computer Society International Conference, San Francisco, California, USA, February 27, 1984

1979

Some results on the asymptotic behavior of functions on subsets of the natural numbers.

[BibT_eX]

[DOI]

David F. McAllister

Proceedings of the 17th Annual Southeast Regional Conference, 1979

1977

Independent necessary conditions for functional completeness in m-valued logic.

[BibT_eX]

[DOI]