Norman P. Jouppi

Muhammad Mukarram Bin Tariq

Jung Ho Ahn

CoRR, 2023

Lightwave Fabrics: At-Scale Optical Circuit Switching for Datacenter and Machine Learning Systems.

[BibT_eX]

[DOI]

Amin Vahdat

Proceedings of the ACM SIGCOMM 2023 Conference, 2023

TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

A Machine Learning Supercomputer with an Optically Reconfigurable Interconnect and Embeddings Support.

[BibT_eX]

[DOI]

Andy Swing

Proceedings of the 35th IEEE Hot Chips Symposium, 2023

Hyperscale Hardware Optimized Neural Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2021

The Design Process for Google's Training Chips: TPUv2 and TPUv3.

[BibT_eX]

[DOI]

IEEE Micro, 2021

Ten Lessons From Three Generations Shaped Google's TPUv4i : Industrial Product.

[BibT_eX]

[DOI]

Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

NeuroMeter: An Integrated Power, Area, and Timing Modeling Framework for Machine Learning Accelerators Industry Track Paper.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

Searching for Fast Model Families on Datacenter Accelerators.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Highly Available Data Parallel ML training on Mesh Networks.

[BibT_eX]

[DOI]

Sameer Kumar

Norm Jouppi

CoRR, 2020

A domain-specific supercomputer for training deep neural networks.

[BibT_eX]

[DOI]

Commun. ACM, 2020

Google's Training Chips Revealed: TPUv2 and TPUv3.

[BibT_eX]

[DOI]

Proceedings of the IEEE Hot Chips 32 Symposium, 2020

2018

Motivation for and Evaluation of the First Tensor Processing Unit.

[BibT_eX]

[DOI]

IEEE Micro, 2018

A domain-specific architecture for deep neural networks.

[BibT_eX]

[DOI]

Commun. ACM, 2018

2017

In-Datacenter Performance Analysis of a Tensor Processing Unit.

[BibT_eX]

[DOI]

Tara Vazir Ghaemmaghami

CoRR, 2017

In-Datacenter Performance Analysis of a Tensor Processing Unit.

[BibT_eX]

[DOI]

Tara Vazir Ghaemmaghami

Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017

2016

Common Bonds: MIPS, HPS, Two-Level Branch Prediction, and Compressed Code RISC Processor.

[BibT_eX]

[DOI]

IEEE Micro, 2016

2015

CACTI-IO: CACTI With OFF-Chip Power-Area-Timing Models.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2015

History-Assisted Adaptive-Granularity Caches (HAAG$) for High Performance 3D DRAM Architectures.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM on International Conference on Supercomputing, 2015

2014

Efficient Data Mapping and Buffering Techniques for Multilevel Cell Phase-Change Memories.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2014

Endurance-aware cache line management for non-volatile caches.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2014

2013

The McPAT Framework for Multicore and Manycore Architectures: Simultaneously Modeling Power, Area, and Timing.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2013

A circuit-architecture co-optimization framework for exploring nonvolatile memory hierarchies.

[BibT_eX]

[DOI]

Xiangyu Dong

ACM Trans. Archit. Code Optim., 2013

Practical nonvolatile multilevel-cell phase change memory.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2013

Kiln: closing the performance gap between systems with and without persistence support.

[BibT_eX]

[DOI]

Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture, 2013

A circuit-architecture co-optimization framework for evaluating emerging memory hierarchies.

[BibT_eX]

[DOI]

Xiangyu Dong

Proceedings of the 2012 IEEE International Symposium on Performance Analysis of Systems & Software, 2013

McSimA+: A manycore simulator with application-level+ simulation and detailed microarchitecture modeling.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Symposium on Performance Analysis of Systems & Software, 2013

Design of cross-point metal-oxide ReRAM emphasizing reliability and cost.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, 2013

i<sup>2</sup>WAP: Improving non-volatile cache lifetime by reducing inter- and intra-set write variations.

[BibT_eX]

[DOI]

Proceedings of the 19th IEEE International Symposium on High Performance Computer Architecture, 2013

Understanding the trade-offs in multi-level cell ReRAM memory design.

[BibT_eX]

[DOI]

Proceedings of the 50th Annual Design Automation Conference 2013, 2013

2012

NVSim: A Circuit-Level Performance, Energy, and Area Model for Emerging Nonvolatile Memory.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2012

Improving System Energy Efficiency with Memory Rank Subsetting.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2012

Free-p: A Practical End-to-End Nonvolatile Memory Protection Mechanism.

[BibT_eX]

[DOI]

Doe Hyun Yoon

Jichuan Chang

Mattan Erez

IEEE Micro, 2012

Optical High Radix Switch Design.

[BibT_eX]

[DOI]

IEEE Micro, 2012

MAGE: adaptive granularity and ECC for resilient and power efficient memory systems.

[BibT_eX]

[DOI]

Proceedings of the SC Conference on High Performance Computing Networking, 2012

Design trade-offs for high density cross-point resistive memory.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Low Power Electronics and Design, 2012

LOT-ECC: Localized and tiered reliability mechanisms for commodity memory systems.

[BibT_eX]

[DOI]

Aniruddha N. Udipi

Proceedings of the 39th International Symposium on Computer Architecture (ISCA 2012), 2012

Staged Reads: Mitigating the impact of DRAM writes on DRAM reads.

[BibT_eX]

[DOI]

Niladrish Chatterjee

Proceedings of the 18th IEEE International Symposium on High Performance Computer Architecture, 2012

CACTI-3DD: Architecture-level modeling for 3D die-stacked DRAM main memory.

[BibT_eX]

[DOI]

Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012

2011

Multi-Core Cache Hierarchies

[BibT_eX]

[DOI]

Synthesis Lectures on Computer Architecture, Morgan & Claypool Publishers, ISBN: 978-3-031-01734-6, 2011

Hybrid checkpointing using emerging nonvolatile memories for future exascale systems.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2011

DRAM errors in the wild: technical perspective.

[BibT_eX]

[DOI]

Commun. ACM, 2011

System implications of memory reliability in exascale computing.

[BibT_eX]

[DOI]

Proceedings of the Conference on High Performance Computing Networking, 2011

System-level integrated server architectures for scale-out datacenters.

[BibT_eX]

[DOI]

Proceedings of the 44rd Annual IEEE/ACM International Symposium on Microarchitecture, 2011

Combining memory and a controller with photonics through 3D-stacking to enable scalable and energy-efficient systems.

[BibT_eX]

[DOI]

Aniruddha N. Udipi

Proceedings of the 38th International Symposium on Computer Architecture (ISCA 2011), 2011

The role of optics in future high radix switch design.

[BibT_eX]

[DOI]

Proceedings of the 38th International Symposium on Computer Architecture (ISCA 2011), 2011

CACTI-P: Architecture-level modeling for SRAM-based structures with advanced leakage reduction techniques.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE/ACM International Conference on Computer-Aided Design, 2011

FREE-p: Protecting non-volatile memory against both hard and soft errors.

[BibT_eX]

[DOI]

Doe Hyun Yoon

Jichuan Chang

Mattan Erez

Proceedings of the 17th International Conference on High-Performance Computer Architecture (HPCA-17 2011), 2011

Design implications of memristor-based RRAM cross-point structures.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation and Test in Europe, 2011

CMOS Nanophotonics: Technology, System Implications, and a CMP Case Study.

[BibT_eX]

[DOI]

Jung Ho Ahn

Proceedings of the Low Power Networks-on-Chip., 2011

2010

Simple but Effective Heterogeneous Main Memory with On-Chip Memory Controller Support.

[BibT_eX]

[DOI]

Proceedings of the Conference on High Performance Computing Networking, 2010

Rethinking DRAM design and organization for energy-constrained multi-cores.

[BibT_eX]

[DOI]

Aniruddha N. Udipi

Niladrish Chatterjee

Proceedings of the 37th International Symposium on Computer Architecture (ISCA 2010), 2010

2009

Introduction to the special issue on the 2008 workshop on design, analysis, and simulation of chip multiprocessors (dasCMP'08).

[BibT_eX]

[DOI]

SIGARCH Comput. Archit. News, 2009

A High-Speed Optical Multidrop Bus for Computer Interconnections.

[BibT_eX]

[DOI]

IEEE Micro, 2009

Multicore DIMM: an Energy Efficient Memory Module with Independently Controlled DRAMs.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2009

Technical perspective - Software and hardware support for deterministic replay of parallel programs.

[BibT_eX]

[DOI]

Commun. ACM, 2009

Leveraging 3D PCRAM technologies to reduce checkpoint overhead for future exascale systems.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

Future scaling of processor-memory interfaces.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

McPAT: an integrated power, area, and timing modeling framework for multicore and manycore architectures.

[BibT_eX]

[DOI]

Proceedings of the 42st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-42 2009), 2009

Emerging technologies and their impact on system design.

[BibT_eX]

[DOI]

Proceedings of the 2009 International Symposium on Low Power Electronics and Design, 2009

PCRAMsim: System-level performance, energy, and area modeling for Phase-Change RAM.

[BibT_eX]

[DOI]

Xiangyu Dong

Proceedings of the 2009 International Conference on Computer-Aided Design, 2009

Resilience Challenges for Exascale Systems.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems, 2009

2008

Introduction to the special issue on the 2007 workshop on design, analysis, and simulation of chip multiprocessors (dasCMP'07).

[BibT_eX]

[DOI]

SIGARCH Comput. Archit. News, 2008

Architecting Efficient Interconnects for Large Caches with CACTI 6.0.

[BibT_eX]

[DOI]

IEEE Micro, 2008

Implementing high availability memory with a duplication cache.

[BibT_eX]

[DOI]

Proceedings of the 41st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-41 2008), 2008

System implications of integrated photonics.

[BibT_eX]

[DOI]

Proceedings of the 2008 International Symposium on Low Power Electronics and Design, 2008

Corona: System Implications of Emerging Nanophotonic Technology.

[BibT_eX]

[DOI]

Jung Ho Ahn

Proceedings of the 35th International Symposium on Computer Architecture (ISCA 2008), 2008

A Comprehensive Memory Modeling Tool and Its Application to the Design and Analysis of Future Memory Hierarchies.

[BibT_eX]

[DOI]

Proceedings of the 35th International Symposium on Computer Architecture (ISCA 2008), 2008

A High-Speed Optical Multi-Drop Bus for Computer Interconnections.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual IEEE Symposium on High Performance Interconnects (HOTI 2008), 2008

A Nanophotonic Interconnect for High-Performance Many-Core Computation.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual IEEE Symposium on High Performance Interconnects (HOTI 2008), 2008

2007

Introduction to the special issue on the 2006 workshop on design, analysis, and simulation of chip multiprocessors: (dasCMP'06).

[BibT_eX]

[DOI]

SIGARCH Comput. Archit. News, 2007

Isolation in Commodity Multicore Processors.

[BibT_eX]

[DOI]

Nidhi Aggarwal

James E. Smith

Computer, 2007

High-performance ethernet-based communications for future multi-core processors.

[BibT_eX]

[DOI]

Michael S. Schlansker

Nagabhushan Chitlur

Erwin Oertli

Paul M. Stillwell Jr.

Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, 2007

Optimizing NUCA Organizations and Wiring Alternatives for Large Caches with CACTI 6.0.

[BibT_eX]

[DOI]

Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-40 2007), 2007

Configurable isolation: building high availability systems with commodity multi-core processors.

[BibT_eX]

[DOI]

Nidhi Aggarwal

James E. Smith

Proceedings of the 34th International Symposium on Computer Architecture (ISCA 2007), 2007

Microprocessors in the era of terascale integration.

[BibT_eX]

[DOI]

Shekhar Borkar

Per Stenström

Proceedings of the 2007 Design, Automation and Test in Europe Conference and Exposition, 2007

2006

Architecture - The potential energy efficiency of vector acceleration.

[BibT_eX]

[DOI]

Christophe Lemuet

Jack Sampson

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Exploiting Fine-Grained Data Parallelism with Chip Multiprocessors and Fast Barriers.

[BibT_eX]

[DOI]

Jack Sampson

Rubén González

Michael S. Schlansker

Brad Calder

Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-39 2006), 2006

Improving the performance and power efficiency of shared helpers in CMPs.

[BibT_eX]

[DOI]

Proceedings of the 2006 International Conference on Compilers, 2006

Core architecture optimization for heterogeneous chip multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on Parallel Architectures and Compilation Techniques (PACT 2006), 2006

2005

Dynamically configurable shared CMP helper engines for improved performance.

[BibT_eX]

[DOI]

SIGARCH Comput. Archit. News, 2005

Fast synchronization for chip multiprocessors.

[BibT_eX]

[DOI]

Jack Sampson

Rubén González

Michael S. Schlansker

SIGARCH Comput. Archit. News, 2005

Introduction to the special issue on the 2005 workshop on design, analysis, and simulation of chip multiprocessors (dasCMP'05).

[BibT_eX]

[DOI]

SIGARCH Comput. Archit. News, 2005

Heterogeneous Chip Multiprocessors.

[BibT_eX]

[DOI]

Computer, 2005

System-wide performance monitors and their application to the optimization of coherent memory accesses.

[BibT_eX]

[DOI]

Sami Yehia

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2005

Telepresence Systems With Automatic Preservation of User Head Height, Local Rotation, and Remote Translation.

[BibT_eX]

[DOI]

Stan Thomas

Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

Enterprise IT Trends and Implications for Architecture Research.

[BibT_eX]

[DOI]

Proceedings of the 11th International Conference on High-Performance Computer Architecture (HPCA-11 2005), 2005

2004

BiReality: mutually-immersive telepresence.

[BibT_eX]

[DOI]

Proceedings of the 12th ACM International Conference on Multimedia, 2004

Conjoined-Core Chip Multiprocessing.

[BibT_eX]

[DOI]

Proceedings of the 37th Annual International Symposium on Microarchitecture (MICRO-37 2004), 2004

Single-ISA Heterogeneous Multi-Core Architectures for Multithreaded Workload Performance.

[BibT_eX]

[DOI]

Proceedings of the 31st International Symposium on Computer Architecture (ISCA 2004), 2004

A First Generation Mutually-Immersive Mobile Telepresence Surrogate with Automatic Backtracking.

[BibT_eX]

[DOI]

Subu Iyer

Wayne Mack

April Slayden Mitchell

Stan Thomas

Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

Region of interest editing of MPEG-2 video streams in the compressed domain.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

The Future Evolution of High-Performance Microprocessors.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing, 2004

2003

Processor Power Reduction Via Single-ISA Heterogeneous Multi-Core Architectures.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2003

Single-ISA Heterogeneous Multi-Core Architectures: The Potential for Processor Power Reduction.

[BibT_eX]

[DOI]

Proceedings of the 36th Annual International Symposium on Microarchitecture, 2003

2002

The Optimal Logic Depth Per Pipeline Stage is 6 to 8 FO4 Inverter Delays.

[BibT_eX]

[DOI]

M. S. Hrishikesh

Doug Burger

Stephen W. Keckler

Premkishore Shivakumar

Proceedings of the 29th International Symposium on Computer Architecture (ISCA 2002), 2002

First steps towards mutually-immersive mobile telepresence.

[BibT_eX]

[DOI]

Proceedings of the 2002 ACM on Computer supported cooperative work video program, 2002

First steps towards mutually-immersive mobile telepresence.

[BibT_eX]

[DOI]

Proceedings of the CSCW 2002, 2002

2000

Reconfigurable caches and their application to media processing.

[BibT_eX]

[DOI]

Sarita V. Adve

Proceedings of the 27th International Symposium on Computer Architecture (ISCA 2000), 2000

Prefiltered Antialiased Lines Using Half-Plane Distance Functions.

[BibT_eX]

[DOI]

Bob McNamara

Joel McCormack

Proceedings of the 2000 ACM SIGGRAPH/EUROGRAPHICS Workshop on Graphics Hardware, 2000

1999

Implementing Neon: a 256-bit graphics accelerator.

[BibT_eX]

[DOI]

IEEE Micro, 1999

Real products, real technology Guest Editor's Introduction].

[BibT_eX]

[DOI]

John Wawrzynek

IEEE Micro, 1999

The Multicluster Architecture: Reducing Processor Cycle Time Through Partitioning.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 1999

Feline: Fast Elliptical Lines for Anisotropic Texture Mapping.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference on Computer Graphics and Interactive Techniques, 1999

Performance of Image and Video Processing with General-Purpose Processors and Media ISA Extensions.

[BibT_eX]

[DOI]

Sarita V. Adve

Proceedings of the 26th Annual International Symposium on Computer Architecture, 1999

Z3: An Economical Hardware Technique for High-Quality Antialiasing and Transparency.

[BibT_eX]

[DOI]

Chun-Fa Chang

Proceedings of the 1999 ACM SIGGRAPH/EUROGRAPHICS Workshop on Graphics Hardware, 1999

1998

Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache Prefetch Buffers.

[BibT_eX]

[DOI]

Proceedings of the 25 Years of the International Symposia on Computer Architecture (Selected Papers)., 1998

Retrospective: Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache and Prefetch Buffers.

[BibT_eX]

[DOI]

Proceedings of the 25 Years of the International Symposia on Computer Architecture (Selected Papers)., 1998

Neon: A Single-Chip 3D Workstation Graphics Accelerator.

[BibT_eX]

[DOI]

Proceedings of the 1998 ACM SIGGRAPH/EUROGRAPHICS Workshop on Graphics Hardware, Lisbon, Portugal, August 31, 1998

1997

The Multicluster Architecture: Reducing Cycle Time Through Partitioning.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth Annual IEEE/ACM International Symposium on Microarchitecture, 1997

Complexity-Effective Superscalar Processors.

[BibT_eX]

[DOI]

Subbarao Palacharla

James E. Smith

Proceedings of the 24th International Symposium on Computer Architecture, 1997

Memory-System Design Considerations for Dynamically-Scheduled Processors.

[BibT_eX]

[DOI]

Proceedings of the 24th International Symposium on Computer Architecture, 1997

1996

CACTI: an enhanced cache access and cycle time model.

[BibT_eX]

[DOI]

Steven J. E. Wilton

IEEE J. Solid State Circuits, 1996

A speed, power, and supply noise evaluation of ECL driver circuits.

[BibT_eX]

[DOI]

Stefanos Sidiropoulos

Suresh Menon

IEEE J. Solid State Circuits, 1996

[BibT_eX]

[DOI]

Paul Chow

Proceedings of the Second International Symposium on High-Performance Computer Architecture, 1996

1995

How Useful Are Non-Blocking Loads, Stream Buffers and Speculative Execution in Multiple Issue Processors?

[BibT_eX]

[DOI]

Paul Chow

Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture (HPCA 1995), 1995

1994

Designing, packaging, and testing a 300-MHz, 115 W ECL microprocessor.

[BibT_eX]

[DOI]

Patrick D. Boyle

John S. Fitch

IEEE Micro, 1994

Tradeoffs in Two-Level On-Chip Caching.

[BibT_eX]

[DOI]

Steven J. E. Wilton

Proceedings of the 21st Annual International Symposium on Computer Architecture. Chicago, 1994

Complexity/Performance Tradeoffs with Non-Blocking Loads.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual International Symposium on Computer Architecture. Chicago, 1994

1993

Cache Write Policies and Performance.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual International Symposium on Computer Architecture, 1993

1992

A Simulation Based Study of TLB Performance.

[BibT_eX]

[DOI]

J. Bradley Chen

Anita Borg

Proceedings of the 19th Annual International Symposium on Computer Architecture. Gold Coast, 1992

1991

Computer Technology and Architecture: An Evolving Interaction.

[BibT_eX]

[DOI]

John L. Hennessy

Computer, 1991

1990

Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache and Prefetch Buffers.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual International Symposium on Computer Architecture, 1990

1989

A 20-MIPS sustained 32-bit CMOS microprocessor with high ratio of sustained to peak performance.

[BibT_eX]

[DOI]

Jeffrey Y.-F. Tang

IEEE J. Solid State Circuits, October, 1989

The Nonuniform Distribution of Instruction-Level and Machine Parallelism and Its Effect on Performance.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 1989

Architectural and Organizational Tradeoffs in the Design of the MultiTitan CPU.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual International Symposium on Computer Architecture. Jerusalem, 1989

Integration and packaging plateaus of processor performance.

[BibT_eX]

[DOI]

Proceedings of the Computer Design: VLSI in Computers and Processors, 1989

Available Instruction-Level Parallelism for Superscalar and Superpipelined Machines.

[BibT_eX]

[DOI]

David W. Wall

Proceedings of the ASPLOS-III Proceedings, 1989

A Unified Vector/Scalar Floating-Point Architecture.

[BibT_eX]

[DOI]

Jonathan Bertoni

David W. Wall

Proceedings of the ASPLOS-III Proceedings, 1989

1988

Superscalar vs. superpipelined machines.

[BibT_eX]

[DOI]

SIGARCH Comput. Archit. News, 1988

1987

Timing Analysis and Performance Improvement of MOS VLSI Designs.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 1987

Derivation of Signal Flow Direction in MOS VLSI.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 1987

1983

Timing analysis for nMOS VLSI.

[BibT_eX]

[DOI]