Krste Asanovic

Parthasarathy Ranganathan

IEEE Micro, 2023

AuRORA: Virtualized Accelerator Orchestration for Multi-Tenant Workloads.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023

CDPU: Co-designing Compression and Decompression Processing Units for Hyperscale Systems.

[BibT_eX]

[DOI]

Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

Profiling Hyperscale Big Data Processing.

[BibT_eX]

[DOI]

Parthasarathy Ranganathan

Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

MoCA: Memory-Centric, Adaptive Execution for Multi-Tenant Deep Neural Networks.

[BibT_eX]

[DOI]

Seah Kim

Hasan Genc

Vadim Vadimovich Nikiforov

Parthasarathy Ranganathan

Borivoje Nikolic

Yakun Sophia Shao

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023

2022

An Eight-Core 1.44-GHz RISC-V Vector Processor in 16-nm FinFET.

[BibT_eX]

[DOI]

Vladimir M. Milovanovic

IEEE J. Solid State Circuits, 2022

Verifying RISC-V Physical Memory Protection.

[BibT_eX]

[DOI]

CoRR, 2022

Hammer: a modular and reusable physical design flow tool: invited.

[BibT_eX]

[DOI]

Nayiri Krzysztofowicz

Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Cerberus: A Formal Approach to Secure and Efficient Enclave Memory Sharing.

[BibT_eX]

[DOI]

Anjo Vahldiek-Oberwagner

Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security, 2022

2021

Accelerating Genomic Data Analytics With Composable Hardware Acceleration Framework.

[BibT_eX]

[DOI]

IEEE Micro, 2021

Accessible, FPGA Resource-Optimized Simulation of Multiclock Systems in FireSim.

[BibT_eX]

[DOI]

IEEE Micro, 2021

A Hardware Accelerator for Protocol Buffers.

[BibT_eX]

[DOI]

Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021

4.3 An Eight-Core 1.44GHz RISC-V Vector Machine in 16nm FinFET.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Solid-State Circuits Conference, 2021

COBRA: A Framework for Evaluating Compositions of Hardware Branch Predictors.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2021

Vertically Integrated Computing Labs Using Open-Source Hardware Generators and Cloud-Hosted FPGAs.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2021

A 16mm2 106.1 GOPS/W Heterogeneous RISC-V Multi-Core Multi-Accelerator SoC in Low-Power 22nm FinFET.

[BibT_eX]

[DOI]

Proceedings of the 47th ESSCIRC 2021, 2021

Gemmini: Enabling Systematic Deep-Learning Architecture Evaluation via Full-Stack Integration.

[BibT_eX]

[DOI]

Jonathan Ragan-Kelley

Borivoje Nikolic

Yakun Sophia Shao

Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

2020

A Dual-Core RISC-V Vector Processor With On-Chip Fine-Grain Power Management in 28-nm FD-SOI.

[BibT_eX]

[DOI]

John Charles Wright

Colin Schmidt

Ben Keller

IEEE Trans. Very Large Scale Integr. Syst., 2020

Chipyard: Integrated Design, Simulation, and Implementation Framework for Custom SoCs.

[BibT_eX]

[DOI]

IEEE Micro, 2020

Building Open Trusted Execution Environments.

[BibT_eX]

[DOI]

IEEE Secur. Priv., 2020

ProTuner: Tuning Programs with Monte Carlo Tree Search.

[BibT_eX]

[DOI]

CoRR, 2020

RLDRM: Closed Loop Dynamic Cache Allocation with Deep Reinforcement Learning for Network Function Virtualization.

[BibT_eX]

[DOI]

Proceedings of the 6th IEEE Conference on Network Softwarization, 2020

AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Third Conference on Machine Learning and Systems, 2020

Genesis: A Hardware Acceleration Framework for Genomic Data Analysis.

[BibT_eX]

[DOI]

Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

Keystone: an open framework for architecting trusted execution environments.

[BibT_eX]

[DOI]

Proceedings of the EuroSys '20: Fifteenth EuroSys Conference 2020, 2020

Invited: Chipyard - An Integrated SoC Research and Implementation Environment.

[BibT_eX]

[DOI]

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

An Energy-Efficient RISC-V RV32IMAC Microcontroller for Periodical-Driven Sensing Applications.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE Custom Integrated Circuits Conference, 2020

NeuroVectorizer: end-to-end vectorization with deep reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the CGO '20: 18th ACM/IEEE International Symposium on Code Generation and Optimization, 2020

FirePerf: FPGA-Accelerated Full-System Hardware/Software Performance Profiling and Co-Design.

[BibT_eX]

[DOI]

Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020

2019

A Hardware Accelerator for Tracing Garbage Collection.

[BibT_eX]

[DOI]

Martin Maas

John Kubiatowicz

IEEE Micro, 2019

FireSim: FPGA-Accelerated Cycle-Exact Scale-Out System Simulation in the Public Cloud.

[BibT_eX]

[DOI]

IEEE Micro, 2019

BROOM: An Open-Source Out-of-Order Processor With Resilient Low-Voltage Operation in 28-nm CMOS.

[BibT_eX]

[DOI]

IEEE Micro, 2019

Co-design of deep neural nets and neural net accelerators for embedded vision applications.

[BibT_eX]

[DOI]

IBM J. Res. Dev., 2019

Sanctorum: A lightweight security monitor for secure enclaves.

[BibT_eX]

[DOI]

IACR Cryptol. ePrint Arch., 2019

Gemmini: An Agile Systolic Array Generator Enabling Systematic Evaluations of Deep-Learning Architectures.

[BibT_eX]

[DOI]

CoRR, 2019

Deep Reinforcement Learning in System Optimization.

[BibT_eX]

[DOI]

CoRR, 2019

Keystone: A Framework for Architecting TEEs.

[BibT_eX]

[DOI]

CoRR, 2019

AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Simmani: Runtime Power Modeling for Arbitrary RTL with Automatic Signal Selection.

[BibT_eX]

[DOI]

Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

Golden Gate: Bridging The Resource-Efficiency Gap Between ASICs and FPGA Prototypes.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Computer-Aided Design, 2019

Centrifuge: Evaluating full-system HLS-generated heterogenous-accelerator SoCs using FPGA-Acceleration.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Computer-Aided Design, 2019

FPGA Accelerated INDEL Realignment in the Cloud.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

FASED: FPGA-Accelerated Simulation and Evaluation of DRAM.

[BibT_eX]

[DOI]

Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

AutoPhase: Compiler Phase-Ordering for HLS with Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2019

Open-Source EDA Tools and IP, A View from the Trenches.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual Design Automation Conference 2019, 2019

2018

An Out-of-Order RISC-V Processor with Resilient Low-Voltage Operation in 28NM CMOS.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Symposium on VLSI Circuits, 2018

DESSERT: Debugging RTL Effectively with State Snapshotting for Error Replays across Trillions of Cycles.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Field Programmable Logic and Applications, 2018

Generating the Next Wave of Custom Silicon.

[BibT_eX]

[DOI]

Borivoje Nikolic

Elad Alon

Proceedings of the 44th IEEE European Solid State Circuits Conference, 2018

2017

Reprogrammable Redundancy for SRAM-Based Cache Vmin Reduction in a 28-nm RISC-V Processor.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2017

A RISC-V Processor SoC With Integrated Power Management at Submicrosecond Timescales in 28 nm FD-SOI.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2017

Distributed-Memory Breadth-First Search on Massive Graphs.

[BibT_eX]

[DOI]

CoRR, 2017

Microprocessor chip with photonic I/O.

[BibT_eX]

[DOI]

Proceedings of the Optical Fiber Communications Conference and Exhibition, 2017

Reducing Pagerank Communication via Propagation Blocking.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Cyclist: Accelerating hardware development.

[BibT_eX]

[DOI]

Jonathan Bachrach

Albert Magyar

Patrick Li

Richard Lin

Proceedings of the 2017 IEEE/ACM International Conference on Computer-Aided Design, 2017

Return of the Runtimes: Rethinking the Language Runtime System for the Cloud 3.0 Era.

[BibT_eX]

[DOI]

Martin Maas

John Kubiatowicz

Proceedings of the 16th Workshop on Hot Topics in Operating Systems, 2017

A Hardware Accelerator for Computing an Exact Dot Product.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE Symposium on Computer Arithmetic, 2017

2016

An Agile Approach to Building RISC-V Microprocessors.

[BibT_eX]

[DOI]

IEEE Micro, 2016

A RISC-V Vector Processor With Simultaneous-Switching Switched-Capacitor DC-DC Converters in 28 nm FDSOI.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2016

The Renewed Case for the Reduced Instruction Set Computer: Avoiding ISA Bloat with Macro-Op Fusion for RISC-V.

[BibT_eX]

[DOI]

Christopher Celio

CoRR, 2016

Strober: Fast and Accurate Sample-Based Energy Simulation for Arbitrary RTL.

[BibT_eX]

[DOI]

Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

Vector Processors for Energy-Efficient Embedded Systems.

[BibT_eX]

[DOI]

Proceedings of the Fourth ACM International Workshop on Many-core Embedded Systems, 2016

Sub-microsecond adaptive voltage scaling in a 28nm FD-SOI processor SoC.

[BibT_eX]

[DOI]

Proceedings of the ESSCIRC Conference 2016: 42nd European Solid-State Circuits Conference, 2016

Reprogrammable redundancy for cache Vmin reduction in a 28nm RISC-V processor.

[BibT_eX]

[DOI]

Proceedings of the IEEE Asian Solid-State Circuits Conference, 2016

Taurus: A Holistic Language Runtime System for Coordinating Distributed Managed-Language Applications.

[BibT_eX]

[DOI]

Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems, 2016

2015

Per-Core DVFS With Switched-Capacitor Converters for Energy Efficiency in Manycore Processors.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2015

Single-chip microprocessor that communicates directly using light.

[BibT_eX]

[DOI]

Nat., 2015

The GAP Benchmark Suite.

[BibT_eX]

[DOI]

CoRR, 2015

A RISC-V vector processor with tightly-integrated switched-capacitor DC-DC converters in 28nm FDSOI.

[BibT_eX]

[DOI]

Proceedings of the Symposium on VLSI Circuits, 2015

GAIL: the graph algorithm iron law.

[BibT_eX]

[DOI]

Proceedings of the 5th Workshop on Irregular Applications - Architectures and Algorithms, 2015

Locality Exists in Graph Processing: Workload Characterization on an Ivy Bridge Server.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Symposium on Workload Characterization, 2015

Trash Day: Coordinating Garbage Collection in Distributed Systems.

[BibT_eX]

[DOI]

Proceedings of the 15th Workshop on Hot Topics in Operating Systems, 2015

Raven: A 28nm RISC-V vector processor with integrated switched-capacitor DC-DC converters and adaptive clocking.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Hot Chips 27 Symposium (HCS), 2015

DIABLO: A Warehouse-Scale Computer Network Simulator using FPGAs.

[BibT_eX]

[DOI]

Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems, 2015

2014

Exploring the Design Space of SPMD Divergence Management on Data-Parallel Architectures.

[BibT_eX]

[DOI]

Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014

Author retrospective for optimizing matrix multiply using PHiPAC: a portable high-performance ANSI C coding methodology.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Supercomputing 25th Anniversary Volume, 2014

Joint impact of random variations and RTN on dynamic writeability in 28nm bulk and FDSOI SRAM.

[BibT_eX]

[DOI]

Proceedings of the 44th European Solid State Device Research Conference, 2014

A 45nm 1.3GHz 16.7 double-precision GFLOPS/W RISC-V processor with vector accelerators.

[BibT_eX]

[DOI]

Proceedings of the ESSCIRC 2014, 2014

2013

Exploring the Tradeoffs between Programmability and Efficiency in Data-Parallel Accelerators.

[BibT_eX]

[DOI]

ACM Trans. Comput. Syst., 2013

Direction-optimizing breadth-first search.

[BibT_eX]

[DOI]

Sci. Program., 2013

A hardware evaluation of cache partitioning to improve utilization and energy-efficiency while preserving responsiveness.

[BibT_eX]

[DOI]

Proceedings of the 40th Annual International Symposium on Computer Architecture, 2013

Distributed Memory Breadth-First Search Revisited: Enabling Bottom-Up Search.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

The RISC-V instruction set.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Hot Chips 25 Symposium (HCS), 2013

Measuring the gap between programmable and fixed-function accelerators: A case study on speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Hot Chips 25 Symposium (HCS), 2013

Welcome from general chairs.

[BibT_eX]

[DOI]

Jan-Willem van de Waerdt

Proceedings of the 2013 IEEE Hot Chips 25 Symposium (HCS), 2013

Tessellation: refactoring the OS around explicit resource containers with continuous adaptation.

[BibT_eX]

[DOI]

Proceedings of the 50th Annual Design Automation Conference 2013, 2013

Convergence and scalarization for data-parallel architectures.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization, 2013

PHANTOM: practical oblivious computation in a secure processor.

[BibT_eX]

[DOI]

Proceedings of the 2013 ACM SIGSAC Conference on Computer and Communications Security, 2013

2012

SRAM Assist Techniques for Operation in a Wide Voltage Range in 28-nm CMOS.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. II Express Briefs, 2012

Globally Synchronized Frames for guaranteed quality-of-service in on-chip networks.

[BibT_eX]

[DOI]

Jae W. Lee

Man Cheuk Ng

J. Parallel Distributed Comput., 2012

Designing Chip-Level Nanophotonic Interconnection Networks.

[BibT_eX]

[DOI]

IEEE J. Emerg. Sel. Topics Circuits Syst., 2012

Context-centric Security.

[BibT_eX]

[DOI]

Proceedings of the 7th USENIX Workshop on Hot Topics in Security, 2012

GPUs as an opportunity for offloading garbage collection.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Memory Management, 2012

Chisel: constructing hardware in a Scala embedded language.

[BibT_eX]

[DOI]

Proceedings of the 49th Annual Design Automation Conference 2012, 2012

2011

Real-time Musical Applications on an Experimental Operating System for Multi-Core Processors.

[BibT_eX]

[DOI]

Proceedings of the 2011 International Computer Music Conference, 2011

The Maven vector-thread architecture.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Hot Chips 23 Symposium (HCS), 2011

Tessellation operating system: Building a real-time, responsive, high-throughput client OS for many-core architectures.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Hot Chips 23 Symposium (HCS), 2011

2010

Guest Editors' Introduction: Hot Chips 21.

[BibT_eX]

[DOI]

Ralph Wittig

IEEE Micro, 2010

Composing parallel software efficiently with lithe.

[BibT_eX]

[DOI]

Heidi Pan

Benjamin Hindman

Proceedings of the 2010 ACM SIGPLAN Conference on Programming Language Design and Implementation, 2010

A case for FAME: FPGA architecture model execution.

[BibT_eX]

[DOI]

Proceedings of the 37th International Symposium on Computer Architecture (ISCA 2010), 2010

Re-architecting DRAM memory systems with monolithically integrated silicon photonics.

[BibT_eX]

[DOI]

Proceedings of the 37th International Symposium on Computer Architecture (ISCA 2010), 2010

RAMP gold: an FPGA-based architecture simulator for multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the 47th Design Automation Conference, 2010

2009

Building Many-Core Processor-to-DRAM Networks with Monolithic CMOS Silicon Photonics.

[BibT_eX]

[DOI]

IEEE Micro, 2009

A view of the parallel computing landscape.

[BibT_eX]

[DOI]

Commun. ACM, 2009

Silicon-photonic clos networks for global on-chip communication.

[BibT_eX]

[DOI]

Proceedings of the Third International Symposium on Networks-on-Chips, 2009

Designing multi-socket systems using silicon photonics.

[BibT_eX]

[DOI]

Proceedings of the 23rd international conference on Supercomputing, 2009

2008

Implementing the scale vector-thread processor.

[BibT_eX]

[DOI]

Ronny Krashinsky

Christopher Batten

ACM Trans. Design Autom. Electr. Syst., 2008

MEMOCODE 2008 Co-Design Contest.

[BibT_eX]

[DOI]

Patrick Schaumont

James C. Hoe

Proceedings of the 6th ACM & IEEE International Conference on Formal Methods and Models for Co-Design (MEMOCODE 2008), 2008

Building Manycore Processor-to-DRAM Networks with Monolithic Silicon Photonics.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual IEEE Symposium on High Performance Interconnects (HOTI 2008), 2008

Compiling for vector-thread architectures.

[BibT_eX]

[DOI]

Mark Hampton

Proceedings of the Sixth International Symposium on Code Generation and Optimization (CGO 2008), 2008

2007

Activity-Sensitive Flip-Flop and Latch Selection for Reduced Energy.

[BibT_eX]

[DOI]

Ronny Krashinsky

Christoforos E. Kozyrakis

IEEE Trans. Very Large Scale Integr. Syst., 2007

RAMP: Research Accelerator for Multiple Processors.

[BibT_eX]

[DOI]

James C. Hoe

Derek Chiou

IEEE Micro, 2007

Continual hashing for efficient fine-grain state inconsistency detection.

[BibT_eX]

[DOI]

Jae W. Lee

Myron King

Proceedings of the 25th International Conference on Computer Design, 2007

Transactors for parallel hardware and software co-design.

[BibT_eX]

[DOI]

Proceedings of the IEEE International High Level Design Validation and Test Workshop, 2007

2006

Energy-aware lossless data compression.

[BibT_eX]

[DOI]

Kenneth C. Barr

ACM Trans. Comput. Syst., 2006

Unbounded Transactional Memory.

[BibT_eX]

[DOI]

IEEE Micro, 2006

Rethinking Hardware Support for Network Analysis and Intrusion Prevention.

[BibT_eX]

[DOI]

Proceedings of the 1st USENIX Workshop on Hot Topics in Security, 2006

METERG: Measurement-Based End-to-End Performance Estimation Technique in QoS-Capable Multiprocessors.

[BibT_eX]

[DOI]

Jae W. Lee

Proceedings of the 12th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS 2006), 2006

Accelerating architectural exploration using canonical instruction segments.

[BibT_eX]

[DOI]

Rose F. Liu

Proceedings of the 2006 IEEE International Symposium on Performance Analysis of Systems and Software, 2006

Branch trace compression for snapshot-based simulation.

[BibT_eX]

[DOI]

Kenneth C. Barr

Proceedings of the 2006 IEEE International Symposium on Performance Analysis of Systems and Software, 2006

Implementing virtual memory in a vector processor with software restart markers.

[BibT_eX]

[DOI]

Mark Hampton

Proceedings of the 20th Annual International Conference on Supercomputing, 2006

Research accelerator for multiple processors.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE Hot Chips 18 Symposium (HCS), 2006

2005

A Speculative Control Scheme for an Energy-Efficient Banked Register Fil.

[BibT_eX]

[DOI]

Jessica H. Tseng

IEEE Trans. Computers, 2005

Controlling program execution through binary instrumentation.

[BibT_eX]

[DOI]

SIGARCH Comput. Archit. News, 2005

Mondrix: memory isolation for linux using mondriaan memory protection.

[BibT_eX]

[DOI]

Emmett Witchel

Junghwan Rhee

Proceedings of the 20th ACM Symposium on Operating Systems Principles 2005, 2005

Accelerating Multiprocessor Simulation with a Memory Timestamp Record.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2005

Replacing global wires with an on-chip network: a power analysis.

[BibT_eX]

[DOI]

Proceedings of the 2005 International Symposium on Low Power Electronics and Design, 2005

Victim Replication: Maximizing Capacity while Hiding Wire Delay in Tiled Chip Multiprocessors.

[BibT_eX]

[DOI]

Michael Zhang

Proceedings of the 32st International Symposium on Computer Architecture (ISCA 2005), 2005

2004

The Vector-Thread Architecture.

[BibT_eX]

[DOI]

IEEE Micro, 2004

Cache Refill/Access Decoupling for Vector Machines.

[BibT_eX]

[DOI]

Proceedings of the 37th Annual International Symposium on Microarchitecture (MICRO-37 2004), 2004

Power-optimal pipelining in deep submicron technology.

[BibT_eX]

[DOI]

Proceedings of the 2004 International Symposium on Low Power Electronics and Design, 2004

2003

Reducing power density through activity migration.

[BibT_eX]

[DOI]

Kenneth C. Barr

Proceedings of the 2003 International Symposium on Low Power Electronics and Design, 2003

Banked Multiported Register Files for High-Frequency Superscalar Microprocessors.

[BibT_eX]

[DOI]

Jessica H. Tseng

Proceedings of the 30th International Symposium on Computer Architecture (ISCA 2003), 2003

Hardware Works, Software Doesn't: Enforcing Modularity with Mondriaan Memory Protection.

[BibT_eX]

[DOI]

Emmett Witchel

Proceedings of HotOS'03: 9th Workshop on Hot Topics in Operating Systems, 2003

2002

Fine-grain CAM-tag cache resizing using miss tags.

[BibT_eX]

[DOI]

Michael Zhang

Proceedings of the 2002 International Symposium on Low Power Electronics and Design, 2002

Dynamic Fine-Grain Leakage Reduction Using Leakage-Biased Bitlines.

[BibT_eX]

[DOI]

Proceedings of the 29th International Symposium on Computer Architecture (ISCA 2002), 2002

Mondrian memory protection.

[BibT_eX]

[DOI]

Emmett Witchel

Josh Cates

Proceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-X), 2002

2001

Multithreading decoupled architectures for complexity-effective general purpose computing.

[BibT_eX]

[DOI]

Michael Sung

Ronny Krashinsky

SIGARCH Comput. Archit. News, 2001

Direct addressed caches for reduced power consumption.

[BibT_eX]

[DOI]

Proceedings of the 34th Annual International Symposium on Microarchitecture, 2001

Heads and tails: a variable-length instruction format supporting parallel fetch and decode.

[BibT_eX]

[DOI]

Heidi Pan

Proceedings of the 2001 International Conference on Compilers, 2001

2000

Energy-Efficient Register Access.

[BibT_eX]

[DOI]

Jessica H. Tseng

Proceedings of the 13th Annual Symposium on Integrated Circuits and Systems Design, 2000

Dynamic zero compression for cache energy reduction.

[BibT_eX]

[DOI]

Luis Villa

Michael Zhang

Christoforos E. Kozyrakis

Proceedings of the 33rd Annual IEEE/ACM International Symposium on Microarchitecture, 2000

1997

Scalable Processors in the Billion-Transistor Era: IRAM.

[BibT_eX]

[DOI]

Computer, 1997

A Fast Kohonen Net Implementation for Spert-II.

[BibT_eX]

[DOI]

Christoforos E. Kozyrakis

Proceedings of the Biological and Artificial Computation: From Neuroscience to Technology, 1997

Optimizing Matrix Multiply Using PHiPAC: A Portable, High-Performance, ANSI C Coding Methodology.

[BibT_eX]

[DOI]

Proceedings of the 11th international conference on Supercomputing, 1997

Intelligent RAM (IRAM): The Industrial Setting, Applications and Architectures.

[BibT_eX]

[DOI]

Proceedings of the Proceedings 1997 International Conference on Computer Design: VLSI in Computers & Processors, 1997

Using PHiPAC to speed error back-propagation learning.

[BibT_eX]

[DOI]

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996

Spert-II: A Vector Microprocessor System.

[BibT_eX]

[DOI]

Computer, 1996

1995

SPERT-II: A Vector Microprocessor System and its Application to Large Problems in Backpropagation Training.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 8, 1995

1993

Using simulations of reduced precision arithmetic to design a neuro-microprocessor.

[BibT_eX]

[DOI]

Nelson Morgan

John Wawrzynek

J. VLSI Signal Process., 1993

The design of a neuro-microprocessor.

[BibT_eX]

[DOI]

John Wawrzynek