James Laudon

Orcid: 0000-0003-1061-1780

According to our database1, James Laudon authored at least 44 papers between 1987 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Scalable Multi-Domain Adaptation of Language Models using Modular Experts.
CoRR, 2024

2023
GiPH: Generalizable Placement Learning for Adaptive Heterogeneous Computing.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

Brainformers: Trading Simplicity for Efficiency.
Proceedings of the International Conference on Machine Learning, 2023

Lifelong Language Pretraining with Distribution-Specialized Experts.
Proceedings of the International Conference on Machine Learning, 2023

2022
Mixture-of-Experts with Expert Choice Routing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards the Co-design of Neural Networks and Accelerators.
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules.
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

An Evaluation of Edge TPU Accelerators for Convolutional Neural Networks.
Proceedings of the IEEE International Symposium on Workload Characterization, 2022

2021
A graph placement methodology for fast chip design.
Nat., 2021

The Design Process for Google's Training Chips: TPUv2 and TPUv3.
IEEE Micro, 2021

Rethinking Co-design of Neural Architectures and Hardware Accelerators.
CoRR, 2021

Apollo: Transferable Architecture Exploration.
CoRR, 2021

Ten Lessons From Three Generations Shaped Google's TPUv4i : Industrial Product.
Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

2020
A Single-Shot Generalized Device Placement for Large Dataflow Graphs.
IEEE Micro, 2020

Chip Placement with Deep Reinforcement Learning.
CoRR, 2020

A domain-specific supercomputer for training deep neural networks.
Commun. ACM, 2020

Transferable Graph Optimizers for ML Compilers.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Google's Training Chips Revealed: TPUv2 and TPUv3.
Proceedings of the IEEE Hot Chips 32 Symposium, 2020

2019
GDP: Generalized Device Placement for Dataflow Graphs.
CoRR, 2019

2017
In-Datacenter Performance Analysis of a Tensor Processing Unit.
CoRR, 2017


2009
Throughput-Oriented Multicore Processors.
Proceedings of the Multicore Processors and Systems, 2009

2007
iChip Multiprocessor Architecture: Techniques to Improve Throughput and Latency
Synthesis Lectures on Computer Architecture, Morgan & Claypool Publishers, ISBN: 978-3-031-01720-9, 2007

The Coming Wave of Multithreaded Chip Multiprocessors.
Int. J. Parallel Program., 2007

Virtual private caches.
Proceedings of the 34th International Symposium on Computer Architecture (ISCA 2007), 2007

2006
Multi-Core Microprocessors Are Here.
login Usenix Mag., 2006

Fair Queuing Memory Systems.
Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-39 2006), 2006

2005
Performance/Watt: the new server focus.
SIGARCH Comput. Archit. News, 2005

The RASE (Rapid, Accurate Simulation Environment) for chip multiprocessors.
SIGARCH Comput. Archit. News, 2005

Maximizing CMP Throughput with Mediocre Cores.
Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques (PACT 2005), 2005

1998
The DASH Prototype: Implementation and Performance.
Proceedings of the 25 Years of the International Symposia on Computer Architecture (Selected Papers)., 1998

Retrospective: The DASH Prototype: Implementation and Performance.
Proceedings of the 25 Years of the International Symposia on Computer Architecture (Selected Papers)., 1998

1997
The SGI Origin: A ccNUMA Highly Scalable Server.
Proceedings of the 24th International Symposium on Computer Architecture, 1997

System overview of the SGI Origin 200/2000 product line.
Proceedings of the Proceedings IEEE COMPCON 97, 1997

1994
Interleaving: A Multithreading Technique Targeting Multiprocessors and Workstations.
Proceedings of the ASPLOS-VI Proceedings, 1994

Architectural and Implementation Tradeoffs in the Design of Multiple-Context Processors.
Proceedings of the Multithreaded Computer Architecture, 1994

1993
The DASH Prototype: Logic Overhead and Performance.
IEEE Trans. Parallel Distributed Syst., 1993

1992
The Stanford Dash Multiprocessor.
Computer, 1992

Architectural and implementation tradeoffs in the design of multiple-context processors.
Proceedings of the 19th Annual International Symposium on Computer Architecture. Gold Coast, 1992

1990
The Directory-Based Cache Coherence Protocol for the DASH Multiprocessor.
Proceedings of the 17th Annual International Symposium on Computer Architecture, 1990

Memory Consistency and Event Ordering in Scalable Shared-Memory Multiprocessors.
Proceedings of the 17th Annual International Symposium on Computer Architecture, 1990

Design of scalable shared-memory multiprocessors: the DASH approach.
Proceedings of the Intellectual Leverage: Thirty-Fifth IEEE Computer Society International Conference, 1990

1988
The Astronautics ZS-1 processor.
Proceedings of the Computer Design: VLSI in Computers and Processors, 1988

1987
The ZS-1 Central Processor.
Proceedings of the Second International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS II), 1987


  Loading...