2023
Unikernel Linux (UKL).
Proceedings of the Eighteenth European Conference on Computer Systems, 2023

2022
Integrating Unikernel Optimizations in a General Purpose OS.
CoRR, 2022

A Comprehensive Evaluation of Novel AI Accelerators for Deep Learning Workloads.
Proceedings of the IEEE/ACM International Workshop on Performance Modeling, 2022

A software-defined tensor streaming multiprocessor for large-scale machine learning.
Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

The Groq Software-defined Scale-out Tensor Streaming Multiprocessor : From chips-to-systems architectural overview.
Proceedings of the 2022 IEEE Hot Chips 34 Symposium, 2022

ASSISTER: Assistive Navigation via Conditional Instruction Generation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Challenges/Opportunities to Enable Dependable Scale-out System with Groq Deterministic Tensor-Streaming Processors.
Proceedings of the 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2022

Answer Fast: Accelerating BERT on the Tensor Streaming Processor.
Proceedings of the 33rd IEEE International Conference on Application-specific Systems, 2022

2021
X-World: Accessibility, Vision, and Autonomy Meet.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Think Fast: A Tensor Streaming Processor (TSP) for Accelerating Deep Learning Workloads.
Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020