2024
Flex Attention: A Programming Model for Generating Optimized Attention Kernels.
CoRR, 2024

PyTorch 2: Faster Machine Learning Through Dynamic Python Bytecode Transformation and Graph Compilation.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Transcending Runtime-Memory Tradeoffs in Checkpointing by being Fusion Aware.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

2022
GPT-NeoX-20B: An Open-Source Autoregressive Language Model.
CoRR, 2022

torch.fx: Practical Program Capture and Transformation for Deep Learning in Python.
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

2021
Edge Proposal Sets for Link Prediction.
CoRR, 2021

The Pile: An 800GB Dataset of Diverse Text for Language Modeling.
CoRR, 2021

Measuring Coding Challenge Competence With APPS.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Value Learning for Throughput Optimization of Deep Learning Workloads.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

Combining Label Propagation and Simple Models out-performs Graph Neural Networks.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Geometry types for graphics programming.
Proc. ACM Program. Lang., 2020

Value Function Based Performance Optimization of Deep Learning Workloads.
CoRR, 2020

Set-Structured Latent Representations.
CoRR, 2020

Better Set Representations For Relational Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Enhancing Adversarial Example Transferability With an Intermediate Level Attack.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Adversarial Example Decomposition.
CoRR, 2018

Intermediate Level Adversarial Attack for Enhanced Transferability.
CoRR, 2018