2025
Hermes: Algorithm-System Co-design for Efficient Retrieval-Augmented Generation At-Scale.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

Pirate: No Compromise Low-Bandwidth VR Streaming for Edge Devices.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

Practical Federated Recommendation Model Learning Using ORAM with Controlled Privacy.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

2024
Towards Understanding Systems Trade-offs in Retrieval-Augmented Generation Model Inference.
CoRR, 2024

Information Flow Control in Machine Learning through Modular Model Architecture.
Proceedings of the 33rd USENIX Security Symposium, 2024

QoS-Diff: Adaptive Auto-tuning Framework for Low-latency Diffusion Model Inference.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

Accelerating ReLU for MPC-Based Private Inference with a Communication-Efficient Sign Estimation.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

Compiler-Based Memory Encryption for Machine Learning on Commodity Low-Power Devices.
Proceedings of the 33rd ACM SIGPLAN International Conference on Compiler Construction, 2024

LazyDP: Co-Designing Algorithm-Software for Scalable Training of Differentially Private Recommendation Models.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

GPU-based Private Information Retrieval for On-Device Machine Learning Inference.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Federated Ensemble Learning: Increasing the Capacity of Label Private Recommendation Systems.
IEEE Data Eng. Bull., 2023

Approximating ReLU on a Reduced Ring for Efficient MPC-based Private Inference.
CoRR, 2023

Green Federated Learning.
CoRR, 2023

Bounding the Invertibility of Privacy-preserving Instance Encoding using Fisher Information.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Optimizing CPU Performance for Recommendation Systems At-Scale.
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning Using Independent Component Analysis.
Proceedings of the International Conference on Machine Learning, 2023

Carbon Explorer: A Holistic Framework for Designing Carbon Aware Datacenters.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
Data Leakage via Access Patterns of Sparse Features in Deep Learning-based Recommendation Systems.
CoRR, 2022

Measuring and Controlling Split Layer Privacy Leakage Using Fisher Information.
CoRR, 2022

FEL: High Capacity Learning for Recommendation and Ranking via Federated Ensemble Learning.
CoRR, 2022

A Holistic Approach for Designing Carbon Aware Datacenters.
CoRR, 2022

Towards Fair Federated Recommendation Learning: Characterizing the Inter-Dependence of System and Data Heterogeneity.
Proceedings of the RecSys '22: Sixteenth ACM Conference on Recommender Systems, Seattle, WA, USA, September 18, 2022

Sustainable AI: Environmental Implications, Challenges and Opportunities.
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

An Architectural Charge Management Interface for Energy-Harvesting Systems.
Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022

2021
Sustainable AI: Environmental Implications, Challenges and Opportunities.
CoRR, 2021

Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

2020
Dynamic Task-based Intermittent Execution for Energy-harvesting Devices.
ACM Trans. Sens. Networks, 2020

CPR: Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery.
CoRR, 2020

Adaptive low-overhead scheduling for periodic and reactive intermittent execution.
Proceedings of the 41st ACM SIGPLAN International Conference on Programming Language Design and Implementation, 2020

2019
Enhancing Stratospheric Weather Analyses and Forecasts by Deploying Sensors from a Weather Balloon.
CoRR, 2019

Supporting peripherals in intermittent systems with just-in-time checkpoints.
Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2019

2018
Adaptive Dynamic Checkpointing for Safe Efficient Intermittent Computing.
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

2017
Alpaca: intermittent execution without checkpoints.
Proc. ACM Program. Lang., 2017

Intermittent Computing: Challenges and Opportunities.
Proceedings of the 2nd Summit on Advances in Programming Languages, 2017