2025
Hermes: Algorithm-System Co-design for Efficient Retrieval-Augmented Generation At-Scale.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025
Pirate: No Compromise Low-Bandwidth VR Streaming for Edge Devices.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025
Practical Federated Recommendation Model Learning Using ORAM with Controlled Privacy.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025
2024
Towards Understanding Systems Trade-offs in Retrieval-Augmented Generation Model Inference.
CoRR, 2024
Information Flow Control in Machine Learning through Modular Model Architecture.
Proceedings of the 33rd USENIX Security Symposium, 2024
QoS-Diff: Adaptive Auto-tuning Framework for Low-latency Diffusion Model Inference.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024
Accelerating ReLU for MPC-Based Private Inference with a Communication-Efficient Sign Estimation.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024
Compiler-Based Memory Encryption for Machine Learning on Commodity Low-Power Devices.
Proceedings of the 33rd ACM SIGPLAN International Conference on Compiler Construction, 2024
LazyDP: Co-Designing Algorithm-Software for Scalable Training of Differentially Private Recommendation Models.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024
GPU-based Private Information Retrieval for On-Device Machine Learning Inference.
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024
2023
Federated Ensemble Learning: Increasing the Capacity of Label Private Recommendation Systems.
IEEE Data Eng. Bull., 2023
Approximating ReLU on a Reduced Ring for Efficient MPC-based Private Inference.
CoRR, 2023
Green Federated Learning.
CoRR, 2023
Bounding the Invertibility of Privacy-preserving Instance Encoding using Fisher Information.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Optimizing CPU Performance for Recommendation Systems At-Scale.
,
,
,
,
,
,
,
,
,
,
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023
Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning Using Independent Component Analysis.
Proceedings of the International Conference on Machine Learning, 2023
Carbon Explorer: A Holistic Framework for Designing Carbon Aware Datacenters.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023
2022
Data Leakage via Access Patterns of Sparse Features in Deep Learning-based Recommendation Systems.
CoRR, 2022
Measuring and Controlling Split Layer Privacy Leakage Using Fisher Information.
CoRR, 2022
FEL: High Capacity Learning for Recommendation and Ranking via Federated Ensemble Learning.
CoRR, 2022
A Holistic Approach for Designing Carbon Aware Datacenters.
CoRR, 2022
Towards Fair Federated Recommendation Learning: Characterizing the Inter-Dependence of System and Data Heterogeneity.
Proceedings of the RecSys '22: Sixteenth ACM Conference on Recommender Systems, Seattle, WA, USA, September 18, 2022
Sustainable AI: Environmental Implications, Challenges and Opportunities.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022
An Architectural Charge Management Interface for Energy-Harvesting Systems.
Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022
2021
Sustainable AI: Environmental Implications, Challenges and Opportunities.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2021
Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021
2020
Dynamic Task-based Intermittent Execution for Energy-harvesting Devices.
ACM Trans. Sens. Networks, 2020
CPR: Understanding and Improving Failure Tolerant Training for Deep Learning Recommendation with Partial Recovery.
,
,
,
,
,
,
,
,
,
,
CoRR, 2020
Adaptive low-overhead scheduling for periodic and reactive intermittent execution.
Proceedings of the 41st ACM SIGPLAN International Conference on Programming Language Design and Implementation, 2020
2019
Enhancing Stratospheric Weather Analyses and Forecasts by Deploying Sensors from a Weather Balloon.
CoRR, 2019
Supporting peripherals in intermittent systems with just-in-time checkpoints.
Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2019
2018
Adaptive Dynamic Checkpointing for Safe Efficient Intermittent Computing.
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018
2017
Alpaca: intermittent execution without checkpoints.
Proc. ACM Program. Lang., 2017
Intermittent Computing: Challenges and Opportunities.
Proceedings of the 2nd Summit on Advances in Programming Languages, 2017