Yujeong Choi
According to our database, Yujeong Choi authored at least 9 papers between 2020 and 2024.
Bibliography
2024
PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI Inference Servers.
CoRR, 2024
vTrain: A Simulation Framework for Evaluating Cost-Effective and Compute-Optimal Large Language Model Training.
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024
ElasticRec: A Microservice-based Model Serving Architecture Enabling Elastic Resource Scaling for Recommendation Models.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024
2023
Hera: A Heterogeneity-Aware Multi-Tenant Inference Server for Personalized Recommendations.
CoRR, 2023
2022
PARIS and ELSA: an elastic scheduling algorithm for reconfigurable multi-GPU inference servers.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022
2021
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021
2020
CoRR, 2020
PREMA: A Predictive Multi-Task Scheduling Algorithm For Preemptible Neural Processing Units.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2020
NeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units.
Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020