Haoran Qiu

Orcid: 0009-0005-9713-0200

According to our database1, Haoran Qiu authored at least 25 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
One Queue Is All You Need: Resolving Head-of-Line Blocking in Large Language Model Serving.
CoRR, 2024

Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction.
CoRR, 2024

Decision Transformer as a Foundation Model for Partially Observable Continuous Control.
CoRR, 2024

Õ(T<sup>-1</sup>) Convergence to (Coarse) Correlated Equilibria in Full-Information General-Sum Markov Games.
CoRR, 2024

Power-aware Deep Learning Model Serving with μ-Serve.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

FLASH: Fast Model Adaptation in ML-Centric Cloud Platforms.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

$\widetilde{O}(T^{-1})$ {C}onvergence to (coarse) correlated equilibria in full-information general-sum markov games.
Proceedings of the 6th Annual Learning for Dynamics & Control Conference, 2024

SmartOClock: Workload- and Risk-Aware Overclocking in the Cloud.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

From Stickers to Personas: Utilizing Instant Messaging Stickers for Impression Management by Gen Z.
Proceedings of the Wisdom, Well-Being, Win-Win, 2024

When Green Computing Meets Performance and Resilience SLOs.
Proceedings of the 54th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2024

Queue Management for SLO-Oriented Large Language Model Serving.
Proceedings of the 2024 ACM Symposium on Cloud Computing, 2024

2023
A Geography-Based P2P Overlay Network for Fast and Robust Blockchain Systems.
IEEE Trans. Serv. Comput., 2023

AWARE: Automate Workload Autoscaling with Reinforcement Learning in Production Cloud Systems.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

Multi-Agent Meta-Reinforcement Learning: Sharper Convergence Rates with Task Similarity.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
A Mean-Field Game Approach to Cloud Resource Management with Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Evaluating Hardware Memory Disaggregation under Delay and Contention.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

Reinforcement learning for resource management in multi-tenant serverless platforms.
Proceedings of the EuroMLSys '22: Proceedings of the 2nd European Workshop on Machine Learning and Systems, Rennes, France, April 5, 2022

SIMPPO: a scalable and incremental online learning framework for serverless resource management.
Proceedings of the 13th Symposium on Cloud Computing, SoCC 2022, 2022

2021
SLO beyond the Hardware Isolation Limits.
CoRR, 2021

Is Function-as-a-Service a Good Fit for Latency-Critical Services?
Proceedings of the WoSC '21: Proceedings of the Seventh International Workshop on Serverless Computing (WoSC7) 2021, 2021

Delay sensitivity-driven congestion mitigation for HPC systems.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

2020
Application-aware Congestion Mitigation forHigh-Performance Computing Systems.
CoRR, 2020

FIRM: An Intelligent Fine-grained Resource Management Framework for SLO-Oriented Microservices.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

2018
PLOVER: Fast, Multi-core Scalable Virtual Machine Fault-tolerance.
Proceedings of the 15th USENIX Symposium on Networked Systems Design and Implementation, 2018

OWL: Understanding and Detecting Concurrency Attacks.
Proceedings of the 48th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2018


  Loading...