Ruihang Lai

According to our database, Ruihang Lai authored at least 9 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Timeline

Papers per year: 2022: 1; 2023: 3; 2024: 4; 2025: 1.

Bibliography

2025
FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving.
CoRR, January, 2025

2024
WebLLM: A High-Performance In-Browser LLM Inference Engine.
CoRR, 2024

A System for Microserving of LLMs.
CoRR, 2024

XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models.
CoRR, 2024

Emerging Platforms Meet Emerging LLMs: A Year-Long Journey of Top-Down Development.
CoRR, 2024

2023
Relax: Composable Abstractions for End-to-End Dynamic Machine Learning.
CoRR, 2023

SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

TensorIR: An Abstraction for Automatic Tensorized Program Optimization.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
Tensor Program Optimization with Probabilistic Programs.
Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

