Victor Rühle

According to our database1, Victor Rühle authored at least 21 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Minerva: A Programmable Memory Test Benchmark for Language Models.
CoRR, February, 2025

2024
TurboAttention: Efficient Attention Approximation For High Throughputs LLMs.
CoRR, 2024

Ensuring Fair LLM Serving Amid Diverse Applications.
CoRR, 2024

EcoAct: Economic Agent Determines When to Register What Action.
CoRR, 2024

TACO-RL: Task Aware Prompt Compression Optimization with Reinforcement Learning.
CoRR, 2024

Intelligent Router for LLM Workloads: Improving Performance Through Workload-Aware Scheduling.
CoRR, 2024

Lean Attention: Hardware-Aware Scalable Attention Mechanism for the Decode-Phase of Transformers.
CoRR, 2024

Risk-aware Adaptive Virtual CPU Oversubscription in Microsoft Cloud via Prototypical Human-in-the-loop Imitation Learning.
CoRR, 2024

Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Hybrid-RACA: Hybrid Retrieval-Augmented Composition Assistance for Real-time Text Prediction.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Unlocking Spatial Comprehension in Text-to-Image Diffusion Models.
CoRR, 2023

Rethinking Privacy in Machine Learning Pipelines from an Information Flow Control Perspective.
CoRR, 2023

Hybrid Retrieval-Augmented Generation for Real-time Composition Assistance.
CoRR, 2023

Bayesian Estimation of Differential Privacy.
Proceedings of the International Conference on Machine Learning, 2023

Snape: Reliable and Low-Cost Computing with Mixture of Spot and On-Demand VMs.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
Spot Virtual Machine Eviction Prediction in Microsoft Cloud.
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

2021
Privacy Regularization: Joint Privacy-Utility Optimization in Language Models.
CoRR, 2021

Privacy Analysis in Language Models via Training Data Leakage Report.
CoRR, 2021

Privacy Regularization: Joint Privacy-Utility Optimization in LanguageModels.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

2020
Analyzing Information Leakage of Updates to Natural Language Models.
Proceedings of the CCS '20: 2020 ACM SIGSAC Conference on Computer and Communications Security, 2020


  Loading...