Rahul Kidambi

Orcid: 0009-0002-8609-3916

According to our database1, Rahul Kidambi authored at least 29 papers between 2012 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning.
CoRR, 2024

Auctions with LLM Summaries.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

A Minimaximalist Approach to Reinforcement Learning from Human Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Enhancing Group Fairness in Online Settings Using Oblique Decision Forests.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Conditional Language Policy: A General Framework For Steerable Multi-Objective Finetuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2022
Counterfactual Learning To Rank for Utility-Maximizing Query Autocompletion.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

2021
Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage.
CoRR, 2021

Optimism is All You Need: Model-Based Imitation Learning From Observation Alone.
CoRR, 2021

MobILE: Model-Based Imitation Learning From Observation Alone.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Mitigating Covariate Shift in Imitation Learning via Offline Data With Partial Coverage.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Making Paper Reviewing Robust to Bid Manipulation Attacks.
Proceedings of the 38th International Conference on Machine Learning, 2021

Top-k eXtreme Contextual Bandits with Arm Hierarchy.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
MOReL: Model-Based Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Leverage Score Sampling for Faster Accelerated Regression and ERM.
Proceedings of the Algorithmic Learning Theory, 2020

2019
The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure.
CoRR, 2019

The Step Decay Schedule: A Near Optimal, Geometrically Decaying Learning Rate Procedure For Least Squares.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Open Problem: Do Good Algorithms Necessarily Query Bad Points?
Proceedings of the Conference on Learning Theory, 2019

2018
On the insufficiency of existing momentum schemes for Stochastic Optimization.
Proceedings of the 6th International Conference on Learning Representations, 2018

Accelerating Stochastic Gradient Descent for Least Squares Regression.
Proceedings of the Conference On Learning Theory, 2018

2017
Parallelizing Stochastic Gradient Descent for Least Squares Regression: Mini-batching, Averaging, and Model Misspecification.
J. Mach. Learn. Res., 2017

Efficient Estimation of Generalization Error and Bias-Variance Components of Ensembles.
CoRR, 2017

Accelerating Stochastic Gradient Descent.
CoRR, 2017

A Markov Chain Theory Approach to Characterizing the Minimax Optimality of Stochastic Gradient Descent (for Least Squares).
Proceedings of the 37th IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science, 2017

2016
Parallelizing Stochastic Approximation Through Mini-Batching and Tail-Averaging.
CoRR, 2016

2015
Submodular Hamming Metrics.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

On Shannon capacity and causal estimation.
Proceedings of the 53rd Annual Allerton Conference on Communication, 2015

2013
A Quantitative Evaluation Framework for Missing Value Imputation Algorithms.
CoRR, 2013

A Structured Prediction Approach for Missing Value Imputation.
CoRR, 2013

2012
Deformable trellises on factor graphs for robust microtubule tracking in clutter.
Proceedings of the 9th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 2012


  Loading...