Alekh Agarwal
Orcid: 0000-0001-7032-7162
According to our database1,
Alekh Agarwal
authored at least 130 papers
between 2006 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
J. Mach. Learn. Res., 2024
CoRR, 2024
Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning.
CoRR, 2024
Offline Imitation Learning from Multiple Baselines with Applications to Compiler Optimization.
CoRR, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Conditional Language Policy: A General Framework For Steerable Multi-Objective Finetuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the International Conference on Algorithmic Learning Theory, 2024
2023
Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking.
CoRR, 2023
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023
2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Model-based RL with Optimistic Posterior Sampling: Structural Conditions and Sample Complexity.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach.
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling.
Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK., 2022
Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK., 2022
2021
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift.
J. Mach. Learn. Res., 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Cautiously Optimistic Policy Optimization and Exploration with Linear Function Approximation.
Proceedings of the Conference on Learning Theory, 2021
Proceedings of the Conference on Learning Theory, 2021
2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the Conference on Learning Theory, 2020
Proceedings of the Conference on Learning Theory, 2020
Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes.
Proceedings of the Conference on Learning Theory, 2020
Metareasoning in Modular Software Systems: On-the-Fly Configuration Using Reinforcement Learning with Rich Contextual Representations.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
CoRR, 2019
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019
Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Bias Correction of Learned Generative Models via Likelihood-free Importance Weighting.
Proceedings of the Deep Generative Models for Highly Structured Data, 2019
Model-based RL in Contextual Decision Processes: PAC bounds and Exponential Improvements over Model-free Approaches.
Proceedings of the Conference on Learning Theory, 2019
2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the Conference On Learning Theory, 2018
Proceedings of the Conference On Learning Theory, 2018
2017
IEEE Trans. Inf. Theory, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 30th Conference on Learning Theory, 2017
Proceedings of the 30th Conference on Learning Theory, 2017
2016
SIAM J. Optim., 2016
CoRR, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the 32nd International Conference on Machine Learning, 2015
Proceedings of the 32nd International Conference on Machine Learning, 2015
2014
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Proceedings of The 27th Conference on Learning Theory, 2014
Proceedings of The 27th Conference on Learning Theory, 2014
Stochastic optimization and sparse statistical recovery: An optimal algorithm for high dimensions.
Proceedings of the 48th Annual Conference on Information Sciences and Systems, 2014
2013
IEEE Trans. Inf. Theory, 2013
CoRR, 2013
Proceedings of the 30th International Conference on Machine Learning, 2013
2012
Information-Theoretic Lower Bounds on the Oracle Complexity of Stochastic Convex Optimization.
IEEE Trans. Inf. Theory, 2012
Dual Averaging for Distributed Optimization: Convergence Analysis and Network Scaling.
IEEE Trans. Autom. Control., 2012
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012
Proceedings of the IEEE Statistical Signal Processing Workshop, 2012
Stochastic optimization and sparse statistical recovery: Optimal algorithms for high dimensions.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012
Proceedings of the 50th Annual Allerton Conference on Communication, 2012
2011
Proceedings of the COLT 2011, 2011
Fast global convergence of gradient methods for high-dimensional statistical recovery
CoRR, 2011
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011
Proceedings of the 28th International Conference on Machine Learning, 2011
2010
Message-passing for Graph-structured Linear Programs: Proximal Methods and Rounding Schemes.
J. Mach. Learn. Res., 2010
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010
Fast global convergence rates of gradient methods for high-dimensional statistical recovery.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010
Proceedings of the COLT 2010, 2010
2009
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009
Proceedings of the COLT 2009, 2009
2008
Message-passing for graph-structured linear programs: proximal projections, convergence and rounding schemes.
Proceedings of the Machine Learning, 2008
2007
Proceedings of the Advances in Neural Information Processing Systems 20, 2007
Proceedings of the Machine Learning, 2007
2006
Proceedings of the Knowledge Discovery in Databases: PKDD 2006, 2006
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006