A. Rupam Mahmood
Orcid: 0000-0001-6266-162XAffiliations:
- University of Alberta, Reinforcement Learning & Artificial Intelligence Lab, Edmonton, AB, Canada
- Alberta Machine Intelligence Institute (Amii), Edmonton, AB, Canada
- Kindred AI, Toronto, ON, Canada
According to our database1,
A. Rupam Mahmood
authored at least 48 papers
between 2012 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on linkedin.com
-
on amii.ca
On csauthors.net:
Bibliography
2024
Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning.
RLJ, 2024
Learning to Optimize for Reinforcement Learning.
RLJ, 2024
More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling.
RLJ, 2024
Weight Clipping for Deep Continual and Reinforcement Learning.
RLJ, 2024
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Revisiting Scalable Hessian Diagonal Approximations for Applications in Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
2023
Trans. Mach. Learn. Res., 2023
CoRR, 2023
Proceedings of the Uncertainty in Artificial Intelligence, 2023
Proceedings of the International Joint Conference on Neural Networks, 2023
Real-Time Reinforcement Learning for Vision-Based Robotics Utilizing Local and Remote Computers.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
Proceedings of the International Conference on Machine Learning, 2023
2022
Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences.
J. Mach. Learn. Res., 2022
Proceedings of the 2022 International Conference on Robotics and Automation, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022
2021
CoRR, 2021
Analyzing Neural Jacobian Methods in Applications of Visual Servoing and Kinematic Control.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021
2020
IEEE Robotics Autom. Lett., 2020
2019
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
2018
J. Mach. Learn. Res., 2018
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018
Proceedings of the 2nd Annual Conference on Robot Learning, 2018
2017
2016
J. Mach. Learn. Res., 2016
2015
Off-policy learning based on weighted importance sampling with linear computational complexity.
Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015
2014
Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014
Weighted importance sampling for off-policy learning with linear function approximation.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
2013
Proceedings of the Tenth Symposium on Abstraction, Reformulation, and Approximation, 2013
Proceedings of the Learning Rich Representations from Low-Level Sensors, 2013
2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012