2025
Longhorn: State Space Models are Amortized Online Learners.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
2024
AMO Sampler: Enhancing Text Rendering with Overshooting.
CoRR, 2024
Learning Memory Mechanisms for Decision Making through Demonstrations.
CoRR, 2024
Fine-Grained Gradient Restriction: A Simple Approach for Mitigating Catastrophic Forgetting.
CoRR, 2024
Memory-Efficient LLM Training with Online Subspace Descent.
CoRR, 2024
H-Fac: Memory-Efficient Optimization with Factorized Hamiltonian Descent.
CoRR, 2024
Communication Efficient Distributed Training with Distributed Lion.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Lion Secretly Solves a Constrained Optimization: As Lyapunov Predicts.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
t-DGR: A Trajectory-Based Deep Generative Replay Method for Continual Learning in Decision Making.
Proceedings of the Conference on Lifelong Learning Agents, 2024
Overview of t-DGR: A Trajectory-Based Deep Generative Replay Method for Continual Learning in Decision Making.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
2023
LLM+P: Empowering Large Language Models with Optimal Planning Proficiency.
CoRR, 2023
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
FAMO: Fast Adaptive Multitask Optimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Benchmarking Reinforcement Learning Techniques for Autonomous Navigation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
Model-Based Meta Automatic Curriculum Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2023
Relaxed Exploration Constrained Reinforcement Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
Metric Residual Network for Sample Efficient Goal-Conditioned Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
APPL: Adaptive Planner Parameter Learning.
Robotics Auton. Syst., 2022
Metric Residual Networks for Sample Efficient Goal-conditioned Reinforcement Learning.
CoRR, 2022
Learning a Shield from Catastrophic Action Effects: Never Repeat the Same Mistake.
CoRR, 2022
Motion planning and control for mobile robot navigation using machine learning: a survey.
Auton. Robots, 2022
BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Effective mutation rate adaptation through group elite selection.
Proceedings of the GECCO '22: Genetic and Evolutionary Computation Conference, Boston, Massachusetts, USA, July 9, 2022
Offline training of multi-agent reinforcement agents for grid-interactive buildings control.
Proceedings of the e-Energy '22: The Thirteenth ACM International Conference on Future Energy Systems, Virtual Event, 28 June 2022, 2022
A Rule-based Shield: Accumulating Safety Rules from Catastrophic Action Effects.
Proceedings of the Conference on Lifelong Learning Agents, 2022
Continual Learning and Private Unlearning.
Proceedings of the Conference on Lifelong Learning Agents, 2022
2021
Toward Agile Maneuvers in Highly Constrained Spaces: Learning From Hallucination.
IEEE Robotics Autom. Lett., 2021
A Lifelong Learning Approach to Mobile Robot Navigation.
IEEE Robotics Autom. Lett., 2021
UT Austin Villa: RoboCup 2021 3D Simulation League Competition Champions.
Proceedings of the RoboCup 2021: Robot World Cup XXIV, 2021
Conflict-Averse Gradient Descent for Multi-task learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Machine versus Human Attention in Deep Reinforcement Learning Tasks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Team Orienteering Coverage Planning with Uncertain Reward.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021
APPLR: Adaptive Planner Parameter Learning from Reinforcement.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021
Agile Robot Navigation through Hallucinated Learning and Sober Deployment.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021
APPLI: Adaptive Planner Parameter Learning From Interventions.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021
Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition.
Proceedings of the 38th International Conference on Machine Learning, 2021
2020
APPLD: Adaptive Planner Parameter Learning From Demonstration.
IEEE Robotics Autom. Lett., 2020
Motion Control for Mobile Robot Navigation Using Machine Learning: a Survey.
CoRR, 2020
Human versus Machine Attention in Deep Reinforcement Learning Tasks.
CoRR, 2020
Extended Abstract: Motion Planners Learned from Geometric Hallucination.
CoRR, 2020
Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Human Gaze Assisted Artificial Intelligence: A Review.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
2019
Dual Sequential Monte Carlo: Tunneling Filtering and Planning in Continuous POMDPs.
CoRR, 2019
Predicting pregnancy using large-scale datafrom a women's health tracking mobile application.
Proceedings of the World Wide Web Conference, 2019
2018
Predicting pregnancy using large-scale data from a women's health tracking mobile application.
CoRR, 2018