2025

Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering.

[DOI]

,

,

,

,

,

,

,

CoRR, March, 2025

Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation.

[DOI]

,

,

,

CoRR, February, 2025

Pre-Trained Video Generative Models as World Simulators.

[DOI]

,

,

,

,

CoRR, February, 2025

A Computational CNN-LSTM-Based Mental Health Consultation System in a College Environment.

[DOI]

,

Informatica (Slovenia), 2025

Neuroplastic Expansion in Deep Reinforcement Learning.

[DOI]

,

Johan Samir Obando Ceron

,

Aaron C. Courville

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets.

[DOI]

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Towards Robust, Efficient, and Practical Decision-Making: From Reward-Maximizing Deep Reinforcement Learning to Reward-Matching GFlowNets.

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Flow Factorization for Efficient Generative Flow Networks.

[DOI]

,

,

,

,

,

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning.

[DOI]

,

,

,

,

,

IEEE/ACM Trans. Netw., June, 2024

Distributional GFlowNets with Quantile Flows.

[DOI]

,

,

Ricky T. Q. Chen

,

Aaron C. Courville

,

Trans. Mach. Learn. Res., 2024

A hierarchical blockchain-enabled security-threat assessment architecture for IoV.

[DOI]

,

,

Digit. Commun. Networks, 2024

Neuroplastic Expansion in Deep Reinforcement Learning.

[DOI]

,

Johan S. Obando-Ceron

,

Aaron C. Courville

,

CoRR, 2024

Safety challenges of AI in medicine.

[DOI]

,

Nicole Xi Zhang

,

,

,

,

,

,

Danielle S. Bitterman

,

,

,

,

CoRR, 2024

Rectifying Reinforcement Learning for Reward Matching.

[DOI]

,

Emmanuel Bengio

,

,

CoRR, 2024

Bifurcated Generative Flow Networks.

[DOI]

,

,

,

,

CoRR, 2024

Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning.

[DOI]

,

,

,

,

,

CoRR, 2024

Evolution Guided Generative Flow Networks.

[DOI]

,

,

CoRR, 2024

Premium Power Service for Custom Power Devices Based on Trial-Purchase Model.

[DOI]

,

,

,

,

IEEE Access, 2024

AgentIR: 1st Workshop on Agent-based Information Retrieval.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning.

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

QGFN: Controllable Greediness with Action Values.

[DOI]

,

Stephen Zhewen Lu

,

,

,

Emmanuel Bengio

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training.

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training.

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Learning to Scale Logits for Temperature-Conditional GFlowNets.

[DOI]

,

,

,

,

,

,

,

Emmanuel Bengio

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Pre-Training and Fine-Tuning Generative Flow Networks.

[DOI]

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Bridging the Sim-to-Real Gap from the Information Bottleneck Perspective.

[DOI]

,

,

,

,

,

,

,

Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany., 2024

2023

Network Topology Optimization via Deep Reinforcement Learning.

[DOI]

,

,

,

,

,

,

,

IEEE Trans. Commun., May, 2023

One is More: Diverse Perspectives within a Single Network for Efficient DRL.

[DOI]

,

,

CoRR, 2023

Probabilistic Generative Modeling for Procedural Roundabout Generation for Developing Countries.

[DOI]

,

,

CoRR, 2023

Learning to Scale Logits for Temperature-Conditional GFlowNets.

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning.

[DOI]

,

,

CoRR, 2023

Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets.

[DOI]

,

,

,

Aaron C. Courville

,

,

CoRR, 2023

Stochastic Generative Flow Networks.

[DOI]

,

,

,

,

Proceedings of the Uncertainty in Artificial Intelligence, 2023

Let the Flows Tell: Solving Graph Combinatorial Problems with GFlowNets.

[DOI]

,

,

,

Aaron C. Courville

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Better Training of GFlowNets with Local Credit and Incomplete Trajectories.

[DOI]

,

,

,

Proceedings of the International Conference on Machine Learning, 2023

RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch.

[DOI]

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Generative Augmented Flow Networks.

[DOI]

,

,

Aaron C. Courville

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

An Intelligent Decision Framework for Loan Allocation Schemes.

[DOI]

,

,

J. Circuits Syst. Comput., 2022

RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch.

[DOI]

,

,

,

CoRR, 2022

E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance.

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Effective multi-user delay-constrained scheduling with deep recurrent reinforcement learning.

[DOI]

,

,

,

,

Proceedings of the MobiHoc '22: The Twenty-third International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing, Seoul, Republic of Korea, October 17, 2022

Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification.

[DOI]

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

2021

DEA cross-efficiency framework for efficiency evaluation with probabilistic linguistic term sets.

[DOI]

,

,

J. Oper. Res. Soc., 2021

Softmax with Regularization: Better Value Estimation in Multi-Agent Reinforcement Learning.

[DOI]

,

,

,

,

Shimon Whiteson

CoRR, 2021

Exploration in policy optimization through multiple paths.

[DOI]

,

,

Auton. Agents Multi Agent Syst., 2021

Regularized Softmax Deep Multi-Agent Q-Learning.

[DOI]

,

,

,

,

Shimon Whiteson

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020

Using Virtual Reality Technology to Visualize Management of College Assets in the Internet of Things Environment.

[DOI]

,

,

IEEE Access, 2020

Softmax Deep Double Deterministic Policy Gradients.

[DOI]

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Reinforcement Learning with Dynamic Boltzmann Softmax Updates.

[DOI]

,

,

,

,

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Multi-Path Policy Optimization.

[DOI]

,

,

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Deterministic Value-Policy Gradients.

[DOI]

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Reinforcement Learning with Dynamic Boltzmann Softmax Updates.

[DOI]

,

,

,

,

,

CoRR, 2019

A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems.

[DOI]

,

,

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Generalized deterministic policy gradient algorithms.

[DOI]

,

,

CoRR, 2018

Rebalancing Dockless Bike Sharing Systems.

[DOI]

,

,

,

,

CoRR, 2018

2017

Medical Sign Recognition of Lung Nodules Based on Image Retrieval with Semantic Features and Supervised Hashing.

[DOI]

,

,

,

J. Comput. Sci. Technol., 2017