2025
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering.
CoRR, March, 2025
Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation.
CoRR, February, 2025
Pre-Trained Video Generative Models as World Simulators.
CoRR, February, 2025
A Computational CNN-LSTM-Based Mental Health Consultation System in a College Environment.
Informatica (Slovenia), 2025
Neuroplastic Expansion in Deep Reinforcement Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Towards Robust, Efficient, and Practical Decision-Making: From Reward-Maximizing Deep Reinforcement Learning to Reward-Matching GFlowNets.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Flow Factorization for Efficient Generative Flow Networks.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning.
IEEE/ACM Trans. Netw., June, 2024
Distributional GFlowNets with Quantile Flows.
Trans. Mach. Learn. Res., 2024
A hierarchical blockchain-enabled security-threat assessment architecture for IoV.
Digit. Commun. Networks, 2024
Neuroplastic Expansion in Deep Reinforcement Learning.
CoRR, 2024
Safety challenges of AI in medicine.
CoRR, 2024
Rectifying Reinforcement Learning for Reward Matching.
CoRR, 2024
Bifurcated Generative Flow Networks.
CoRR, 2024
Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning.
CoRR, 2024
Evolution Guided Generative Flow Networks.
CoRR, 2024
Premium Power Service for Custom Power Devices Based on Trial-Purchase Model.
IEEE Access, 2024
AgentIR: 1st Workshop on Agent-based Information Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024
Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
QGFN: Controllable Greediness with Action Values.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Learning to Scale Logits for Temperature-Conditional GFlowNets.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Pre-Training and Fine-Tuning Generative Flow Networks.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Bridging the Sim-to-Real Gap from the Information Bottleneck Perspective.
Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany, 2024
2023
Network Topology Optimization via Deep Reinforcement Learning.
IEEE Trans. Commun., May, 2023
One is More: Diverse Perspectives within a Single Network for Efficient DRL.
CoRR, 2023
Probabilistic Generative Modeling for Procedural Roundabout Generation for Developing Countries.
CoRR, 2023
Learning to Scale Logits for Temperature-Conditional GFlowNets.
CoRR, 2023
Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning.
CoRR, 2023
Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets.
CoRR, 2023
Stochastic Generative Flow Networks.
Proceedings of the Uncertainty in Artificial Intelligence, 2023
Let the Flows Tell: Solving Graph Combinatorial Problems with GFlowNets.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Better Training of GFlowNets with Local Credit and Incomplete Trajectories.
Proceedings of the International Conference on Machine Learning, 2023
RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Generative Augmented Flow Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
2022
An Intelligent Decision Framework for Loan Allocation Schemes.
J. Circuits Syst. Comput., 2022
RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch.
CoRR, 2022
E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Effective multi-user delay-constrained scheduling with deep recurrent reinforcement learning.
Proceedings of the MobiHoc '22: The Twenty-third International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing, Seoul, Republic of Korea, October 17, 2022
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification.
Proceedings of the International Conference on Machine Learning, 2022
2021
DEA cross-efficiency framework for efficiency evaluation with probabilistic linguistic term sets.
J. Oper. Res. Soc., 2021
Softmax with Regularization: Better Value Estimation in Multi-Agent Reinforcement Learning.
CoRR, 2021
Exploration in policy optimization through multiple paths.
Auton. Agents Multi Agent Syst., 2021
Regularized Softmax Deep Multi-Agent Q-Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
2020
Using Virtual Reality Technology to Visualize Management of College Assets in the Internet of Things Environment.
IEEE Access, 2020
Softmax Deep Double Deterministic Policy Gradients.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Reinforcement Learning with Dynamic Boltzmann Softmax Updates.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Multi-Path Policy Optimization.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020
Deterministic Value-Policy Gradients.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Reinforcement Learning with Dynamic Boltzmann Softmax Updates.
CoRR, 2019
A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Generalized deterministic policy gradient algorithms.
CoRR, 2018
Rebalancing Dockless Bike Sharing Systems.
CoRR, 2018
2017
Medical Sign Recognition of Lung Nodules Based on Image Retrieval with Semantic Features and Supervised Hashing.
J. Comput. Sci. Technol., 2017