2025
CORD: Generalizable Cooperation via Role Diversity.
CoRR, January, 2025
2024
Master-Slave Deep Architecture for Top-K Multiarmed Bandits With Nonlinear Bandit Feedback and Diversity Constraints.
IEEE Trans. Neural Networks Learn. Syst., December, 2024
Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization.
IEEE Trans. Neural Networks Learn. Syst., October, 2024
Playable Game Generation.
CoRR, 2024
Aligning Few-Step Diffusion Models with Dense Reward Difference Learning.
CoRR, 2024
WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents.
CoRR, 2024
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
A Survey on Self-play Methods in Reinforcement Learning.
CoRR, 2024
Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models.
CoRR, 2024
Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination.
CoRR, 2024
More Agents Is All You Need.
CoRR, 2024
Affordable Generative Agents.
CoRR, 2024
Learning Versatile Skills with Curriculum Masking.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
HGAttack: Transferable Heterogeneous Graph Adversarial Attack.
Proceedings of the IEEE International Conference on Agents, 2024
LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Language Model Adaption for Reinforcement Learning with Natural Language Action Space.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Curriculum-Based Asymmetric Multi-Task Reinforcement Learning.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023
RLTF: Reinforcement Learning from Unit Test Feedback.
Trans. Mach. Learn. Res., 2023
A Survey on Transformers in Reinforcement Learning.
Trans. Mach. Learn. Res., 2023
Replay-enhanced Continual Reinforcement Learning.
Trans. Mach. Learn. Res., 2023
Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints.
CoRR, 2023
Future-conditioned Unsupervised Pretraining for Decision Transformer.
CoRR, 2023
Deploying Offline Reinforcement Learning with Human Feedback.
CoRR, 2023
Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization.
CoRR, 2023
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning.
CoRR, 2023
Mutual-Information Regularized Multi-Agent Policy Iteration.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
SeeHow: Workflow Extraction from Programming Screencasts through Action-Aware Video Analytics.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023
Future-conditioned Unsupervised Pretraining for Decision Transformer.
Proceedings of the International Conference on Machine Learning, 2023
More Centralized Training, Still Decentralized Execution: Multi-Agent Conditional Policy Factorization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
RLogist: Fast Observation Strategy on Whole-Slide Images with Deep Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
IEEE Trans. Neural Networks Learn. Syst., 2022
Pretraining in Deep Reinforcement Learning: A Survey.
CoRR, 2022
Curriculum-based Asymmetric Multi-task Reinforcement Learning.
CoRR, 2022
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation.
CoRR, 2022
Revisiting Discrete Soft Actor-Critic.
CoRR, 2022
Quantized Adaptive Subgradient Algorithms and Their Applications.
CoRR, 2022
GPN: A Joint Structural Learning Framework for Graph Neural Networks.
CoRR, 2022
Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning.
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical Reinforcement Learning.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
2021
Which Heroes to Pick? Learning to Draft in MOBA Games With Neural Networks and Tree Search.
IEEE Trans. Games, 2021
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations.
CoRR, 2021
Generating Informative CVE Description From ExploitDB Posts by Extractive Summarization.
CoRR, 2021
Coordinated Proximal Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021
Learning Diverse Policies in MOBA Games via Macro-Goals.
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
Boosting Offline Reinforcement Learning with Residual Generative Modeling.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
UIS-Hunter: Detecting UI Design Smells in Android Apps.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering: Companion Proceedings, 2021
Don't Do That! Hunting Down Visual Design Smells in Complex UIs against Design Guidelines.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021
2020
Towards Playing Full MOBA Games with Deep Reinforcement Learning.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Relation-Aware Transformer for Portfolio Policy Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Mastering Complex Control in MOBA Games with Deep Reinforcement Learning.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Adversarial Transfer for Named Entity Boundary Detection with Pointer Networks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
2018
APIReal: an API recognition and linking approach for online developer forums.
Empir. Softw. Eng., 2018
2017
On domain knowledge organization and extraction in software engineering
PhD thesis, 2017
The structure and dynamics of knowledge network in domain-specific Q&A sites: a case study of stack overflow.
Empir. Softw. Eng., 2017
Ensemble application of convolutional and recurrent neural networks for multi-label text categorization.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Enhancing Knowledge Sharing in Stack Overflow via Automatic External Web Resources Linking.
Proceedings of the 22nd International Conference on Engineering of Complex Computer Systems, 2017
2016
Software-Specific Named Entity Recognition in Software Engineering Social Content.
Proceedings of the IEEE 23rd International Conference on Software Analysis, 2016
Software-specific part-of-speech tagging: an experimental study on stack overflow.
Proceedings of the 31st Annual ACM Symposium on Applied Computing, 2016
From discussion to wisdom: web resource recommendation for hyperlinks in stack overflow.
Proceedings of the 31st Annual ACM Symposium on Applied Computing, 2016
Predicting semantically linkable knowledge in developer online forums via convolutional neural network.
Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, 2016
Learning to Extract API Mentions from Informal Natural Language Discussions.
Proceedings of the 2016 IEEE International Conference on Software Maintenance and Evolution, 2016
GPU-Accelerated High-Level Synthesis for Bitwidth Optimization of FPGA Datapaths.
Proceedings of the 2016 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2016
2015
ActivitySpace: A Remembrance Framework to Support Interapplication Information Needs.
Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering, 2015
2014
MixFX-SCORE: Heterogeneous Fixed-Point Compilation of Dataflow Computations.
Proceedings of the 22nd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2014