2025

CORD: Generalizable Cooperation via Role Diversity.

Kanefumi Matsuyama

Kefan Su

Jiangxing Wang

Deheng Ye

Zongqing Lu

CoRR, January, 2025

2024

Master-Slave Deep Architecture for Top-K Multiarmed Bandits With Nonlinear Bandit Feedback and Diversity Constraints.

IEEE Trans. Neural Networks Learn. Syst., December, 2024

Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization.

IEEE Trans. Neural Networks Learn. Syst., October, 2024

Playable Game Generation.

CoRR, 2024

Aligning Few-Step Diffusion Models with Dense Reward Difference Learning.

CoRR, 2024

WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents.

CoRR, 2024

Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks.

CoRR, 2024

A Survey on Self-play Methods in Reinforcement Learning.

CoRR, 2024

Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models.

CoRR, 2024

Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination.

CoRR, 2024

More Agents Is All You Need.

CoRR, 2024

Affordable Generative Agents.

CoRR, 2024

Learning Versatile Skills with Curriculum Masking.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

HGAttack: Transferable Heterogeneous Graph Adversarial Attack.

Proceedings of the IEEE International Conference on Agents, 2024

LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Language Model Adaption for Reinforcement Learning with Natural Language Action Space.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Curriculum-Based Asymmetric Multi-Task Reinforcement Learning.

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

RLTF: Reinforcement Learning from Unit Test Feedback.

Trans. Mach. Learn. Res., 2023

A Survey on Transformers in Reinforcement Learning.

Trans. Mach. Learn. Res., 2023

Replay-enhanced Continual Reinforcement Learning.

Trans. Mach. Learn. Res., 2023

Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints.

CoRR, 2023

Future-conditioned Unsupervised Pretraining for Decision Transformer.

CoRR, 2023

Deploying Offline Reinforcement Learning with Human Feedback.

CoRR, 2023

Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization.

CoRR, 2023

Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning.

CoRR, 2023

Mutual-Information Regularized Multi-Agent Policy Iteration.

Jiangxing Wang

Deheng Ye

Zongqing Lu

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SeeHow: Workflow Extraction from Programming Screencasts through Action-Aware Video Analytics.

Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

Future-conditioned Unsupervised Pretraining for Decision Transformer.

Proceedings of the International Conference on Machine Learning, 2023

More Centralized Training, Still Decentralized Execution: Multi-Agent Conditional Policy Factorization.

Jiangxing Wang

Deheng Ye

Zongqing Lu

Proceedings of the Eleventh International Conference on Learning Representations, 2023

RLogist: Fast Observation Strategy on Whole-Slide Images with Deep Reinforcement Learning.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings.

IEEE Trans. Neural Networks Learn. Syst., 2022

Pretraining in Deep Reinforcement Learning: A Survey.

CoRR, 2022

Curriculum-based Asymmetric Multi-task Reinforcement Learning.

CoRR, 2022

Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation.

CoRR, 2022

Revisiting Discrete Soft Actor-Critic.

CoRR, 2022

Quantized Adaptive Subgradient Algorithms and Their Applications.

CoRR, 2022

GPN: A Joint Structural Learning Framework for Graph Neural Networks.

CoRR, 2022

Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical Reinforcement Learning.

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

2021

Which Heroes to Pick? Learning to Draft in MOBA Games With Neural Networks and Tree Search.

IEEE Trans. Games, 2021

TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations.

CoRR, 2021

Generating Informative CVE Description From ExploitDB Posts by Extractive Summarization.

CoRR, 2021

Coordinated Proximal Policy Optimization.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned.

Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021

Learning Diverse Policies in MOBA Games via Macro-Goals.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks.

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Boosting Offline Reinforcement Learning with Residual Generative Modeling.

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

UIS-Hunter: Detecting UI Design Smells in Android Apps.

Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering: Companion Proceedings, 2021

Don't Do That! Hunting Down Visual Design Smells in Complex UIs against Design Guidelines.

Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021

2020

Towards Playing Full MOBA Games with Deep Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Relation-Aware Transformer for Portfolio Policy Learning.

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Mastering Complex Control in MOBA Games with Deep Reinforcement Learning.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Adversarial Transfer for Named Entity Boundary Detection with Pointer Networks.

Jing Li

Deheng Ye

Shuo Shang

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018

APIReal: an API recognition and linking approach for online developer forums.

Empir. Softw. Eng., 2018

2017

On domain knowledge organization and extraction in software engineering.

Deheng Ye

PhD thesis, 2017

The structure and dynamics of knowledge network in domain-specific Q&A sites: a case study of stack overflow.

Deheng Ye

Zhenchang Xing

Nachiket Kapre

Empir. Softw. Eng., 2017

Ensemble application of convolutional and recurrent neural networks for multi-label text categorization.

Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Enhancing Knowledge Sharing in Stack Overflow via Automatic External Web Resources Linking.

Proceedings of the 22nd International Conference on Engineering of Complex Computer Systems, 2017

2016

Software-Specific Named Entity Recognition in Software Engineering Social Content.

Proceedings of the IEEE 23rd International Conference on Software Analysis, 2016

Software-specific part-of-speech tagging: an experimental study on stack overflow.

Proceedings of the 31st Annual ACM Symposium on Applied Computing, 2016

From discussion to wisdom: web resource recommendation for hyperlinks in stack overflow.

Proceedings of the 31st Annual ACM Symposium on Applied Computing, 2016

Predicting semantically linkable knowledge in developer online forums via convolutional neural network.

Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, 2016

Learning to Extract API Mentions from Informal Natural Language Discussions.

Proceedings of the 2016 IEEE International Conference on Software Maintenance and Evolution, 2016

GPU-Accelerated High-Level Synthesis for Bitwidth Optimization of FPGA Datapaths.

Nachiket Kapre

Deheng Ye

Proceedings of the 2016 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2016

2015

ActivitySpace: A Remembrance Framework to Support Interapplication Information Needs.

Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering, 2015

2014

MixFX-SCORE: Heterogeneous Fixed-Point Compilation of Dataflow Computations.

Deheng Ye

Nachiket Kapre

Proceedings of the 22nd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2014