Deheng Ye

Orcid: 0000-0002-1754-1837

According to our database1, Deheng Ye authored at least 63 papers between 2014 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization.
IEEE Trans. Neural Networks Learn. Syst., October, 2024

Learning Versatile Skills with Curriculum Masking.
CoRR, 2024

WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents.
CoRR, 2024

Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks.
CoRR, 2024

A Survey on Self-play Methods in Reinforcement Learning.
CoRR, 2024

Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models.
CoRR, 2024

Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination.
CoRR, 2024

More Agents Is All You Need.
CoRR, 2024

Affordable Generative Agents.
CoRR, 2024

HGAttack: Transferable Heterogeneous Graph Adversarial Attack.
CoRR, 2024

LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Language Model Adaption for Reinforcement Learning with Natural Language Action Space.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Curriculum-Based Asymmetric Multi-Task Reinforcement Learning.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

RLTF: Reinforcement Learning from Unit Test Feedback.
Trans. Mach. Learn. Res., 2023

A Survey on Transformers in Reinforcement Learning.
Trans. Mach. Learn. Res., 2023

Replay-enhanced Continual Reinforcement Learning.
Trans. Mach. Learn. Res., 2023

Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints.
CoRR, 2023

Future-conditioned Unsupervised Pretraining for Decision Transformer.
CoRR, 2023

Deploying Offline Reinforcement Learning with Human Feedback.
CoRR, 2023

Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization.
CoRR, 2023

Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning.
CoRR, 2023

Mutual-Information Regularized Multi-Agent Policy Iteration.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SeeHow: Workflow Extraction from Programming Screencasts through Action-Aware Video Analytics.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

Future-conditioned Unsupervised Pretraining for Decision Transformer.
Proceedings of the International Conference on Machine Learning, 2023

More Centralized Training, Still Decentralized Execution: Multi-Agent Conditional Policy Factorization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

RLogist: Fast Observation Strategy on Whole-Slide Images with Deep Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings.
IEEE Trans. Neural Networks Learn. Syst., 2022

Pretraining in Deep Reinforcement Learning: A Survey.
CoRR, 2022

Curriculum-based Asymmetric Multi-task Reinforcement Learning.
CoRR, 2022

Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation.
CoRR, 2022

Revisiting Discrete Soft Actor-Critic.
CoRR, 2022

Quantized Adaptive Subgradient Algorithms and Their Applications.
CoRR, 2022

GPN: A Joint Structural Learning Framework for Graph Neural Networks.
CoRR, 2022

Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical Reinforcement Learning.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

2021
Which Heroes to Pick? Learning to Draft in MOBA Games With Neural Networks and Tree Search.
IEEE Trans. Games, 2021

TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations.
CoRR, 2021

Generating Informative CVE Description From ExploitDB Posts by Extractive Summarization.
CoRR, 2021

Coordinated Proximal Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021


Learning Diverse Policies in MOBA Games via Macro-Goals.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Boosting Offline Reinforcement Learning with Residual Generative Modeling.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

UIS-Hunter: Detecting UI Design Smells in Android Apps.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering: Companion Proceedings, 2021

Don't Do That! Hunting Down Visual Design Smells in Complex UIs against Design Guidelines.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021

2020
Towards Playing Full MOBA Games with Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Relation-Aware Transformer for Portfolio Policy Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Mastering Complex Control in MOBA Games with Deep Reinforcement Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Adversarial Transfer for Named Entity Boundary Detection with Pointer Networks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018
APIReal: an API recognition and linking approach for online developer forums.
Empir. Softw. Eng., 2018

2017
On domain knowledge organization and extraction in software engineering
PhD thesis, 2017

The structure and dynamics of knowledge network in domain-specific Q&A sites: a case study of stack overflow.
Empir. Softw. Eng., 2017

Ensemble application of convolutional and recurrent neural networks for multi-label text categorization.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Enhancing Knowledge Sharing in Stack Overflow via Automatic External Web Resources Linking.
Proceedings of the 22nd International Conference on Engineering of Complex Computer Systems, 2017

2016
Software-Specific Named Entity Recognition in Software Engineering Social Content.
Proceedings of the IEEE 23rd International Conference on Software Analysis, 2016

Software-specific part-of-speech tagging: an experimental study on stack overflow.
Proceedings of the 31st Annual ACM Symposium on Applied Computing, 2016

From discussion to wisdom: web resource recommendation for hyperlinks in stack overflow.
Proceedings of the 31st Annual ACM Symposium on Applied Computing, 2016

Predicting semantically linkable knowledge in developer online forums via convolutional neural network.
Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, 2016

Learning to Extract API Mentions from Informal Natural Language Discussions.
Proceedings of the 2016 IEEE International Conference on Software Maintenance and Evolution, 2016

GPU-Accelerated High-Level Synthesis for Bitwidth Optimization of FPGA Datapaths.
Proceedings of the 2016 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2016

2015
ActivitySpace: A Remembrance Framework to Support Interapplication Information Needs.
Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering, 2015

2014
MixFX-SCORE: Heterogeneous Fixed-Point Compilation of Dataflow Computations.
Proceedings of the 22nd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2014


  Loading...