Yan Zheng

Orcid: 0000-0003-2741-058X

Affiliations:
  • Tianjin University, China


According to our database1, Yan Zheng authored at least 76 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Is AI testing beneficial for the manufacturer and social welfare? Optimal test strategy of a smart product.
Expert Syst. Appl., 2024

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making.
CoRR, 2024

SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models.
CoRR, 2024

Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models.
CoRR, 2024

Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback.
CoRR, 2024

DiffuserLite: Towards Real-time Diffusion Planning.
CoRR, 2024

Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey.
CoRR, 2024

The MMO Economist: AI Empowers Robust, Healthy, and Sustainable P2W MMO Economies.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

PADDLE: Logic Program Guided Policy Reuse in Deep Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

A Transfer Approach Using Graph Neural Networks in Deep Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Accelerating deep reinforcement learning via knowledge-guided policy network.
Auton. Agents Multi Agent Syst., June, 2023

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments.
CoRR, 2023

AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model.
CoRR, 2023

Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning.
CoRR, 2023

Ensemble-based Offline-to-Online Reinforcement Learning: From Pessimistic Learning to Optimistic Exploration.
CoRR, 2023

HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach.
CoRR, 2023

In-Sample Policy Iteration for Offline Reinforcement Learning.
CoRR, 2023

Prompt Injection attack against LLM-integrated Applications.
CoRR, 2023

MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL.
CoRR, 2023

T3S: Improving Multi-Task Reinforcement Learning with Task-Specific Feature Selector and Scheduler.
Proceedings of the International Joint Conference on Neural Networks, 2023

Generative Flow Networks for Precise Reward-Oriented Active Learning on Graphs.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL.
Proceedings of the International Conference on Machine Learning, 2023

RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolution.
Proceedings of the International Conference on Machine Learning, 2023

EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Limited Information Opponent Modeling.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2023, 2023

BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

A Hierarchical Imitation Learning-based Decision Framework for Autonomous Driving.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
ERL-Re<sup>2</sup>: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation.
CoRR, 2022

Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes.
CoRR, 2022

GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis.
CoRR, 2022

PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations.
CoRR, 2022

PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration.
CoRR, 2022

API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks.
CoRR, 2022

Cross-domain adaptive transfer reinforcement learning based on state-action correspondence.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

A Holistic Automated Software Structure Exploration Framework for Testing.
Proceedings of the New Trends in Intelligent Software Methodologies, Tools and Techniques, 2022

GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration.
Proceedings of the International Conference on Machine Learning, 2022

HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Efficient Deep Reinforcement Learning via Policy-Extended Successor Feature Approximator.
Proceedings of the Distributed Artificial Intelligence - 4th International Conference, 2022

2021
Vulnerability Assessment of Deep Reinforcement Learning Models for Power System Topology Optimization.
IEEE Trans. Smart Grid, 2021

ED2: An Environment Dynamics Decomposition Framework for World Model Construction.
CoRR, 2021

Efficient policy detecting and reusing for non-stationarity in Markov games.
Auton. Agents Multi Agent Syst., 2021

Detecting and Learning Against Unknown Opponents for Automated Negotiations.
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021

Automatic HMI Structure Exploration Via Curiosity-Based Reinforcement Learning.
Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, 2021

A deep reinforcement learning-based agent for negotiation with multiple communication channels.
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021

Automatic Web Testing Using Curiosity-Driven Reinforcement Learning.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021

Uncertainty-Aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning.
Proceedings of the Distributed Artificial Intelligence - Third International Conference, 2021

2020
Efficient Multiagent Policy Optimization Based on Weighted Estimators in Stochastic Cooperative Environments.
J. Comput. Sci. Technol., 2020

KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Generating Behavior-Diverse Game AIs with Evolutionary Multi-Objective Deep Reinforcement Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Stealthy and Efficient Adversarial Attacks against Deep Reinforcement Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Continuous Multiagent Control Using Collective Behavior Entropy for Large-Scale Home Energy Management.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Schedule optimization under fuzzy constraints of vehicle capacity.
Fuzzy Optim. Decis. Mak., 2019

Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning.
Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019

Large-Scale Home Energy Management Using Entropy-Based Collective Multiagent Deep Reinforcement Learning Framework.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Towards Efficient Detection and Optimal Response against Sophisticated Opponents.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Large-Scale Home Energy Management Using Entropy-Based Collective Multiagent Reinforcement Learning Framework.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018
Real-time predication and navigation on traffic congestion model with equilibrium Markov chain.
Int. J. Distributed Sens. Networks, 2018

Hierarchical Deep Multiagent Reinforcement Learning.
CoRR, 2018

Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents.
CoRR, 2018

Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments.
CoRR, 2018

Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
SOMR: Towards a Security-Oriented MapReduce Infrastructure.
Proceedings of the 2017 IEEE Trustcom/BigDataSE/ICESS, Sydney, Australia, August 1-4, 2017, 2017

Preliminary Application of Gesture Recognition to Virtual Acupuncture.
Proceedings of the 10th International Symposium on Computational Intelligence and Design, 2017

2015
Layered modeling and generation of Pollock's drip style.
Vis. Comput., 2015

A Short-Text Oriented Clustering Method for Hot Topics Extraction.
Int. J. Softw. Eng. Knowl. Eng., 2015


  Loading...