Yan Zheng

Orcid: 0000-0003-2741-058X

Affiliations:

Tianjin University, China

According to our database¹, Yan Zheng authored at least 83 papers between 2015 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

Is AI testing beneficial for the manufacturer and social welfare? Optimal test strategy of a smart product.

[BibT_eX]

[DOI]

Expert Syst. Appl., 2024

CellAgent: An LLM-driven Multi-Agent Framework for Automated Single-cell Data Analysis.

[BibT_eX]

[DOI]

CoRR, 2024

MFE-ETP: A Comprehensive Evaluation Benchmark for Multi-modal Foundation Models on Embodied Task Planning.

[BibT_eX]

[DOI]

CoRR, 2024

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making.

[BibT_eX]

[DOI]

CoRR, 2024

SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

DiffuserLite: Towards Real-time Diffusion Planning.

[BibT_eX]

[DOI]

CoRR, 2024

Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey.

[BibT_eX]

[DOI]

CoRR, 2024

The MMO Economist: AI Empowers Robust, Healthy, and Sustainable P2W MMO Economies.

[BibT_eX]

[DOI]

Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Value-Evolutionary-Based Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Improving Neural Logic Machines via Failure Reflection.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

EvoRainbow: Combining Improvements in Evolutionary Reinforcement Learning for Policy Search.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Generate Subgoal Images Before Act: Unlocking the Chain-of-Thought Reasoning in Diffusion Model for Robot Manipulation with Multimodal Prompts.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PADDLE: Logic Program Guided Policy Reuse in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

A Transfer Approach Using Graph Neural Networks in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Accelerating deep reinforcement learning via knowledge-guided policy network.

[BibT_eX]

[DOI]

Auton. Agents Multi Agent Syst., June, 2023

Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Ensemble-based Offline-to-Online Reinforcement Learning: From Pessimistic Learning to Optimistic Exploration.

[BibT_eX]

[DOI]

CoRR, 2023

HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach.

[BibT_eX]

[DOI]

CoRR, 2023

In-Sample Policy Iteration for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Prompt Injection attack against LLM-integrated Applications.

[BibT_eX]

[DOI]

CoRR, 2023

An Automatic Test Plan Generation Approach for Automotive Software Testing.

[BibT_eX]

[DOI]

Proceedings of the New Trends in Intelligent Software Methodologies, Tools and Techniques, 2023

T3S: Improving Multi-Task Reinforcement Learning with Task-Specific Feature Selector and Scheduler.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2023

Generative Flow Networks for Precise Reward-Oriented Active Learning on Graphs.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolution.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Neural Episodic Control with State Abstraction.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Limited Information Opponent Modeling.

[BibT_eX]

[DOI]

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2023, 2023

BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization.

[BibT_eX]

[DOI]

Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

A Hierarchical Imitation Learning-based Decision Framework for Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022

ERL-Re<sup>2</sup>: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation.

[BibT_eX]

[DOI]

CoRR, 2022

Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes.

[BibT_eX]

[DOI]

CoRR, 2022

PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration.

[BibT_eX]

[DOI]

CoRR, 2022

API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks.

[BibT_eX]

[DOI]

CoRR, 2022

Cross-domain adaptive transfer reinforcement learning based on state-action correspondence.

[BibT_eX]

[DOI]

Proceedings of the Uncertainty in Artificial Intelligence, 2022

A Holistic Automated Software Structure Exploration Framework for Testing.

[BibT_eX]

[DOI]

Proceedings of the New Trends in Intelligent Software Methodologies, Tools and Techniques, 2022

GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Efficient Deep Reinforcement Learning via Policy-Extended Successor Feature Approximator.

[BibT_eX]

[DOI]

Proceedings of the Distributed Artificial Intelligence - 4th International Conference, 2022

2021

Vulnerability Assessment of Deep Reinforcement Learning Models for Power System Topology Optimization.

[BibT_eX]

[DOI]

IEEE Trans. Smart Grid, 2021

ED2: An Environment Dynamics Decomposition Framework for World Model Construction.

[BibT_eX]

[DOI]

CoRR, 2021

Efficient policy detecting and reusing for non-stationarity in Markov games.

[BibT_eX]

[DOI]

Auton. Agents Multi Agent Syst., 2021

Detecting and Learning Against Unknown Opponents for Automated Negotiations.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021

Automatic HMI Structure Exploration Via Curiosity-Based Reinforcement Learning.

[BibT_eX]

[DOI]

Vinay Vishnumurthy Adiga

Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, 2021

A deep reinforcement learning-based agent for negotiation with multiple communication channels.

[BibT_eX]

[DOI]

Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021

Automatic Web Testing Using Curiosity-Driven Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021

Uncertainty-Aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Distributed Artificial Intelligence - Third International Conference, 2021

2020

Efficient Multiagent Policy Optimization Based on Weighted Estimators in Stochastic Cooperative Environments.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2020

KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Generating Behavior-Diverse Game AIs with Evolutionary Multi-Objective Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Stealthy and Efficient Adversarial Attacks against Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Continuous Multiagent Control Using Collective Behavior Entropy for Large-Scale Home Energy Management.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Schedule optimization under fuzzy constraints of vehicle capacity.

[BibT_eX]

[DOI]

Fuzzy Optim. Decis. Mak., 2019

Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019

Large-Scale Home Energy Management Using Entropy-Based Collective Multiagent Deep Reinforcement Learning Framework.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Towards Efficient Detection and Optimal Response against Sophisticated Opponents.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Large-Scale Home Energy Management Using Entropy-Based Collective Multiagent Reinforcement Learning Framework.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018

Localizing multiple software faults based on evolution algorithm.

[BibT_eX]

[DOI]

J. Syst. Softw., 2018

Real-time predication and navigation on traffic congestion model with equilibrium Markov chain.

[BibT_eX]

[DOI]

Int. J. Distributed Sens. Networks, 2018

Hierarchical Deep Multiagent Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2018

Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents.

[BibT_eX]

[DOI]

CoRR, 2018

Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments.

[BibT_eX]

[DOI]

Yan Zheng

Jianye Hao

Zongzhang Zhang

CoRR, 2018

Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

An Accurate Modeling Technology Based on Depth Information of Cultural Relics.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Computational Intelligence and Design, 2018

2017

SOMR: Towards a Security-Oriented MapReduce Infrastructure.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Trustcom/BigDataSE/ICESS, Sydney, Australia, August 1-4, 2017, 2017

Preliminary Application of Gesture Recognition to Virtual Acupuncture.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Computational Intelligence and Design, 2017

2015

Layered modeling and generation of Pollock's drip style.

[BibT_eX]

[DOI]

Vis. Comput., 2015

A Short-Text Oriented Clustering Method for Hot Topics Extraction.

[BibT_eX]

[DOI]

Yan Zheng

Zhao-Peng Meng

Chao Xu

Int. J. Softw. Eng. Knowl. Eng., 2015

Yan Zheng

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...