Yang Yu
ORCID: 0009-0008-6824-1480
Affiliations:
- Nanjing University, State Key Laboratory for Novel Software Technology, China (PhD 2011)
- Pazhou Lab, Guangzhou, China
According to our database, Yang Yu authored at least 236 papers between 2005 and 2025.
Online presence:
- wolai.com
- orcid.org
- github.com
- dl.acm.org
Bibliography
2025
Open and real-world human-AI coordination by heterogeneous training with communication.
Frontiers Comput. Sci., April, 2025
2024
Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation.
Frontiers Comput. Sci., December, 2024
Model gradient: unified model and policy learning in model-based reinforcement learning.
Frontiers Comput. Sci., August, 2024
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems.
ACM Trans. Inf. Syst., July, 2024
IEEE Trans. Ind. Informatics, February, 2024
IEEE Trans. Dependable Secur. Comput., 2024
CoRR, 2024
Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models.
CoRR, 2024
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation.
CoRR, 2024
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning.
CoRR, 2024
CoRR, 2024
Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts.
CoRR, 2024
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2024
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Deep Demonstration Tracing: Learning Generalizable Imitator Policy for Runtime Imitation from a Single Demonstration.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Flow to Better: Offline Preference-based Reinforcement Learning via Preferred Trajectory Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
Cost-aware Offline Safe Meta Reinforcement Learning with Robust In-Distribution Online Task Adaptation.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
IEEE Trans. Neural Networks Learn. Syst., December, 2023
Offline Model-Based Adaptable Policy Learning for Decision-Making in Out-of-Support Regions.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023
ACM Trans. Graph., October, 2023
Neural Networks, April, 2023
AliExpress Learning-to-Rank: Maximizing Online Model Performance Without Going Online.
IEEE Trans. Knowl. Data Eng., 2023
A Survey of Progress on Cooperative Multi-agent Reinforcement Learning in Open Environment.
CoRR, 2023
CoRR, 2023
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models.
CoRR, 2023
Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments.
CoRR, 2023
CoRR, 2023
Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning.
CoRR, 2023
Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation.
CoRR, 2023
Proceedings of the Uncertainty in Artificial Intelligence, 2023
Proceedings of the Uncertainty in Artificial Intelligence, 2023
Natural Language Instruction-following with Task-related Language Development and Translation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023
Proceedings of the Fifth International Conference on Distributed Artificial Intelligence, 2023
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Robust Multi-Agent Coordination via Evolutionary Generation of Auxiliary Adversarial Attackers.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Policy-Independent Behavioral Metric-Based Representation for Deep Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Learning Generalizable Batch Active Learning Strategies via Deep Q-networks (Student Abstract).
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Untargeted Attack against Federated Recommendation Systems via Poisonous Item Embeddings and the Defense.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Vis. Comput., 2022
IEEE Trans. Neural Networks Learn. Syst., 2022
Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning.
IEEE Trans. Games, 2022
IEEE Trans. Pattern Anal. Mach. Intell., 2022
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Improve generated adversarial imitation learning with reward variance regularization.
Mach. Learn., 2022
J. Artif. Intell. Res., 2022
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games.
CoRR, 2022
Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis.
CoRR, 2022
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning.
CoRR, 2022
CoRR, 2022
Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the 9th IEEE International Conference on Cyber Security and Cloud Computing, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
IEEE Trans. Pattern Anal. Mach. Intell., 2021
Partially observable environment estimation with uplift inference for reinforcement learning based recommendation.
Mach. Learn., 2021
Formal Aspects Comput., 2021
CoRR, 2021
Nearly Minimax Optimal Adversarial Imitation Learning with Known and Unknown Transitions.
CoRR, 2021
Sci. China Inf. Sci., 2021
AI Mag., 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the International Joint Conference on Neural Networks, 2021
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Enhancing Context-Based Meta-Reinforcement Learning Algorithms via An Efficient Task Encoder (Student Abstract).
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
LB-DESPOT: Efficient Online POMDP Planning Considering Lower Bound in Action Selection (Student Abstract).
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Theor. Comput. Sci., 2020
Validation Set Evaluation can be Wrong: An Evaluator-Generator Approach for Maximizing Online Performance of Ranking in E-commerce.
CoRR, 2020
Comput. Graph. Forum, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Derivative-Free Optimization with Adaptive Experience for Efficient Hyper-Parameter Tuning.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August - 8 September 2020, Santiago de Compostela, Spain, 2020
Proceedings of the IEEE Conference on Games, 2020
An Efficient Evolutionary Algorithm for Subset Selection with General Cost Constraints.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
CoRR, 2019
Maximizing submodular or monotone approximately submodular functions by multi-objective evolutionary algorithms.
Artif. Intell., 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Cascaded Algorithm-Selection and Hyper-Parameter Optimization with Extreme-Region Upper Confidence Bound Bandit.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Proceedings of the Neural Information Processing - 26th International Conference, 2019
Proceedings of the First International Conference on Distributed Artificial Intelligence, 2019
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019
Virtual-Taobao: Virtualizing Real-World Online Retail Environment for Reinforcement Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Springer, ISBN: 978-981-13-5955-2, 2019
2018
IEEE Trans. Neural Networks Learn. Syst., 2018
On the Effectiveness of Sampling for Evolutionary Optimization in Noisy Environments.
Evol. Comput., 2018
CoRR, 2018
CoRR, 2018
CoRR, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018
Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
An Alternating Minimization Approach to Optimizing Subarray Configuration for a Large Phased Array.
Proceedings of the 10th IEEE Sensor Array and Multichannel Signal Processing Workshop, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Maximizing Non-monotone/Non-submodular Functions by Multi-objective Evolutionary Algorithms.
CoRR, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
AGRA: An Analysis-Generation-Ranking Framework for Automatic Abbreviation from Paper Titles.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
Proceedings of the 2017 IEEE Congress on Evolutionary Computation, 2017
Solving High-Dimensional Multi-Objective Optimization Problems with Low Effective Dimensions.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
Proceedings of the PRICAI 2016: Trends in Artificial Intelligence, 2016
Symbolic execution of complex program driven by machine learning based constraint solving.
Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, 2016
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Derivative-Free Optimization of High-Dimensional Non-Convex Functions by Sequential Random Embeddings.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
A Lower Bound Analysis of Population-Based Evolutionary Algorithms for Pseudo-Boolean Functions.
Proceedings of the Intelligent Data Engineering and Automated Learning - IDEAL 2016, 2016
Proceedings of the IEEE Congress on Evolutionary Computation, 2016
Proceedings of the Bio-inspired Computing - Theories and Applications, 2016
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
Scaling Simultaneous Optimistic Optimization for High-Dimensional Non-Convex Functions with Low Effective Dimensions.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
IEEE Trans. Evol. Comput., 2015
Soft Comput., 2015
Sci. China Inf. Sci., 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015
Proceedings of the IEEE Congress on Evolutionary Computation, 2015
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
2014
On the Effectiveness of Sampling for Evolutionary Optimization in Noisy Environments.
Proceedings of the Parallel Problem Solving from Nature - PPSN XIII, 2014
Proceedings of the IEEE Congress on Evolutionary Computation, 2014
Proceedings of the International Conference on Autonomous Agents and Multi-Agent Systems, 2014
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
2013
Artif. Intell., 2013
Proceedings of the Partially Supervised Learning - Second IAPR International Workshop, 2013
On the Approximation Ability of Evolutionary Optimization with Application to Minimum Set Cover: Extended Abstract.
Proceedings of the IJCAI 2013, 2013
2012
On the approximation ability of evolutionary optimization with application to minimum set cover.
Artif. Intell., 2012
Proceedings of the Parallel Problem Solving from Nature - PPSN XII, 2012
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2012
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012
2011
Towards Analyzing Crossover Operators in Evolutionary Search via General Markov Chain Switching Theorem.
CoRR, 2011
Proceedings of the Data Mining Workshops (ICDMW), 2011
Proceedings of the 13th Annual Genetic and Evolutionary Computation Conference, 2011
2010
Knowl. Inf. Syst., 2010
Proceedings of the Parallel Problem Solving from Nature, 2010
2009
Proceedings of the ICDM 2009, 2009
2008
A new approach to estimating the expected first hitting time of evolutionary algorithms.
Artif. Intell., 2008
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008
On the usefulness of infeasible solutions in evolutionary search: A theoretical study.
Proceedings of the IEEE Congress on Evolutionary Computation, 2008
2007
Int. J. Data Warehous. Min., 2007
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007
2005
IEEE Trans. Syst. Man Cybern. Part B, 2005