Yi Wu
Orcid: 0000-0001-9057-5817Affiliations:
- Tsinghua University, Institute of Interdisciplinary Information Sciences (IIIS), Beijing, China
- University of California, Berkeley, CA, USA (PhD 2019)
- Microsoft Research Asia
According to our database1,
Yi Wu
authored at least 88 papers
between 2012 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on linkedin.com
-
on orcid.org
On csauthors.net:
Bibliography
2024
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control.
IEEE Robotics Autom. Lett., 2024
ReaLHF: Optimized RLHF Training for Large Language Models through Parameter Reallocation.
CoRR, 2024
CoRR, 2024
Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models.
CoRR, 2024
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Adaptive-Gradient Policy Optimization: Enhancing Policy Learning in Non-Smooth Differentiable Simulations.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization.
Trans. Mach. Learn. Res., 2023
Beyond Information Gain: An Empirical Benchmark for Low-Switching-Cost Reinforcement Learning.
Trans. Mach. Learn. Res., 2023
DeRisk: An Effective Deep Learning Framework for Credit Risk Prediction over Real-World Financial Data.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Grounding Object Relations in Language-Conditioned Robotic Manipulation with Semantic-Spatial Reasoning.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
PhyloTransformer: A Self-supervised Discriminative Model for SARS-CoV-2 Viral Mutation Prediction Based on a Multi-head Self-attention Mechanism.
Proceedings of the 6th International Workshop on Knowledge Discovery from Healthcare Data co-located with 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Efficient Bimanual Handover and Rearrangement via Symmetry-Aware Actor-Critic Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
AlphaSnake: Policy Iteration on a Nondeterministic NP-Hard Markov Decision Process (Student Abstract).
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
CoRR, 2022
Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems.
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
INFORMS J. Comput., 2021
CoRR, 2021
Multi-Agent Vulnerability Discovery for Autonomous Driving with Hazard Arbitration Reward.
CoRR, 2021
PhyloTransformer: A Discriminative Model for Mutation Prediction Based on a Multi-head Self-attention Mechanism.
CoRR, 2021
Disentangled Attention as Intrinsic Regularization for Bimanual Multi-Object Manipulation.
CoRR, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Native Chinese Reader: A Dataset Towards Native-Level Chinese Machine Reading Comprehension.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the Artificial Intelligence - First CAAI International Conference, 2021
2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms.
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the Workshops of the The Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
A Nearly-Black-Box Online Algorithm for Joint Parameter and State Estimation in Temporal Models.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
2015
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015
2012
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012