Yi Wu

Orcid: 0000-0001-9057-5817

Affiliations:
  • Tsinghua University, Institute of Interdisciplinary Information Sciences (IIIS), Beijing, China
  • University of California, Berkeley, CA, USA (PhD 2019)
  • Microsoft Research Asia


According to our database1, Yi Wu authored at least 88 papers between 2012 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control.
IEEE Robotics Autom. Lett., 2024

Quarl: A Learning-Based Quantum Circuit Optimizer.
Proc. ACM Program. Lang., 2024

Few-shot In-Context Preference Learning Using Large Language Models.
CoRR, 2024

On Designing Effective RL Reward at Training Time for LLM Reasoning.
CoRR, 2024

ReaLHF: Optimized RLHF Training for Large Language Models through Parameter Reallocation.
CoRR, 2024

FlightBench: A Comprehensive Benchmark of Spatial Planning Methods for Quadrotors.
CoRR, 2024

Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models.
CoRR, 2024

Leveraging Symmetry in RL-based Legged Locomotion Control.
CoRR, 2024

Robot Synesthesia: In-Hand Manipulation with Visuotactile Sensing.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

LAGOON: Language-Guided Motion Control.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Adaptive-Gradient Policy Optimization: Enhancing Policy Learning in Non-Smooth Differentiable Simulations.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization.
Trans. Mach. Learn. Res., 2023

Beyond Information Gain: An Empirical Benchmark for Low-Switching-Cost Reinforcement Learning.
Trans. Mach. Learn. Res., 2023

Learning Agile Bipedal Motions on a Quadrupedal Robot.
CoRR, 2023

BitNet: Scaling 1-bit Transformers for Large Language Models.
CoRR, 2023

DeRisk: An Effective Deep Learning Framework for Credit Risk Prediction over Real-World Financial Data.
CoRR, 2023

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores.
CoRR, 2023

Language-Guided Generation of Physically Realistic Robot Motion and Control.
CoRR, 2023

Grounding Object Relations in Language-Conditioned Robotic Manipulation with Semantic-Spatial Reasoning.
CoRR, 2023

Iteratively Learn Diverse Strategies with State Distance Information.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PhyloTransformer: A Self-supervised Discriminative Model for SARS-CoV-2 Viral Mutation Prediction Based on a Multi-head Self-attention Mechanism.
Proceedings of the 6th International Workshop on Knowledge Discovery from Healthcare Data co-located with 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

Automatic Truss Design with Reinforcement Learning.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Efficient Bimanual Handover and Rearrangement via Symmetry-Aware Actor-Critic Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

SpeedyZero: Mastering Atari with Limited Data and Time.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Asynchronous Multi-Agent Reinforcement Learning for Efficient Real-Time Multi-Robot Cooperative Exploration.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Differentiable Arbitrating in Zero-sum Markov Games.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

AlphaSnake: Policy Iteration on a Nondeterministic NP-Hard Markov Decision Process (Student Abstract).
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process.
CoRR, 2022

Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems.
CoRR, 2022

Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Grounded Reinforcement Learning: Learning to Win the Game under Human Commands.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning Efficient Multi-agent Cooperative Visual Exploration.
Proceedings of the Computer Vision - ECCV 2022, 2022

Sequence Level Contrastive Learning for Text Summarization.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Near-Linear Time Local Polynomial Nonparametric Estimation with Box Kernels.
INFORMS J. Comput., 2021

Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination.
CoRR, 2021

A Benchmark for Low-Switching-Cost Reinforcement Learning.
CoRR, 2021

Multi-Agent Vulnerability Discovery for Autonomous Driving with Hazard Arbitration Reward.
CoRR, 2021

PhyloTransformer: A Discriminative Model for Mutation Prediction Based on a Multi-head Self-attention Mechanism.
CoRR, 2021

Disentangled Attention as Intrinsic Regularization for Bimanual Multi-Object Manipulation.
CoRR, 2021

The Surprising Effectiveness of MAPPO in Cooperative, Multi-Agent Games.
CoRR, 2021

NovelD: A Simple yet Effective Exploration Criterion.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Native Chinese Reader: A Dataset Towards Native-Level Chinese Machine Reading Comprehension.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning to Design and Construct Bridge without Blueprint.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Temporal Induced Self-Play for Stochastic Bayesian Games.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization.
Proceedings of the 9th International Conference on Learning Representations, 2021

Solving Compositional Reinforcement Learning Problems via Task Reduction.
Proceedings of the 9th International Conference on Learning Representations, 2021

Unlocking the Potential of MAPPO with Asynchronous Optimization.
Proceedings of the Artificial Intelligence - First CAAI International Conference, 2021

2020
BeBold: Exploration Beyond the Boundary of Explored Regions.
CoRR, 2020

Multi-Agent Collaboration via Reward Attribution Decomposition.
CoRR, 2020

Multi-Task Reinforcement Learning with Soft Modularization.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Evolutionary Population Curriculum for Scaling Multi-Agent Reinforcement Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Emergent Tool Use From Multi-Agent Autocurricula.
Proceedings of the 8th International Conference on Learning Representations, 2020

Influence-Based Multi-Agent Exploration.
Proceedings of the 8th International Conference on Learning Representations, 2020

Unsupervised Extractive Summarization by Pre-training Hierarchical Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2019
Bayesian Relational Memory for Semantic Visual Navigation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Deep Reinforcement Learning for Green Security Games with Real-Time Information.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Learning and Planning with a Semantic Model.
CoRR, 2018

Near-Linear Time Local Polynomial Nonparametric Estimation.
CoRR, 2018

Meta-Learning MCMC Proposals.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms.
Proceedings of the 35th International Conference on Machine Learning, 2018

Building Generalizable Agents with a Realistic and Rich 3D Environment.
Proceedings of the 6th International Conference on Learning Representations, 2018

Deep Reinforcement Learning for Green Security Game with Online Information.
Proceedings of the Workshops of the The Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Neural Block Sampling.
CoRR, 2017

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Adversarial Training for Relation Extraction.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Nearly-Black-Box Online Algorithm for Joint Parameter and State Estimation in Temporal Models.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Towards Practical Bayesian Parameter and State Estimation.
CoRR, 2016

Value Iteration Networks.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Swift: Compiled Inference for Probabilistic Programming Languages.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015
Understanding and Evaluating Sparse Linear Discriminant Analysis.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2012
Dual-Space Analysis of the Sparse Linear Model.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012


  Loading...