Chenjia Bai

Orcid: 0000-0002-8379-9385

According to our database1, Chenjia Bai authored at least 48 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Skill matters: Dynamic skill learning for multi-agent cooperative reinforcement learning.
Neural Networks, 2025

2024
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain.
IEEE Trans. Neural Networks Learn. Syst., July, 2024

Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., July, 2024

False Correlation Reduction for Offline Reinforcement Learning.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning.
Artif. Intell., January, 2024

Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness.
J. Artif. Intell. Res., 2024

Diverse randomized value functions: A provably pessimistic approach for offline reinforcement learning.
Inf. Sci., 2024

ODRL: A Benchmark for Off-Dynamics Reinforcement Learning.
CoRR, 2024

Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control.
CoRR, 2024

Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner.
CoRR, 2024

Forward KL Regularized Preference Optimization for Aligning Diffusion Policies.
CoRR, 2024

Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models.
CoRR, 2024

SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation.
CoRR, 2024

Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration.
CoRR, 2024

Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning.
CoRR, 2024

Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning.
CoRR, 2024

Regularized Conditional Diffusion Model for Multi-Task Preference Alignment.
CoRR, 2024

Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning.
CoRR, 2024

Robust Quadrupedal Locomotion via Risk-Averse Policy Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

How Does Goal Relabeling Improve Sample Efficiency?
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Cross-Domain Policy Adaptation by Capturing Representation Mismatch.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Constrained Ensemble Exploration for Unsupervised Skill Discovery.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SelfBC: Self Behavior Cloning for Offline Reinforcement Learning.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Self-Supervised Imitation for Offline Reinforcement Learning With Hindsight Relabeling.
IEEE Trans. Syst. Man Cybern. Syst., December, 2023

Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., August, 2023

Addressing Hindsight Bias in Multigoal Reinforcement Learning.
IEEE Trans. Cybern., 2023

Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness.
CoRR, 2023

Robust Quadrupedal Locomotion via Risk-Averse Policy Learning.
CoRR, 2023

Privileged Knowledge Distillation for Sim-to-Real Policy Generalization.
CoRR, 2023

On the Value of Myopic Behavior in Policy Reuse.
CoRR, 2023

Cross-Domain Policy Adaptation via Value-Guided Data Filtering.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Behavior Contrastive Learning for Unsupervised Skill Discovery.
Proceedings of the International Conference on Machine Learning, 2023

2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning.
CoRR, 2021

Exploration in Deep Reinforcement Learning: A Comprehensive Survey.
CoRR, 2021

Dynamic Bottleneck for Robust Self-Supervised Exploration.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Principled Exploration via Optimistic Bootstrapping and Backward Induction.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Obtaining accurate estimated action values in categorical distributional reinforcement learning.
Knowl. Based Syst., 2020

Generating attentive goals for prioritized hindsight reinforcement learning.
Knowl. Based Syst., 2020

深度强化学习中稀疏奖励问题研究综述 (Survey on Sparse Reward in Deep Reinforcement Learning).
计算机科学, 2020

Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning.
CoRR, 2020

2019
Guided goal generation for hindsight multi-goal reinforcement learning.
Neurocomputing, 2019


  Loading...