Chenjia Bai

Orcid: 0000-0002-8379-9385

According to our database¹, Chenjia Bai authored at least 48 papers between 2019 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Skill matters: Dynamic skill learning for multi-agent cooperative reinforcement learning.

[BibT_eX]

[DOI]

Neural Networks, 2025

2024

Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., July, 2024

Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., July, 2024

False Correlation Reduction for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Artif. Intell., January, 2024

Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2024

Diverse randomized value functions: A provably pessimistic approach for offline reinforcement learning.

[BibT_eX]

[DOI]

Inf. Sci., 2024

ODRL: A Benchmark for Off-Dynamics Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control.

[BibT_eX]

[DOI]

CoRR, 2024

Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner.

[BibT_eX]

[DOI]

CoRR, 2024

Forward KL Regularized Preference Optimization for Aligning Diffusion Policies.

[BibT_eX]

[DOI]

CoRR, 2024

Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models.

[BibT_eX]

[DOI]

CoRR, 2024

SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation.

[BibT_eX]

[DOI]

CoRR, 2024

Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration.

[BibT_eX]

[DOI]

CoRR, 2024

Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Regularized Conditional Diffusion Model for Multi-Task Preference Alignment.

[BibT_eX]

[DOI]

CoRR, 2024

Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Robust Quadrupedal Locomotion via Risk-Averse Policy Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

How Does Goal Relabeling Improve Sample Efficiency?

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Cross-Domain Policy Adaptation by Capturing Representation Mismatch.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Constrained Ensemble Exploration for Unsupervised Skill Discovery.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

SelfBC: Self Behavior Cloning for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Self-Supervised Imitation for Offline Reinforcement Learning With Hindsight Relabeling.

[BibT_eX]

[DOI]

IEEE Trans. Syst. Man Cybern. Syst., December, 2023

Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., August, 2023

Addressing Hindsight Bias in Multigoal Reinforcement Learning.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2023

Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness.

[BibT_eX]

[DOI]

CoRR, 2023

Robust Quadrupedal Locomotion via Risk-Averse Policy Learning.

[BibT_eX]

[DOI]

CoRR, 2023

Privileged Knowledge Distillation for Sim-to-Real Policy Generalization.

[BibT_eX]

[DOI]

CoRR, 2023

On the Value of Myopic Behavior in Policy Reuse.

[BibT_eX]

[DOI]

CoRR, 2023

Cross-Domain Policy Adaptation via Value-Guided Data Filtering.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Behavior Contrastive Learning for Unsupervised Skill Discovery.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

2022

RORL: Robust Offline Reinforcement Learning via Conservative Smoothing.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Exploration in Deep Reinforcement Learning: A Comprehensive Survey.

[BibT_eX]

[DOI]

CoRR, 2021

Dynamic Bottleneck for Robust Self-Supervised Exploration.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Principled Exploration via Optimistic Bootstrapping and Backward Induction.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

Obtaining accurate estimated action values in categorical distributional reinforcement learning.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2020

Generating attentive goals for prioritized hindsight reinforcement learning.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2020

深度强化学习中稀疏奖励问题研究综述 (Survey on Sparse Reward in Deep Reinforcement Learning).

[BibT_eX]

[DOI]

计算机科学, 2020

Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2020

2019

Guided goal generation for hindsight multi-goal reinforcement learning.

[BibT_eX]

[DOI]

Neurocomputing, 2019

Chenjia Bai

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...