Rui Yang

ORCID: 0000-0003-3525-1726

Affiliations:
  • Hong Kong University of Science and Technology (HKUST), Hong Kong SAR, China
  • Tsinghua University, China (former)


According to our database, Rui Yang authored at least 35 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models.
Trans. Mach. Learn. Res., 2024

Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness.
J. Artif. Intell. Res., 2024

Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning.
CoRR, 2024

Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling.
CoRR, 2024

Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs.
CoRR, 2024

UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment.
CoRR, 2024

REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment.
CoRR, 2024

BATON: Aligning Text-to-Audio Model with Human Preference Feedback.
CoRR, 2024

BATON: Aligning Text-to-Audio Model Using Human Preference Feedback.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Towards Robust Offline Reinforcement Learning under Diverse Data Corruption.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
ETDNet: Efficient Transformer-Based Detection Network for Surface Defect Detection.
IEEE Trans. Instrum. Meas., 2023

Efficient Multi-Goal Reinforcement Learning via Value Consistency Prioritization.
J. Artif. Intell. Res., 2023

GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models.
CoRR, 2023

Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness.
CoRR, 2023

UniHead: Unifying Multi-Perception for Detection Heads.
CoRR, 2023

What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
CoRR, 2023

Corruption-Robust Offline Reinforcement Learning with General Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
Proceedings of the International Conference on Machine Learning, 2023

BoxSnake: Polygonal Instance Segmentation with Box Supervision.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SSGD: A Smartphone Screen Glass Dataset for Defect Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

GH-QFL: Enhancing Industrial Defect Detection Through Hard Example Mining.
Proceedings of the Artificial Neural Networks and Machine Learning, 2023

2022
UniInst: Unique representation for end-to-end instance segmentation.
Neurocomputing, 2022

Exploiting Reward Shifting in Value-Based Deep RL.
CoRR, 2022

RORL: Robust Offline Reinforcement Learning via Conservative Smoothing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exploit Reward Shifting in Value-Based Deep-RL: Optimistic Curiosity-Based Exploration and Conservative Exploitation via Linear Reward Shaping.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.
Proceedings of the Tenth International Conference on Learning Representations, 2022

ScalableViT: Rethinking the Context-Oriented Generalization of Vision Transformer.
Proceedings of the Computer Vision, 2022

2021
Simulation and Mitigation of the Wrap-Around Artifact in the MRI Image.
Frontiers Comput. Neurosci., 2021

MHER: Model-based Hindsight Experience Replay.
CoRR, 2021

Bias-reduced multi-step hindsight experience replay.
CoRR, 2021

Combining Hindsight with Goal-enhanced Prediction for Multi-goal Reinforcement Learning.
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021

2020
Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization.
CoRR, 2020

