Jie Liu

Orcid: 0000-0002-1782-2081

Affiliations:
  • Chinese University of Hong Kong, CUHK, MMLab, Hong Kong
  • Shanghai AI Laboratory, Intelligence Laboratory, China


According to our database, Jie Liu authored at least 20 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Adaptive pessimism via target Q-value for offline reinforcement learning.
Neural Networks, 2024

DDK: Distilling Domain Knowledge for Efficient Large Language Models.
CoRR, 2024

Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level.
CoRR, 2024

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series.
CoRR, 2024

Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models.
CoRR, 2024

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models.
CoRR, 2024

GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Masked Pretraining for Multi-Agent Decision Making.
CoRR, 2023

Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement Learning with Sub-optimal Demonstrations.
CoRR, 2023

Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization for Language Models.
CoRR, 2023

Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors.
Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

ACE: Cooperative Multi-Agent Q-learning with Bidirectional Action-Dependency.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2021
Inception Convolution With Efficient Dilation Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Adaptive Gradient Method with Resilience and Momentum.
CoRR, 2020

