Jie Liu

Orcid: 0000-0002-1782-2081

Affiliations:

Chinese University of Hong Kong, CUHK, MMLab, Hong Kong
Shanghai AI Laboratory, Intelligence Laboratory, China

According to our database¹, Jie Liu authored at least 20 papers between 2020 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

Adaptive pessimism via target Q-value for offline reinforcement learning.

[BibT_eX]

[DOI]

Neural Networks, 2024

DDK: Distilling Domain Knowledge for Efficient Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level.

[BibT_eX]

[DOI]

CoRR, 2024

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series.

[BibT_eX]

[DOI]

CoRR, 2024

Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Masked Pretraining for Multi-Agent Decision Making.

[BibT_eX]

[DOI]

CoRR, 2023

Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement Learning with Sub-optimal Demonstrations.

[BibT_eX]

[DOI]

CoRR, 2023

Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization for Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors.

[BibT_eX]

[DOI]

Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning.

[BibT_eX]

[DOI]

Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

ACE: Cooperative Multi-Agent Q-learning with Bidirectional Action-Dependency.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2021

Inception Convolution With Efficient Dilation Search.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Adaptive Gradient Method with Resilience and Momentum.

[BibT_eX]

[DOI]

CoRR, 2020

Jie Liu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...