Tianqi Liu

Orcid: 0000-0003-4497-3317

Affiliations:
  • Google DeepMind


According to our database1, Tianqi Liu authored at least 26 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

2021
2022
2023
2024
0
5
10
15
20
9
5
1
9
2

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Evolving Alignment via Asymmetric Self-Play.
CoRR, 2024

RRM: Robust Reward Model Training Mitigates Reward Hacking.
CoRR, 2024

Building Math Agents with Multi-Turn Iterative Preference Learning.
CoRR, 2024

LAMPO: Large Language Models as Preference Machines for Few-shot Ordinal Classification.
CoRR, 2024

Boosting Reward Model with Preference-Conditional Multi-Aspect Synthetic Data Generation.
CoRR, 2024

Offline Regularised Reinforcement Learning for Large Language Models Alignment.
CoRR, 2024

Human Alignment of Large Language Models through Online Preference Optimisation.
CoRR, 2024

Direct Language Model Alignment from Online AI Feedback.
CoRR, 2024

LiPO: Listwise Preference Optimization through Learning-to-Rank.
CoRR, 2024

Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Knowledge Distillation with Perturbed Loss: From a Vanilla Teacher to a Proxy Teacher.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Human Alignment of Large Language Models through Online Preference Optimisation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Statistical Rejection Sampling Improves Preference Optimization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Multilingual Fine-Grained News Headline Hallucination Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

VIEWS: Entity-Aware News Video Captioning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Predicting Text Preference Via Structured Comparative Reasoning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Video Summarization: Towards Entity-Aware Captions.
CoRR, 2023

On What Basis? Predicting Text Preference Via Structured Comparative Reasoning.
CoRR, 2023

Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting.
CoRR, 2023

SLiC-HF: Sequence Likelihood Calibration with Human Feedback.
CoRR, 2023

Do Not Blindly Imitate the Teacher: Using Perturbed Loss for Knowledge Distillation.
CoRR, 2023

2022
All Birds with One Stone: Multi-task Text Classification for Efficient Inference with One Forward Pass.
CoRR, 2022

2021
NewsEmbed: Modeling News through Pre-trained Document Representations.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Training ELECTRA Augmented with Multi-word Selection.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021


  Loading...