Tianqi Liu

Orcid: 0000-0003-4497-3317

Affiliations:
  • Google DeepMind


According to our database1, Tianqi Liu authored at least 28 papers between 2021 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Harnessing Pairwise Ranking Prompting Through Sample-Efficient Ranking Distillation.
CoRR, July, 2025

Gemma 3 Technical Report.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, March, 2025

LiPO: Listwise Preference Optimization through Learning-to-Rank.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Building Math Agents with Multi-Turn Iterative Preference Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RRM: Robust Reward Model Training Mitigates Reward Hacking.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Evolving Alignment via Asymmetric Self-Play.
CoRR, 2024

RRM: Robust Reward Model Training Mitigates Reward Hacking.
CoRR, 2024

LAMPO: Large Language Models as Preference Machines for Few-shot Ordinal Classification.
CoRR, 2024

Boosting Reward Model with Preference-Conditional Multi-Aspect Synthetic Data Generation.
CoRR, 2024

Offline Regularised Reinforcement Learning for Large Language Models Alignment.
CoRR, 2024

Direct Language Model Alignment from Online AI Feedback.
CoRR, 2024

Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Knowledge Distillation with Perturbed Loss: From a Vanilla Teacher to a Proxy Teacher.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Human Alignment of Large Language Models through Online Preference Optimisation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Statistical Rejection Sampling Improves Preference Optimization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Multilingual Fine-Grained News Headline Hallucination Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

VIEWS: Entity-Aware News Video Captioning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Predicting Text Preference Via Structured Comparative Reasoning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Video Summarization: Towards Entity-Aware Captions.
CoRR, 2023

On What Basis? Predicting Text Preference Via Structured Comparative Reasoning.
CoRR, 2023

Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting.
CoRR, 2023

SLiC-HF: Sequence Likelihood Calibration with Human Feedback.
CoRR, 2023

Do Not Blindly Imitate the Teacher: Using Perturbed Loss for Knowledge Distillation.
CoRR, 2023

2022
All Birds with One Stone: Multi-task Text Classification for Efficient Inference with One Forward Pass.
CoRR, 2022

2021
NewsEmbed: Modeling News through Pre-trained Document Representations.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Training ELECTRA Augmented with Multi-word Selection.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021


  Loading...