We stand with Ukraine

We stand with Ukraine

Tianqi Liu

Orcid: 0000-0003-4497-3317

Affiliations:

Google DeepMind

According to our database¹, Tianqi Liu authored at least 28 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on orcid.org

On csauthors.net:

Bibliography

2025

Harnessing Pairwise Ranking Prompting Through Sample-Efficient Ranking Distillation.

[BibT_eX]

[DOI]

,

,

,

,

Paul Suganthan G. C.

,

,

,

,

Harrie Oosterhuis

CoRR, July, 2025

LiPO: Listwise Preference Optimization through Learning-to-Rank.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Simon Baumgartner

,

,

,

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Reward-Guided Prompt Evolving in Reinforcement Learning for LLMs.

[BibT_eX]

[DOI]

,

Rishabh Agarwal

,

,

,

Sarmishta Velury

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Building Math Agents with Multi-Turn Iterative Preference Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Daniele Calandriello

,

,

,

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RRM: Robust Reward Model Training Mitigates Reward Hacking.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Anastasia Makarova

,

Jeremiah Zhe Liu

,

,

,

Abe Ittycheriah

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Evolving Alignment via Asymmetric Self-Play.

[BibT_eX]

[DOI]

,

Rishabh Agarwal

,

,

,

Sarmishta Velury

,

,

,

CoRR, 2024

RRM: Robust Reward Model Training Mitigates Reward Hacking.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Anastasiia Makarova

,

Jeremiah Z. Liu

,

,

,

Abe Ittycheriah

,

,

CoRR, 2024

LAMPO: Large Language Models as Preference Machines for Few-shot Ordinal Classification.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Boosting Reward Model with Preference-Conditional Multi-Aspect Synthetic Data Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Simon Baumgartner

,

Michael Bendersky

CoRR, 2024

Offline Regularised Reinforcement Learning for Large Language Models Alignment.

[BibT_eX]

[DOI]

Pierre Harvey Richemond

,

,

,

Daniele Calandriello

,

Mohammad Gheshlaghi Azar

,

Rafael Rafailov

,

Bernardo Ávila Pires

,

Eugene Tarassov

,

,

,

Aliaksei Severyn

,

Jonathan Mallinson

,

,

,

,

,

,

CoRR, 2024

Direct Language Model Alignment from Online AI Feedback.

[BibT_eX]

[DOI]

,

,

,

,

,

Felipe Llinares

,

Alexandre Ramé

,

,

,

,

,

Mathieu Blondel

CoRR, 2024

Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Michael Bendersky

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Knowledge Distillation with Perturbed Loss: From a Vanilla Teacher to a Proxy Teacher.

[BibT_eX]

[DOI]

,

,

,

,

Michael Bendersky

,

,

Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Human Alignment of Large Language Models through Online Preference Optimisation.

[BibT_eX]

[DOI]

Daniele Calandriello

,

Zhaohan Daniel Guo

,

,

,

,

Bernardo Ávila Pires

,

Pierre Harvey Richemond

,

Charline Le Lan

,

,

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Statistical Rejection Sampling Improves Preference Optimization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Multilingual Fine-Grained News Headline Hallucination Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

Simon Baumgartner

,

Michael Bendersky

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

VIEWS: Entity-Aware News Video Captioning.

[BibT_eX]

[DOI]

Hammad A. Ayyubi

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Simon Baumgartner

,

Michael Bendersky

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning.

[BibT_eX]

[DOI]

,

,

,

,

Jing Nathan Yan

,

,

,

Michael Bendersky

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Predicting Text Preference Via Structured Comparative Reasoning.

[BibT_eX]

[DOI]

Jing Nathan Yan

,

,

,

,

,

,

Charumathi Lakshmanan

,

,

Alexander M. Rush

,

,

Michael Bendersky

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Video Summarization: Towards Entity-Aware Captions.

[BibT_eX]

[DOI]

Hammad A. Ayyubi

,

,

,

,

,

,

,

,

,

CoRR, 2023

On What Basis? Predicting Text Preference Via Structured Comparative Reasoning.

[BibT_eX]

[DOI]

Jing Nathan Yan

,

,

,

,

,

,

,

Charu Lakshmanan

,

,

Alexander M. Rush

,

,

Michael Bendersky

CoRR, 2023

Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Michael Bendersky

CoRR, 2023

SLiC-HF: Sequence Likelihood Calibration with Human Feedback.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

Do Not Blindly Imitate the Teacher: Using Perturbed Loss for Knowledge Distillation.

[BibT_eX]

[DOI]

,

,

,

,

Michael Bendersky

,

,

CoRR, 2023

2022

All Birds with One Stone: Multi-task Text Classification for Efficient Inference with One Forward Pass.

[BibT_eX]

[DOI]

,

,

,

Ádám D. Lelkes

,

,

CoRR, 2022

2021

NewsEmbed: Modeling News through Pre-trained Document Representations.

[BibT_eX]

[DOI]

,

,

Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Training ELECTRA Augmented with Multi-word Selection.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Loading...