Misha Khalman

According to our database, Misha Khalman authored at least 9 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
LiPO: Listwise Preference Optimization through Learning-to-Rank.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Building Math Agents with Multi-Turn Iterative Preference Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Direct Language Model Alignment from Online AI Feedback.
CoRR, 2024

Statistical Rejection Sampling Improves Preference Optimization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Gemini: A Family of Highly Capable Multimodal Models.
CoRR, 2023

Calibrating Likelihoods towards Consistency in Summarization Models.
CoRR, 2023

SLiC-HF: Sequence Likelihood Calibration with Human Feedback.
CoRR, 2023

Calibrating Sequence likelihood Improves Conditional Language Generation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2021
ForumSum: A Multi-Speaker Conversation Summarization Dataset.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

