Rafael Rafailov

According to our database1, Rafael Rafailov authored at least 15 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Disentangling Length from Quality in Direct Preference Optimization.
CoRR, 2024

Aligning Modalities in Vision Large Language Models via Preference Fine-tuning.
CoRR, 2024

2023
Diffusion Model Alignment Using Direct Preference Optimization.
CoRR, 2023

Contrastive Preference Learning: Learning from Human Feedback without RL.
CoRR, 2023

An Emulator for Fine-Tuning Large Language Models using Small Language Models.
CoRR, 2023

Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias.
CoRR, 2023

Direct Preference Optimization: Your Language Model is Secretly a Reward Model.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Contrastive Example-Based Control.
Proceedings of the Learning for Dynamics and Control Conference, 2023

Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning.
Proceedings of the Conference on Robot Learning, 2023

2022
Vision-Based Manipulators Need to Also See from Their Hands.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
COMBO: Conservative Offline Model-Based Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Visual Adversarial Imitation Learning using Variational Models.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Offline Reinforcement Learning from Images with Latent Space Models.
Proceedings of the 3rd Annual Conference on Learning for Dynamics and Control, 2021

Offline Meta-Reinforcement Learning with Advantage Weighting.
Proceedings of the 38th International Conference on Machine Learning, 2021


  Loading...