Rahul Madhavan

According to our database1, Rahul Madhavan authored at least 12 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Robust Reward Modeling via Causal Rubrics.
CoRR, June, 2025

AMPO: Active Multi-Preference Optimization.
CoRR, February, 2025

CARMO: Dynamic Criteria Generation for Context Aware Reward Modelling.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
REFA: Reference Free Alignment for multi-preference optimization.
CoRR, 2024

SWEPO: Simultaneous Weighted Preference Optimization for Group Contrastive Alignment.
CoRR, 2024

Causal Contextual Bandits with Adaptive Context.
RLJ, 2024

Time-Reversal Provides Unsupervised Feedback to LLMs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
Causal ATE Mitigates Unintended Bias in Controlled Text Generation.
CoRR, 2023

Learning good interventions in causal graphs via covering.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

CFL: Causally Fair Language Models Through Token-level Attribute Controlled Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2021
Intervention Efficient Algorithm for Two-Stage Causal MDPs.
CoRR, 2021

Scale Invariant Solutions for Overdetermined Linear Systems with Applications to Reinforcement Learning.
CoRR, 2021


  Loading...