Aswin RRV
According to our database, Aswin RRV authored at least 11 papers between 2024 and 2026.
Bibliography
2026
Mid-Training with Self-Generated Data Improves Reinforcement Learning in Language Models.
CoRR, May, 2026
2025
CoRR, October, 2025
Triple Preference Optimization: Achieving Better Alignment using a Single Step Optimization.
Trans. Mach. Learn. Res., 2025
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
2024
Triple Preference Optimization: Achieving Better Alignment with Less Data in a Single Step Optimization.
CoRR, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Chaos with Keywords: Exposing Large Language Models Sycophancy to Misleading Keywords and Evaluating Defense Strategies.
Proceedings of the Findings of the Association for Computational Linguistics, 2024