Aswin RRV

According to our database, Aswin RRV authored at least 11 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Mid-Training with Self-Generated Data Improves Reinforcement Learning in Language Models.
CoRR, May, 2026

Vocabulary Dropout for Curriculum Diversity in LLM Co-Evolution.
CoRR, April, 2026

2025
PHANTOM RECALL: When Familiar Puzzles Fool Smart Models.
CoRR, October, 2025

GuidedSampling: Steering LLMs Towards Diverse Candidate Solutions at Inference-Time.
CoRR, October, 2025

Triple Preference Optimization: Achieving Better Alignment using a Single Step Optimization.
Trans. Mach. Learn. Res., 2025

ToW: Thoughts of Words Improve Reasoning in Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

ThinkTuning: Instilling Cognitive Reflections without Distillation.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024
Triple Preference Optimization: Achieving Better Alignment with Less Data in a Single Step Optimization.
CoRR, 2024

Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Chaos with Keywords: Exposing Large Language Models Sycophancy to Misleading Keywords and Evaluating Defense Strategies.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

