Soumye Singhal

According to our database1, Soumye Singhal authored at least 10 papers between 2019 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Llama-Nemotron: Efficient Reasoning Models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, May, 2025

Adversarial Training of Reward Models.
CoRR, April, 2025

Reward-aware Preference Optimization: A Unified Mathematical Framework for Model Alignment.
CoRR, February, 2025

Effective Backdoor Mitigation in Vision-Language Models Depends on the Pre-training Objective.
Trans. Mach. Learn. Res., 2025

2023
Effective Backdoor Mitigation Depends on the Pre-training Objective.
CoRR, 2023

2022
Multi-label Iterated Learning for Image Classification with Label Ambiguity.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2020
Jointly Trained Image and Video Generation using Residual Vectors.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Countering Language Drift with Seeded Iterated Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Supervised Seeded Iterated Learning for Interactive Language Learning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Recall Traces: Backtracking Models for Efficient Reinforcement Learning.
Proceedings of the 7th International Conference on Learning Representations, 2019


  Loading...