Sadegh Mahdavi
According to our database1,
Sadegh Mahdavi authored at least 12 papers
between 2023 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
Advantage Shaping as Surrogate Reward Maximization: Unifying Pass@K Policy Gradients.
Trans. Mach. Learn. Res., 2026
2025
Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision.
CoRR, December, 2025
Scaling Generative Verifiers For Natural Language Mathematical Proof Verification And Selection.
CoRR, November, 2025
CoRR, July, 2025
Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation.
Proceedings of the Forty-second International Conference on Machine Learning, 2025
2024
Leveraging Environment Interaction for Automated PDDL Generation and Planning with Large Language Models.
CoRR, 2024
Leveraging Environment Interaction for Automated PDDL Translation and Planning with Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Revisiting the Equivalence of In-Context Learning and Gradient Descent: The Impact of Data Distribution.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Towards Better Out-of-Distribution Generalization of Neural Algorithmic Reasoning Tasks.
Trans. Mach. Learn. Res., 2023