Sadegh Mahdavi

According to our database, Sadegh Mahdavi authored at least 12 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Learning Generative Selection for Best-of-N.
CoRR, February, 2026

Advantage Shaping as Surrogate Reward Maximization: Unifying Pass@K Policy Gradients.
Trans. Mach. Learn. Res., 2026

2025
Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision.
CoRR, December, 2025

Scaling Generative Verifiers For Natural Language Mathematical Proof Verification And Selection.
CoRR, November, 2025

The Challenge of Teaching Reasoning to LLMs Without RL or Distillation.
CoRR, July, 2025

Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
From Graph Diffusion to Graph Classification.
CoRR, 2024

Leveraging Environment Interaction for Automated PDDL Generation and Planning with Large Language Models.
CoRR, 2024

Leveraging Environment Interaction for Automated PDDL Translation and Planning with Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Memorization Capacity of Multi-Head Attention in Transformers.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Revisiting the Equivalence of In-Context Learning and Gradient Descent: The Impact of Data Distribution.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Towards Better Out-of-Distribution Generalization of Neural Algorithmic Reasoning Tasks.
Trans. Mach. Learn. Res., 2023

