Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data.

[BibT_eX]

[DOI]

Shubham Toshniwal

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

2023

Confidence-based Ensembles of End-to-End Speech Recognition Models.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Powerful and Extensible WFST Framework for Rnn-Transducer Losses.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2019

Understanding the Role of Momentum in Stochastic Gradient Methods.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018

OpenSeq2Seq: extensible toolkit for distributed and mixed precision training of sequence-to-sequence models.

[BibT_eX]

[DOI]

CoRR, 2018

Novel Prediction Techniques Based on Clusterwise Linear Regression.

[BibT_eX]

[DOI]

CoRR, 2018

Convergence Analysis of Gradient Descent Algorithms with Proportional Updates.

[BibT_eX]

[DOI]

Igor Gitman

Deepak Dilipkumar

Ben Parr

CoRR, 2018

2017

Comparison of Batch Normalization and Weight Normalization Algorithms for the Large-scale Image Classification.

[BibT_eX]

[DOI]

Igor Gitman

Boris Ginsburg

CoRR, 2017

Scaling SGD Batch Size to 32K for ImageNet Training.

[BibT_eX]

[DOI]

Yang You

Igor Gitman

Boris Ginsburg

CoRR, 2017

Igor Gitman

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...