We stand with Ukraine

We stand with Ukraine

Shubham Toshniwal

According to our database¹, Shubham Toshniwal authored at least 34 papers between 2015 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

GenSelect: A Generative Approach to Best-of-N.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

,

Aleksander Ficek

,

,

CoRR, July, 2025

The Challenge of Teaching Reasoning to LLMs Without RL or Distillation.

[BibT_eX]

[DOI]

CoRR, July, 2025

AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset.

[BibT_eX]

[DOI]

,

,

,

Shubham Toshniwal

,

Christof Henkel

,

Benedikt Schifferer

,

,

CoRR, April, 2025

IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark for LLMs.

[BibT_eX]

[DOI]

Kawshik Manikantan

,

Makarand Tapaswi

,

,

Shubham Toshniwal

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

,

,

Branislav Kisacanin

,

Alexan Ayrapetyan

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark.

[BibT_eX]

[DOI]

Kawshik Manikantan

,

Makarand Tapaswi

,

,

Shubham Toshniwal

CoRR, 2024

Major Entity Identification: A Generalizable Alternative to Coreference Resolution.

[BibT_eX]

[DOI]

Kawshik Manikantan

,

Shubham Toshniwal

,

Makarand Tapaswi

,

CoRR, 2024

Nemotron-4 340B Technical Report.

[BibT_eX]

[DOI]

,

,

,

,

Pallab Bhattacharya

,

,

,

Bryan Catanzaro

,

,

Jonathan M. Cohen

,

,

Ayush Dattagupta

,

Olivier Delalleau

,

Leon Derczynski

,

,

,

,

Aleksander Ficek

,

,

,

,

,

Tomasz Grzegorzek

,

,

,

,

Joseph Jennings

,

Aastha Jhunjhunwala

,

,

,

Oleksii Kuchaiev

,

Patrick LeGresley

,

,

,

,

,

Ameya Sunil Mahabaleshwarkar

,

Somshubra Majumdar

,

,

Miguel Martinez

,

Maer Rodrigues de Melo

,

,

Deepak Narayanan

,

Sean Narenthiran

,

,

,

,

,

Guruprasad Nutheti

,

Christopher Parisien

,

Jupinder Parmar

,

Mostofa Patwary

,

Krzysztof Pawelec

,

,

Shrimai Prabhumoye

,

,

,

Vasanth Rao Naik Sabavat

,

Sanjeev Satheesh

,

Jane Polak Scowcroft

,

,

,

,

Mohammad Shoeybi

,

,

Misha Smelyanskiy

,

,

Makesh Narsimhan Sreedhar

,

,

Sandeep Subramanian

,

,

Shubham Toshniwal

,

,

,

,

,

,

,

,

,

CoRR, 2024

Code Pretraining Improves Entity Tracking Abilities of Language Models.

[BibT_eX]

[DOI]

,

Sebastian Schuster

,

Shubham Toshniwal

CoRR, 2024

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

,

Sean Narenthiran

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Major Entity Identification: A Generalizable Alternative to Coreference Resolution.

[BibT_eX]

[DOI]

,

Shubham Toshniwal

,

Makarand Tapaswi

,

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

[BibT_eX]

[DOI]

Aarohi Srivastava

,

Abhinav Rastogi

,

,

Abu Awal Md Shoeb

,

,

,

,

,

,

Adrià Garriga-Alonso

,

Agnieszka Kluska

,

Aitor Lewkowycz

,

,

,

,

,

Alexander W. Kocurek

,

,

,

,

,

,

,

,

,

,

,

Anantharaman S. Iyer

,

Anders Andreassen

,

,

Andrea Santilli

,

Andreas Stuhlmüller

,

,

,

Andrew K. Lampinen

,

,

,

,

,

,

,

Antonio Norelli

,

,

Arash Gholamidavoodi

,

,

,

Arun Kirubarajan

,

Asher Mullokandov

,

Ashish Sabharwal

,

,

,

,

,

B. Ryan Roberts

,

,

,

Bartlomiej Bojanowski

,

Batuhan Özyurt

,

Behnam Hedayatnia

,

Behnam Neyshabur

,

,

,

,

Bill Yuchen Lin

,

,

,

,

,

Catherine Stinson

,

Cedrick Argueta

,

Cèsar Ferri Ramírez

,

,

Charles Rathkopf

,

,

,

,

Chris Callison-Burch

,

,

Christian Voigt

,

Christopher D. Manning

,

Christopher Potts

,

,

Clara E. Rivera

,

,

,

Courtney Ashcraft

,

Cristina Garbacea

,

,

,

,

,

,

,

Daniel Khashabi

,

,

Daniel Moseguí González

,

Danielle Perszyk

,

Danny Hernandez

,

,

Daphne Ippolito

,

,

,

,

,

Debajyoti Datta

,

,

,

,

,

,

,

,

,

,

Dimitri Coelho Mollo

,

,

,

,

Ekaterina Shutova

,

Ekin Dogus Cubuk

,

,

Eleanor Hagerman

,

Elizabeth Barnes

,

Elizabeth Donoway

,

,

Emanuele Rodolà

,

,

,

,

,

,

,

,

Ethan J. Jerzak

,

,

Eunice Engefu Manyasi

,

Evgenii Zheltonozhskii

,

,

,

Fernando Martínez-Plumed

,

Francesca Happé

,

François Chollet

,

,

,

Genta Indra Winata

,

,

Germán Kruszewski

,

Giambattista Parascandolo

,

Giorgio Mariani

,

,

Gonzalo Jaimovitch-López

,

,

,

Hana Galijasevic

,

,

,

Hannaneh Hajishirzi

,

,

,

,

Hinrich Schütze

,

,

,

,

,

,

,

Jack Geissinger

,

Jackson Kernion

,

,

,

Jaime Fernández Fisac

,

,

,

,

,

,

,

Janelle Wingfield

,

,

,

Jascha Sohl-Dickstein

,

,

,

,

Jekaterina Novikova

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Jonathan Batchelder

,

Jonathan Berant

,

,

,

José Hernández-Orallo

,

Joseph Boudeman

,

,

,

Joshua B. Tenenbaum

,

,

,

,

,

,

Karthik Gopalakrishnan

,

Katerina Ignatyeva

,

,

Kaustubh D. Dhole

,

,

,

Kory W. Mathewson

,

Kristen Chiafullo

,

Ksenia Shkaruta

,

,

,

Kyle Richardson

,

,

,

,

,

,

Lidia Contreras Ochando

,

Louis-Philippe Morency

,

,

,

,

,

,

Luis Oliveros Colón

,

,

Lütfi Kerem Senel

,

,

,

Maartje ter Hoeve

,

,

,

,

,

,

,

María José Ramírez-Quintana

,

,

Mario Giulianelli

,

,

Martin Potthast

,

Matthew L. Leavitt

,

,

Mátyás Schubert

,

Medina Baitemirova

,

,

Melvin McElrath

,

,

,

,

Michael I. Ivanitskiy

,

Michael Starritt

,

,

Michal Swedrowski

,

Michele Bevilacqua

,

Michihiro Yasunaga

,

,

,

,

,

,

,

,

Moin Aminnaseri

,

,

,

Mukund Varma T.

,

,

,

,

Neta Gur-Ari Krakover

,

Nicholas Cameron

,

Nicholas Roberts

,

,

Nicole Martinez

,

,

,

Niklas Muennighoff

,

Nitish Shirish Keskar

,

,

,

,

,

,

,

Omar Elbaghdadi

,

,

,

Pablo Antonio Moreno Casares

,

,

,

,

,

Pegah Alipoormolabashi

,

,

,

,

Peter Eckersley

,

,

,

Piotr Milkowski

,

,

Pouya Pezeshkpour

,

,

,

,

,

,

Rachel Etta Rudolph

,

,

,

,

Raphaël Millière

,

,

,

,

,

Robbe Raymaekers

,

,

,

,

,

,

,

,

,

Ruslan Salakhutdinov

,

,

,

,

,

,

,

Saif M. Mohammad

,

,

,

,

,

Samuel Gruetter

,

Samuel R. Bowman

,

Samuel S. Schoenholz

,

,

,

,

Sarik Ghazarian

,

,

,

Sebastian Bischoff

,

Sebastian Gehrmann

,

Sebastian Schuster

,

Sepideh Sadeghi

,

,

,

Shashank Srivastava

,

,

,

,

Shixiang Shane Gu

,

Shubh Pachchigar

,

Shubham Toshniwal

,

,

Shyamolima (Shammie) Debnath

,

,

Simon Thormeyer

,

,

,

Sneha Priscilla Makini

,

,

,

Sriharsha Hatwar

,

Stanislas Dehaene

,

,

,

Stella Biderman

,

,

,

Steven T. Piantadosi

,

Stuart M. Shieber

,

Summer Misherghi

,

Svetlana Kiritchenko

,

,

,

,

,

,

,

Tatsu Hashimoto

,

,

Théo Desbordes

,

Theodore Rothschild

,

,

,

Tiberius Nkinyili

,

,

,

,

Tobias Gerstenberg

,

,

Trishala Neeraj

,

,

,

,

,

,

Victoria Nyamai

,

,

Vinay V. Ramasesh

,

Vinay Uday Prabhu

,

Vishakh Padmakumar

,

,

,

William Saunders

,

,

,

,

,

,

,

,

Yadollah Yaghoobzadeh

,

,

,

,

,

,

,

,

Yonatan Belinkov

,

,

,

,

,

,

,

,

,

Trans. Mach. Learn. Res., 2023

Learning to Reason and Memorize with Self-Notes.

[BibT_eX]

[DOI]

Jack Lanchantin

,

Shubham Toshniwal

,

,

,

Sainbayar Sukhbaatar

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Adapting Pretrained Text-to-Text Models for Long Text Sequences.

[BibT_eX]

[DOI]

,

,

Shubham Toshniwal

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Robustness of Named-Entity Replacements for In-Context Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Shubham Toshniwal

,

,

Jack Lanchantin

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022

Efficient and Interpretable Neural Models for Entity Tracking.

[BibT_eX]

[DOI]

Shubham Toshniwal

CoRR, 2022

Baked-in State Probing.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Chess as a Testbed for Language Model State Tracking.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

,

,

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

On Generalization in Coreference Resolution.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

,

,

,

CoRR, 2021

Learning Chess Blindfolded: Evaluating Language Models on State Tracking.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

,

,

CoRR, 2021

2020

A Cross-Task Analysis of Text Span Representations.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

,

,

,

,

Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

,

Allyson Ettinger

,

,

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

PeTra: A Sparsely Supervised Memory Model for People Tracking.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

Allyson Ettinger

,

,

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Tara N. Sainath

,

,

Chung-Cheng Chiu

,

,

,

,

Stella Laurenzo

,

,

,

Wolfgang Macherey

,

,

,

,

,

,

Rohit Prabhavalkar

,

,

,

,

,

,

Sébastien Jean

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Kuan-Chieh Wang

,

Ekaterina Gonina

,

,

,

,

,

,

,

,

,

George F. Foster

,

John Richardson

,

,

Antoine Bruguier

,

,

,

,

,

,

,

Vijayaditya Peddinti

,

,

Michiel Bacchiani

,

Thomas B. Jablin

,

Robert Suderman

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Dmitry Lepikhin

,

,

,

,

Shubham Toshniwal

,

,

Michael Nirschl

,

CoRR, 2019

Pre-Trained Text Embeddings for Enhanced Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

,

Shinji Watanabe

,

,

,

Shubham Toshniwal

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018

Hierarchical Multitask Learning for CTC-based Speech Recognition.

[BibT_eX]

[DOI]

Kalpesh Krishna

,

Shubham Toshniwal

,

CoRR, 2018

A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

,

Chung-Cheng Chiu

,

,

Tara N. Sainath

,

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Parsing Speech: a Neural Approach to Integrating Lexical and Acoustic-Prosodic Information.

[BibT_eX]

[DOI]

,

Shubham Toshniwal

,

,

,

,

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Multilingual Speech Recognition with a Single End-to-End Model.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

Tara N. Sainath

,

,

,

Pedro J. Moreno

,

Eugene Weinstein

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Joint Modeling of Text and Acoustic-Prosodic Cues for Neural Parsing.

[BibT_eX]

[DOI]

,

Shubham Toshniwal

,

,

,

,

CoRR, 2017

Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

,

,

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

Jointly learning to align and convert graphemes to phonemes with neural attention models.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

2015

VibRein: An Engaging and Assistive Mobile Learning Companion for Students with Intellectual Disabilities.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

,

Nitendra Rajput

,

Saurabh Srivastava

Proceedings of the Annual Meeting of the Australian Special Interest Group for Computer Human Interaction, 2015

USHER: An Intelligent Tour Companion.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

Parikshit Sharma

,

Saurabh Srivastava

,

Proceedings of the 20th International Conference on Intelligent User Interfaces Companion, 2015

Loading...