Siddharth Dalmia

Orcid: 0000-0003-0437-5988

According to our database¹, Siddharth Dalmia authored at least 44 papers between 2015 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

LOFT: Scalable and More Realistic Long-Context Evaluation.

[BibT_eX]

[DOI]

Devendra Singh Sachan

Michael Boratko

Yi Luan

Sébastien M. R. Arnold

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Revisiting In-Context Learning with Long Context Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?

[BibT_eX]

[DOI]

Devendra Singh Sachan

Michael Boratko

Yi Luan

Sébastien M. R. Arnold

CoRR, 2024

LLM Augmented LLMs: Expanding Capabilities through Composition.

[BibT_eX]

[DOI]

CoRR, 2024

Transforming LLMs into Cross-modal and Cross-lingual Retrieval Systems.

[BibT_eX]

[DOI]

Gustavo Hernández Ábrego

Proceedings of the 21st International Conference on Spoken Language Translation, 2024

LLM Augmented LLMs: Expanding Capabilities through Composition.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Multimodal Modeling for Spoken Language Identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Exploiting Compositionality in Sequence Models

[BibT_eX]

[DOI]

Siddharth Dalmia

PhD thesis, 2023

LegoNN: Building Modular Encoder-Decoder Models.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Multimodal Modeling For Spoken Language Identification.

[BibT_eX]

[DOI]

CoRR, 2023

Align, Write, Re-Order: Explainable End-to-End Speech Translation via Operation Sequence Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

CTC Alignments Improve Autoregressive Translation.

[BibT_eX]

[DOI]

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2023

2022

A Study on the Integration of Pre-Trained SSL, ASR, LM and SLU Models for Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

FLEURS: FEW-Shot Learning Evaluation of Universal Representations of Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

CMU's IWSLT 2022 Dialect Speech Translation System.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Two-Pass Low Latency End-to-End Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021

Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

ESPnet-ST IWSLT 2021 Offline Speech Translation System.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Differentiable Allophone Graphs for Language-Universal Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Transformer-Transducers for Code-Switched Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

NoiseQA: Challenge Set Evaluation for User-Centric Question Answering.

[BibT_eX]

[DOI]

Abhilasha Ravichander

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Universal Phone Recognition with a Multilingual Allophone System.

[BibT_eX]

[DOI]

Antonios Anastasopoulos

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

On Long-Tailed Phenomena in Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Towards Zero-Shot Learning for Automatic Phonemic Transcription.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Enforcing Encoder-Decoder Modularity in Sequence-to-Sequence Models.

[BibT_eX]

[DOI]

CoRR, 2019

The ARIEL-CMU Systems for LoReHLT18.

[BibT_eX]

[DOI]

CoRR, 2019

SANTLR: Speech Annotation Toolkit for Low Resource Languages.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multilingual Speech Recognition with Corpus Relatedness Sampling.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Cross-Attention End-to-End ASR for Two-Party Conversations.

[BibT_eX]

[DOI]

Suyoun Kim

Siddharth Dalmia

Florian Metze

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Phoneme Level Language Models for Sequence Based Low Resource ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion.

[BibT_eX]

[DOI]

Suyoun Kim

Siddharth Dalmia

Florian Metze

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

Domain Robust Feature Extraction for Rapid Low Resource ASR Development.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Epitran: Precision G2P for Many Languages.

[BibT_eX]

[DOI]

David R. Mortensen

Siddharth Dalmia

Patrick Littell

Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Sequence-Based Multi-Lingual Low Resource Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

A novel similarity measure: Voronoi audio similarity for genre classification.

[BibT_eX]

[DOI]

Int. J. Intell. Syst. Technol. Appl., 2017

An approach for self-training audio event detectors using web data.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

2015

Robust ASR using neural network based speech enhancement and feature simulation.

[BibT_eX]

[DOI]

Sunit Sivasankaran

Aditya Arie Nugraha

Emmanuel Vincent

Juan Andres Morales-Cordovilla

Siddharth Dalmia

Irina Illina

Antoine Liutkus

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Siddharth Dalmia

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...