Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music.

[BibT_eX]

[DOI]

Jiatong Shi

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

OpusLM: A Family of Open Unified Speech Language Models.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Scalable Spontaneous Speech Dataset (SSSD): Crowdsourcing Data Collection to Promote Dialogue Research.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Scheduled Interleaved Speech-Text Training for Speech-to-Speech Translation with LLMs.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Chain-of-Thought Training for Open E2E Spoken Dialogue Systems.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Context-aware Dynamic Pruning for Speech Foundation Models.

[BibT_eX]

[DOI]

Athanasios Mouchtaris

Grant P. Strimel

Jing Liu

Shinji Watanabe

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Hypothesis Clustering and Merging: Novel MultiTalker Speech Recognition with Speaker Tokens.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Spiralformer: Low Latency Encoder for Streaming Speech Recognition with Circular Layer Skipping and Early Exiting.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

VERSA-v2: A Modular and Scalable Toolkit for Speech and Audio Evaluation with Expanded Metrics, Visualization, and LLM Integration.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

AURA: Agent for Understanding, Reasoning, and Automated Tool Use in Voice-Driven Tasks.

[BibT_eX]

[DOI]

Leander Melroy Maben

Gayathri Ganesh Lakshmy

Srijith Radhakrishnan

Siddhant Arora

Shinji Watanabe

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

2024

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.

[BibT_eX]

[DOI]

Fabian Ritter Gutierrez

CoRR, 2024

Task Arithmetic for Language Expansion in Speech Translation.

[BibT_eX]

[DOI]

CoRR, 2024

ESPnet-EZ: Python-Only ESPnet For Easy Fine-Tuning And Integration.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Decoder-only Architecture for Streaming End-to-end Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

To what extent can ASV systems naturally defend against spoofing attacks?

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Dynamic-Superb: Towards a Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark For Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Phoneme-Aware Encoding for Prefix-Tree-Based Contextual ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Semi-Autoregressive Streaming ASR with Label Context.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Creation and Analysis of an International Corpus of Privacy Laws.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

On the Evaluation of Speech Foundation Models for Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network.

[BibT_eX]

[DOI]

CoRR, 2023

Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation.

[BibT_eX]

[DOI]

CoRR, 2023

CMU's IWSLT 2023 Simultaneous Speech Translation System.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Tensor decomposition for minimization of E2E SLU model toward on-device processing.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

BASS: Block-wise Adaptation for Speech Summarization.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

E-Branchformer-Based E2E SLU Toward Stop on-Device Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Streaming Joint Speech Recognition and Disfluency Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

A Study on the Integration of Pipeline and E2E SLU Systems for Spoken Semantic Parsing Toward Stop Quality Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Joint Modelling of Spoken Language Understanding Tasks with Integrated Dialog History.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Teaching Old DB Neu(ral) Tricks: Learning Embeddings on Multi-tabular Databases.

[BibT_eX]

[DOI]

Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD), 2023

Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Creation and Analysis of an International Corpus of Privacy Laws.

[BibT_eX]

[DOI]

CoRR, 2022

A Study on the Integration of Pre-Trained SSL, ASR, LM and SLU Models for Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

A Tale of Two Regulatory Regimes: Creation and Analysis of a Bilingual Privacy Policy Corpus.

[BibT_eX]

[DOI]

Siddhant Arora

Henry Hosseini

Christine Utz

Vinayshekhar Bannihatti Kumar

Tristan Dhellemmes

Abhilasha Ravichander

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Two-Pass Low Latency End-to-End Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Explain, Edit, and Understand: Rethinking User Study Design for Evaluating Model Explanations.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

BERT Meets Relational DB: Contextual Representations of Relational Databases.

[BibT_eX]

[DOI]

CoRR, 2021

Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

A Survey on Graph Neural Networks for Knowledge Graph Completion.

[BibT_eX]

[DOI]

Siddhant Arora

CoRR, 2020

On Embeddings in Relational Databases.

[BibT_eX]

[DOI]

Siddhant Arora

Srikanta Bedathur

CoRR, 2020

Capreolus: A Toolkit for End-to-End Neural Ad Hoc Retrieval.

[BibT_eX]

[DOI]

Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

IterefinE: Iterative KG Refinement Embeddings using Symbolic Knowledge.

[BibT_eX]

[DOI]

Proceedings of the Conference on Automated Knowledge Base Construction, 2020

2019

Understanding Community Rivalry on Social Media: A Case Study of Two Footballing Giants.

[BibT_eX]

[DOI]

Anandhavelu Natarajan

Proceedings of the Joint Proceedings of the ACM IUI 2019 Workshops co-located with the 24th ACM Conference on Intelligent User Interfaces (ACM IUI 2019), 2019

Investigating Retrieval Method Selection with Axiomatic Features.

[BibT_eX]

[DOI]

Siddhant Arora

Andrew Yates

Proceedings of the 1st Interdisciplinary Workshop on Algorithm Selection and Meta-Learning in Information Retrieval co-located with the 41st European Conference on Information Retrieval (ECIR 2019), 2019

2018

A Naive Deep Nets Based Approach for Authenticating Viral Textual Content on Social Media.

[BibT_eX]

[DOI]

Proceedings of the Intelligent Systems and Applications, 2018

Siddhant Arora

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...