Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not.

[BibT_eX]

[DOI]

Francesco Verdini

Pierfrancesco Melucci

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Granary: Speech Recognition and Translation Dataset in 25 European Languages.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

2024

Findings of the IWSLT 2024 Evaluation Campaign.

[BibT_eX]

[DOI]

CoRR, 2024

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages.

[BibT_eX]

[DOI]

CoRR, 2024

SimulSeamless: FBK at IWSLT 2024 Simultaneous Speech Translation.

[BibT_eX]

[DOI]

CoRR, 2024

Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MOSEL: 950, 000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

How Do Hyenas Deal with Human Speech? Speech Recognition and Translation with ConfHyena.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

MAGNET - MAchines GeNErating Translations: A CALAMITA Challenge.

[BibT_eX]

[DOI]

Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), 2024

When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SBAAM! Eliminating Transcript Dependency in Automatic Subtitling.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Direct Speech Translation for Automatic Subtitling.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2023

Reproducibility is Nothing without Correctness: The Importance of Testing Code in NLP.

[BibT_eX]

[DOI]

CoRR, 2023

Direct Models for Simultaneous Translation and Automatic Subtitling: FBK@IWSLT2023.

[BibT_eX]

[DOI]

Sara Papi

Marco Gaido

Matteo Negri

Proceedings of the 20th International Conference on Spoken Language Translation, 2023

AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation.

[BibT_eX]

[DOI]

Sara Papi

Marco Turchi

Matteo Negri

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Joint Speech Translation and Named Entity Recognition.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Attention as a Guide for Simultaneous Speech Translation.

[BibT_eX]

[DOI]

Sara Papi

Matteo Negri

Marco Turchi

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Over-Generation Cannot Be Rewarded: Length-Adaptive Average Lagging for Simultaneous Speech Translation.

[BibT_eX]

[DOI]

CoRR, 2022

Efficient yet Competitive Speech Translation: FBK@IWSLT2022.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Dodging the Data Bottleneck: Automatic Subtitling with Automatically Segmented ST Corpora.

[BibT_eX]

[DOI]

Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Does Simultaneous Speech Translation need Simultaneous Models?

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021

Simultaneous Speech Translation for Live Subtitling: from Delay to Display.

[BibT_eX]

[DOI]