Sara Papi

Orcid: 0000-0002-4494-8886

Affiliations:
  • Fondazione Bruno Kessler, Trento, Italy


According to our database, Sara Papi authored at least 37 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks.
CoRR, July, 2025

The Warmup Dilemma: How Learning Rate Strategies Impact Speech-to-Text Model Convergence.
CoRR, May, 2025

FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian.
CoRR, May, 2025

Granary: Speech Recognition and Translation Dataset in 25 European Languages.
CoRR, May, 2025

NUTSHELL: A Dataset for Abstract Generation from Scientific Talks.
CoRR, February, 2025

How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System?
Trans. Assoc. Comput. Linguistics, 2025

Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

2024
Findings of the IWSLT 2024 Evaluation Campaign.
CoRR, 2024

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages.
CoRR, 2024

How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not.
CoRR, 2024

SimulSeamless: FBK at IWSLT 2024 Simultaneous Speech Translation.
CoRR, 2024

Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

How Do Hyenas Deal with Human Speech? Speech Recognition and Translation with ConfHyena.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

MAGNET - MAchines GeNErating Translations: A CALAMITA Challenge.
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), 2024

When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SBAAM! Eliminating Transcript Dependency in Automatic Subtitling.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Direct Speech Translation for Automatic Subtitling.
Trans. Assoc. Comput. Linguistics, 2023

Reproducibility is Nothing without Correctness: The Importance of Testing Code in NLP.
CoRR, 2023

Direct Models for Simultaneous Translation and Automatic Subtitling: FBK@IWSLT2023.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Joint Speech Translation and Named Entity Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Attention as a Guide for Simultaneous Speech Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Over-Generation Cannot Be Rewarded: Length-Adaptive Average Lagging for Simultaneous Speech Translation.
CoRR, 2022

Efficient yet Competitive Speech Translation: FBK@IWSLT2022.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Dodging the Data Bottleneck: Automatic Subtitling with Automatically Segmented ST Corpora.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Does Simultaneous Speech Translation need Simultaneous Models?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Simultaneous Speech Translation for Live Subtitling: from Delay to Display.
Proceedings of the 1st Workshop on Automatic Spoken Language Translation in Real-World Settings, 2021

Dealing with training and test segmentation mismatch: FBK@IWSLT2021.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Speechformer: Reducing Information Loss in Direct Speech Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Visualization: The Missing Factor in Simultaneous Speech Translation.
Proceedings of the Eighth Italian Conference on Computational Linguistics, 2021

2020
Mixtures of Deep Neural Experts for Automated Speech Scoring.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

