Juan Pino

Affiliations:
  • Meta AI
  • University of Cambridge, Department of Engineering, UK (former)
  • Carnegie Mellon University, Language Technologies Institute, Pittsburgh, PA, USA (foremr)


According to our database1, Juan Pino authored at least 70 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception.
CoRR, 2024

SpiRit-LM: Interleaved Spoken and Written Language Model.
CoRR, 2024

2023
Seamless: Multilingual Expressive and Streaming Speech Translation.
CoRR, 2023

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.
CoRR, 2023

Multilingual Speech-to-Speech Translation into Multiple Target Languages.
CoRR, 2023

Exploration on HuBERT with Multiple Resolutions.
CoRR, 2023

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation.
CoRR, 2023


Pre-training for Speech Translation: CTC Meets Optimal Transport.
Proceedings of the International Conference on Machine Learning, 2023

Enhancing Speech-To-Speech Translation with Multiple TTS Targets.
Proceedings of the IEEE International Conference on Acoustics, 2023

ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023

Simple and Effective Unsupervised Speech Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Speech-to-Speech Translation for a Real-world Unwritten Language.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Textless Speech-to-Speech Translation on Real Data.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022


Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation.
Proceedings of the Interspeech 2022, 2022

From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation.
Proceedings of the Interspeech 2022, 2022

XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale.
Proceedings of the Interspeech 2022, 2022

Unified Speech-Text Pre-training for Speech Translation and Recognition.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Direct Speech-to-Speech Translation With Discrete Units.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Textless Speech-to-Speech Translation on Real Data.
CoRR, 2021

Direct simultaneous speech to speech translation.
CoRR, 2021

Incremental Speech Synthesis For Speech-To-Speech Translation.
CoRR, 2021

Direct speech-to-speech translation with discrete units.
CoRR, 2021

Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021


Large-Scale Self- and Semi-Supervised Learning for Speech Translation.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

CoVoST 2 and Massively Multilingual Speech Translation.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2021

Streaming Simultaneous Speech Translation with Augmented Memory Transformer.
Proceedings of the IEEE International Conference on Acoustics, 2021

fairseq S\^2: A Scalable and Integrable Speech Synthesis Toolkit.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2021

VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Multilingual Speech Translation from Efficient Finetuning of Pretrained Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Lightweight Adapter Tuning for Multilingual Speech Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Cross-Modal Transfer Learning for Multilingual Speech-to-Text Translation.
CoRR, 2020

CoVoST 2: A Massively Multilingual Speech-to-Text Translation Corpus.
CoRR, 2020

Findings of the WMT 2020 Shared Task on Machine Translation Robustness.
Proceedings of the Fifth Conference on Machine Translation, 2020

CoVoST: A Diverse Multilingual Speech-To-Text Translation Corpus.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020


Self-Supervised Representations Improve End-to-End Speech Translation.
Proceedings of the Interspeech 2020, 2020

Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation.
Proceedings of the Interspeech 2020, 2020

Self-Training for End-to-End Speech Translation.
Proceedings of the Interspeech 2020, 2020

Fairseq S2T: Fast Speech-to-Text Modeling with Fairseq.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing: System Demonstrations, 2020

SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End Simultaneous Speech Translation.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Monotonic Multihead Attention.
Proceedings of the 8th International Conference on Learning Representations, 2020

SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

SIMULEVAL: An Evaluation Toolkit for Simultaneous Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Leveraging Out-of-Task Data for End-to-End Automatic Speech Translation.
CoRR, 2019

Two New Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English.
CoRR, 2019

Findings of the First Shared Task on Machine Translation Robustness.
Proceedings of the Fourth Conference on Machine Translation, 2019

Findings of the WMT 2019 Shared Task on Parallel Corpus Filtering for Low-Resource Conditions.
Proceedings of the Fourth Conference on Machine Translation, 2019

On Evaluation of Adversarial Perturbations for Sequence-to-Sequence Models.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Harnessing Indirect Training Data for End-to-End Automatic Speech Translation: Tricks of the Trade.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

The FLORES Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018
Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications.
CoRR, 2018

2013
The University of Cambridge Russian-English System at WMT13.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

2012
Simple and Efficient Model Filtering in Statistical Machine Translation.
Prague Bull. Math. Linguistics, 2012

2010
The CUED HiFST System for the WMT10 Translation Shared Task.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

Hierarchical Phrase-Based Translation Grammars Extracted from Alignment Posterior Probabilities.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

2009
Semi-automatic generation of cloze question distractors effect of students' L1.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2009

REAP.PT, a tutoring system for teaching Portuguese.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2009

Porting REAP to European Portuguese.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2009

Measuring Hint Level in Open Cloze Questions.
Proceedings of the Twenty-Second International Florida Artificial Intelligence Research Society Conference, 2009

An Application of Latent Semantic Analysis to Word Sense Discrimination for Words with Related and Unrelated Meanings.
Proceedings of the Fourth Workshop on Innovative Use of NLP for Building Educational Applications, 2009


  Loading...