Ann Lee

Affiliations:

Facebook, USA
Massachusetts Institute of Technology, Cambridge, USA (PhD 2016)

According to our database, Ann Lee authored at least 39 papers between 2012 and 2023.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2023

Seamless: Multilingual Expressive and Streaming Speech Translation.

…, Mariano Coria Meglioli, …, Mark Duppenthaler, Paul-Ambroise Duquenne, …, Hirofumi Inaguma, Christopher Klaiber, …, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, …, Guillaume Wenzek, …, Pierre Fernandez, …, Prangthip Hansanti, …, Artyom Kozhevnikov, Gabriel Mejia Gonzalez, Robin San Roman, Christophe Touret, …, Marta R. Costa-jussà, …, Francisco Guzmán, Kevin Heffernan, …, Alexandre Mourachko, Benjamin Peloquin, …, Christophe Ropers, Safiyyah Saleem, …, Paden Tomasello, …, Mary Williamson

CoRR, 2023

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.

Seamless Communication, …, Mariano Coria Meglioli, …, Paul-Ambroise Duquenne, …, Kevin Heffernan, …, Christopher Klaiber, …, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, …, Gabriel Mejia Gonzalez, …, Prangthip Hansanti, …, Hirofumi Inaguma, …, Ruslan Mavlyutov, Benjamin Peloquin, Mohamed Ramadan, Abinesh Ramakrishnan, …, Marta R. Costa-jussà, …, Francisco Guzmán, …, Alexandre Mourachko, …, Christophe Ropers, Safiyyah Saleem, …, Paden Tomasello, …

CoRR, 2023

Multilingual Speech-to-Speech Translation into Multiple Target Languages.

…, Vedanuj Goswami, …

CoRR, 2023

Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling.

…, Gabriel Synnaeve, Emmanuel Dupoux, …

Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Enhancing Speech-To-Speech Translation with Multiple TTS Targets.

…, Hirofumi Inaguma, …, Shinji Watanabe

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Bridging Speech and Textual Pre-Trained Models With Unsupervised ASR.

…, Shinji Watanabe, …

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation.

…, Benjamin Peloquin, …, Elizabeth Salesky, …

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units.

Hirofumi Inaguma, …, Shinji Watanabe, …

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations.

Paul-Ambroise Duquenne, …, Vedanuj Goswami, …

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Speech-to-Speech Translation for a Real-world Unwritten Language.

…, Paden Tomasello, Paul-Ambroise Duquenne, …, Hirofumi Inaguma, …

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

On The Robustness of Self-Supervised Representations for Spoken Language Modeling.

…, Gabriel Synnaeve, Emmanuel Dupoux, …

CoRR, 2022

textless-lib: a Library for Textless Spoken Language Processing.

Eugene Kharitonov, …, Kushal Lakhotia, …, Paden Tomasello, …, Abdelrahman Mohamed, Emmanuel Dupoux, …

CoRR, 2022

Textless Speech-to-Speech Translation on Real Data.

…, Paul-Ambroise Duquenne, …, Juan Miguel Pino, …

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation.

Proceedings of the Interspeech 2022, 2022

Flashlight: Enabling Innovation in Tools for Machine Learning.

…, Tatiana Likhomanenko, …, Paden Tomasello, …, Vitaliy Liptchinsky, Gabriel Synnaeve, Ronan Collobert

Proceedings of the International Conference on Machine Learning, 2022

Direct Speech-to-Speech Translation With Discrete Units.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Text-Free Prosody-Aware Generative Spoken Language Modeling.

Eugene Kharitonov, …, Kushal Lakhotia, …, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, …

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Textless Speech-to-Speech Translation on Real Data.

…, Paul-Ambroise Duquenne, …, Juan Miguel Pino, …

CoRR, 2021

Direct simultaneous speech to speech translation.

…, Kenneth Heafield, …, Juan Miguel Pino

CoRR, 2021

Direct speech-to-speech translation with discrete units.

…, Juan Miguel Pino, …

CoRR, 2021

Semi-Supervised end-to-end Speech Recognition via Local Prior Matching.

…, Gabriel Synnaeve, …

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training.

…, Tatiana Likhomanenko, …, Ronan Collobert, Gabriel Synnaeve, …

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit.

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2021

VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation.

…, Morgane Rivière, …, Chaitanya Talnikar, …, Mary Williamson, Juan Miguel Pino, Emmanuel Dupoux

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Discriminative Reranking for Neural Machine Translation.

…, Marc'Aurelio Ranzato

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Few-shot Sequence Learning with Transformers.

Lajanugen Logeswaran, …, Marc'Aurelio Ranzato, …

CoRR, 2020

Semi-Supervised Speech Recognition via Local Prior Matching.

…, Gabriel Synnaeve, …

CoRR, 2020

Facebook AI's WMT20 News Translation Task Submission.

…, Mary Williamson, …

Proceedings of the Fifth Conference on Machine Translation, 2020

Self-Training for End-to-End Speech Recognition.

Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

2019

Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions.

…, Ronan Collobert

Proceedings of the Interspeech 2019, 2019

2016

Language-independent methods for computer-assisted pronunciation training.

PhD thesis, 2016

Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition.

Proceedings of the Interspeech 2016, 2016

Personalized mispronunciation detection and diagnosis based on unsupervised error pattern discovery.

Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, 2016

2015

Mispronunciation detection without nonnative training data.

Proceedings of the INTERSPEECH 2015, 2015

2014

Context-dependent pronunciation error pattern discovery with limited annotations.

Proceedings of the INTERSPEECH 2014, 2014

2013

Pronunciation assessment via a comparison-based system.

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2013

Mispronunciation detection via dynamic time warping on deep belief network-based posteriorgrams.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

2012

A comparison-based approach to mispronunciation detection.

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Sentence Detection Using Multiple Annotations.

Proceedings of the INTERSPEECH 2012, 2012
