Ann Lee

Affiliations:
  • Facebook, USA
  • Massachusetts Institute of Technology, Cambridge, USA (PhD 2016)


According to our database1, Ann Lee authored at least 39 papers between 2012 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Seamless: Multilingual Expressive and Streaming Speech Translation.
CoRR, 2023

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.
CoRR, 2023

Multilingual Speech-to-Speech Translation into Multiple Target Languages.
CoRR, 2023

Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Enhancing Speech-To-Speech Translation with Multiple TTS Targets.
Proceedings of the IEEE International Conference on Acoustics, 2023

Bridging Speech and Textual Pre-Trained Models With Unsupervised ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Speech-to-Speech Translation for a Real-world Unwritten Language.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
On The Robustness of Self-Supervised Representations for Spoken Language Modeling.
CoRR, 2022

textless-lib: a Library for Textless Spoken Language Processing.
CoRR, 2022

Textless Speech-to-Speech Translation on Real Data.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation.
Proceedings of the Interspeech 2022, 2022

Flashlight: Enabling Innovation in Tools for Machine Learning.
Proceedings of the International Conference on Machine Learning, 2022

Direct Speech-to-Speech Translation With Discrete Units.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Text-Free Prosody-Aware Generative Spoken Language Modeling.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Textless Speech-to-Speech Translation on Real Data.
CoRR, 2021

Direct simultaneous speech to speech translation.
CoRR, 2021

Direct speech-to-speech translation with discrete units.
CoRR, 2021

Semi-Supervised end-to-end Speech Recognition via Local Prior Matching.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

fairseq S\^2: A Scalable and Integrable Speech Synthesis Toolkit.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2021

VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Discriminative Reranking for Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Few-shot Sequence Learning with Transformers.
CoRR, 2020

Semi-Supervised Speech Recognition via Local Prior Matching.
CoRR, 2020

Facebook AI's WMT20 News Translation Task Submission.
Proceedings of the Fifth Conference on Machine Translation, 2020

Self-Training for End-to-End Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions.
Proceedings of the Interspeech 2019, 2019

2016
Language-independent methods for computer-assisted pronunciation training.
PhD thesis, 2016

Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition.
Proceedings of the Interspeech 2016, 2016

Personalized mispronunciation detection and diagnosis based on unsupervised error pattern discovery.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Mispronunciation detection without nonnative training data.
Proceedings of the INTERSPEECH 2015, 2015

2014
Context-dependent pronunciation error pattern discovery with limited annotations.
Proceedings of the INTERSPEECH 2014, 2014

2013
Pronunciation assessment via a comparison-based system.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2013

Mispronunciation detection via dynamic time warping on deep belief network-based posteriorgrams.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
A comparison-based approach to mispronunciation detection.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Sentence Detection Using Multiple Annotations.
Proceedings of the INTERSPEECH 2012, 2012


  Loading...