Alexis Conneau

According to our database1, Alexis Conneau authored at least 35 papers between 2016 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Scaling Speech Technology to 1, 000+ Languages.
CoRR, 2023

Textually Pretrained Speech Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Scaling Laws for Generative Mixed-Modal Language Models.
Proceedings of the International Conference on Machine Learning, 2023

Toward Joint Language Modeling for Speech Units and Text.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
mSLAM: Massively multilingual joint pre-training for speech and text.
CoRR, 2022

FLEURS: FEW-Shot Learning Evaluation of Universal Representations of Speech.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation.
Proceedings of the Interspeech 2022, 2022


XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale.
Proceedings of the Interspeech 2022, 2022

Improved Language Identification Through Cross-Lingual Self-Supervised Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training.
CoRR, 2021

Larger-Scale Transformers for Multilingual Masked Language Modeling.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

Unsupervised Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Self-training Improves Pre-training for Natural Language Understanding.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Large-Scale Self- and Semi-Supervised Learning for Speech Translation.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Unsupervised Cross-Lingual Representation Learning for Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Supervised Contrastive Learning for Pre-trained Language Model Fine-tuning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Self-Training and Pre-Training are Complementary for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Multilingual Speech Translation from Efficient Finetuning of Pretrained Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Emerging Cross-lingual Structure in Pretrained Language Models.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Unsupervised Cross-lingual Representation Learning at Scale.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Learning distributed representations of sentences using neural networks. (Apprentissage et applications de représentations multilingues distribuées).
PhD thesis, 2019

Cross-lingual Language Model Pretraining.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Learning Visually Grounded Sentence Representations.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

SentEval: An Evaluation Toolkit for Universal Sentence Representations.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Word translation without parallel data.
Proceedings of the 6th International Conference on Learning Representations, 2018

Unsupervised Machine Translation Using Monolingual Corpora Only.
Proceedings of the 6th International Conference on Learning Representations, 2018

Phrase-Based & Neural Unsupervised Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

XNLI: Evaluating Cross-lingual Sentence Representations.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

What you can cram into a single \$&!#* vector: Probing sentence embeddings for linguistic properties.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Supervised Learning of Universal Sentence Representations from Natural Language Inference Data.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Very Deep Convolutional Networks for Text Classification.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

2016
Very Deep Convolutional Networks for Natural Language Processing.
CoRR, 2016

Meta-Prod2Vec: Product Embeddings Using Side-Information for Recommendation.
Proceedings of the 10th ACM Conference on Recommender Systems, 2016


  Loading...