Benjamin Muller

Affiliations:
  • Meta, NYC, USA


According to our database1, Benjamin Muller authored at least 21 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
SpiRit-LM: Interleaved Spoken and Written Language Model.
CoRR, 2024

2023
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning.
CoRR, 2023

The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants.
CoRR, 2023

The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages.
Proceedings of the Eighth Conference on Machine Translation, 2023

Evaluating and Modeling Attribution for Cross-Lingual Question Answering.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
How Can We Make Language Models Better at Handling the Diversity and Variability of Natural Languages ? (Comment rendre les modèles de langue meilleurs face à la grande diversité et variabilité des langues ?).
PhD thesis, 2022

Inria-ALMAnaCH at WMT 2022: Does Transcription Help Cross-Script Machine Translation?
Proceedings of the Seventh Conference on Machine Translation, 2022

Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer.
Proceedings of the Transfer Learning for Natural Language Processing Workshop, 2022

Quand être absent de mBERT n'est que le commencement : Gérer de nouvelles langues à l'aide de modèles de langues multilingues (When Being Unseen from mBERT is just the Beginning : Handling New Languages With Multilingual Language Models).
Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022

Cross-Lingual Open-Domain Question Answering with Answer Sentence Generation.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

2021
Cross-Lingual GenQA: A Language-Agnostic Generative Question Answering Approach for Open-Domain Question Answering.
CoRR, 2021

When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

2020
Can Multilingual Language Models Transfer to an Unseen Dialect? A Case Study on North African Arabizi.
CoRR, 2020

Les modèles de langue contextuels Camembert pour le français : impact de la taille et de l'hétérogénéité des données d'entrainement (C AMEM BERT Contextual Language Models for French: Impact of Training Data Size and Heterogeneity ).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Establishing a New State-of-the-Art for French Named Entity Recognition.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Building a User-Generated Content North-African Arabizi Treebank: Tackling Hell.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

CamemBERT: a Tasty French Language Model.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Enhancing BERT for Lexical Normalization.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

2018
ELMoLex: Connecting ELMo and Lexicon Features for Dependency Parsing.
Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Brussels, Belgium, October 31, 2018


  Loading...