Naman Goyal

According to our database, Naman Goyal authored at least 47 papers between 2016 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants.
CoRR, 2023

Llama 2: Open Foundation and Fine-Tuned Chat Models.
CoRR, 2023

A Theory on Adam Instability in Large-Scale Machine Learning.
CoRR, 2023

LLaMA: Open and Efficient Foundation Language Models.
CoRR, 2023

Text-To-4D Dynamic Scene Generation.
Proceedings of the International Conference on Machine Learning, 2023

Scaling Laws for Generative Mixed-Modal Language Models.
Proceedings of the International Conference on Machine Learning, 2023

Don't forget the nullspace! Nullspace occupancy as a mechanism for out of distribution failure.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
The Flores-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation.
Trans. Assoc. Comput. Linguistics, 2022

Multilingual Autoregressive Entity Linking.
Trans. Assoc. Comput. Linguistics, 2022

A survey on Self Supervised learning approaches for improving Multimodal representation learning.
CoRR, 2022

BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage.
CoRR, 2022

OPT: Open Pre-trained Transformer Language Models.
CoRR, 2022

Graph Neural Networks for Image Classification and Reinforcement Learning using Graph representations.
CoRR, 2022

CM3: A Causal Masked Multimodal Model of the Internet.
CoRR, 2022

Lifting the Curse of Multilinguality by Pre-training Modular Transformers.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale.
Proceedings of the Interspeech 2022, 2022

Few-shot Learning with Multilingual Generative Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

On the Role of Bidirectionality in Language Model Pre-Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training?
Proceedings of the 15th biennial conference of the Association for Machine Translation in the Americas (Volume 1: Research Track), 2022

2021
Beyond English-Centric Multilingual Machine Translation.
J. Mach. Learn. Res., 2021

Efficient Large Scale Language Modeling with Mixtures of Experts.
CoRR, 2021

Few-shot Learning with Multilingual Language Models.
CoRR, 2021

Findings of the WMT 2021 Shared Task on Large-Scale Multilingual Machine Translation.
Proceedings of the Sixth Conference on Machine Translation, 2021

Larger-Scale Transformers for Multilingual Masked Language Modeling.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

BASE Layers: Simplifying Training of Large, Sparse Models.
Proceedings of the 38th International Conference on Machine Learning, 2021

Better Fine-Tuning by Reducing Representational Collapse.
Proceedings of the 9th International Conference on Learning Representations, 2021

Recipes for Building an Open-Domain Chatbot.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Multilingual Translation from Denoising Pre-Training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Multilingual Denoising Pre-training for Neural Machine Translation.
Trans. Assoc. Comput. Linguistics, 2020

Beyond English-Centric Multilingual Machine Translation.
CoRR, 2020

Multilingual Translation with Extensible Multilingual Pretraining and Finetuning.
CoRR, 2020

Recipes for building an open-domain chatbot.
CoRR, 2020

Findings of the WMT 2020 Shared Task on Parallel Corpus Filtering and Alignment.
Proceedings of the Fifth Conference on Machine Translation, 2020

Facebook AI's WMT20 News Translation Task Submission.
Proceedings of the Fifth Conference on Machine Translation, 2020

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Unsupervised Cross-lingual Representation Learning at Scale.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach.
CoRR, 2019

2017
LearningToQuestion at SemEval 2017 Task 3: Ranking Similar Questions by Learning to Rank Using Rich Features.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

2016
The Social Dynamics of Language Change in Online Networks.
Proceedings of the Social Informatics - 8th International Conference, 2016

Poster: Understanding the Routine Activities of Students in Campus using Smartphone Sensors.
Proceedings of the 14th Annual International Conference on Mobile Systems, 2016

Active Learning in Multi-objective Evolutionary Algorithms for Sustainable Building Design.
Proceedings of the 2016 on Genetic and Evolutionary Computation Conference, Denver, CO, USA, July 20, 2016

A Joint Model of Rhetorical Discourse Structure and Summarization.
Proceedings of the Workshop on Structured Prediction for NLP@EMNLP 2016, 2016

