Isaac Caswell

According to our database1, Isaac Caswell authored at least 22 papers between 2019 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Separating the Wheat from the Chaff with BREAD: An open-source benchmark and metrics to detect redundancy in text.
CoRR, 2023

MADLAD-400: A Multilingual And Document-Level Large Audited Dataset.
CoRR, 2023

Bilex Rx: Lexical Data Augmentation for Massively Multilingual Machine Translation.
CoRR, 2023

MADLAD-400: A Multilingual And Document-Level Large Audited Dataset.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023


GATITOS: Using a New Multilingual Lexicon for Low-resource Machine Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets.
Trans. Assoc. Comput. Linguistics, 2022

Building Machine Translation Systems for the Next Thousand Languages.
CoRR, 2022

Towards the Next 1000 Languages in Multilingual Machine Translation: Exploring the Synergy Between Supervised and Self-Supervised Learning.
CoRR, 2022

Writing System and Speaker Metadata for 2, 800+ Language Varieties.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2020
BLEU might be Guilty but References are not Innocent.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Language ID in the Wild: Unexpected Challenges on the Path to a Thousand-Language Web Text Corpus.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Learning a Multi-Domain Curriculum for Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Translationese as a Language in "Multilingual" NMT.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Investigating Multilingual NMT Representations at Scale.
CoRR, 2019

Learning a Multitask Curriculum for Neural Machine Translation.
CoRR, 2019

Text Repair Model for Neural Machine Translation.
CoRR, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.
CoRR, 2019

APE at Scale and Its Implications on MT Evaluation Biases.
Proceedings of the Fourth Conference on Machine Translation, 2019

Tagged Back-Translation.
Proceedings of the Fourth Conference on Machine Translation, 2019

Investigating Multilingual NMT Representations at Scale.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Dynamically Composing Domain-Data Selection with Clean-Data Selection by "Co-Curricular Learning" for Neural Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019


  Loading...