Joshua Ainslie

According to our database, Joshua Ainslie authored at least 28 papers between 2020 and 2023.

Bibliography

2023
Functional Interpolation for Relative Positions Improves Long Context Transformers.
CoRR, 2023

MEMORY-VQ: Compression for Tractable Internet-Scale Memory.
CoRR, 2023

GLIMMER: generalized late-interaction memory reranker.
CoRR, 2023

WikiWeb2M: A Page-Level Multimodal Wikipedia Dataset.
CoRR, 2023

Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute.
Proceedings of the International Conference on Machine Learning, 2023

Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

mLongT5: A Multilingual and Efficient Text-To-Text Transformer for Longer Sequences.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CoLT5: Faster Long-Range Transformers with Conditional Computation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
LogicInference: A New Dataset for Teaching Logical Inference to seq2seq Models.
CoRR, 2022

FNet: Mixing Tokens with Fourier Transforms.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

LongT5: Efficient Text-To-Text Transformer for Long Sequences.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Sparse Mixers: Combining MoE and Mixing to build a more efficient BERT.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Generate-and-Retrieve: Use Your Predictions to Improve Retrieval for Semantic Parsing.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Making Transformers Solve Compositional Tasks.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Iterative Decoding for Compositional Generalization in Transformers.
CoRR, 2021

ShopTalk: A System for Conversational Faceted Search.
CoRR, 2021

ReadTwice: Reading Very Large Documents with Memories.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Improving Compositional Generalization in Classification Tasks via Structure Annotations.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

RealFormer: Transformer Likes Residual Attention.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
ETC: Encoding Long and Structured Data in Transformers.
CoRR, 2020

Big Bird: Transformers for Longer Sequences.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

ETC: Encoding Long and Structured Inputs in Transformers.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

