Mandy Guo

According to our database1, Mandy Guo authored at least 20 papers between 2018 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
WikiWeb2M: A Page-Level Multimodal Wikipedia Dataset.
CoRR, 2023

CoBIT: A Contrastive Bi-directional Image-Text Generation Model.
CoRR, 2023

mLongT5: A Multilingual and Efficient Text-To-Text Transformer for Longer Sequences.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CoLT5: Faster Long-Range Transformers with Conditional Computation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
LongT5: Efficient Text-To-Text Transformer for Long Sequences.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

2021
MURAL: Multimodal, Multitask Retrieval Across Languages.
CoRR, 2021

Towards Universality in Multilingual Text Rewriting.
CoRR, 2021

MURAL: Multimodal, Multitask Representations Across Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Neural Retrieval for Question Answering with Cross-Attention Supervised Data Augmentation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

TextSETTR: Few-Shot Text Style Extraction and Tunable Targeted Restyling.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
TextSETTR: Label-Free Text Style Extraction and Tunable Targeted Restyling.
CoRR, 2020

MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models.
CoRR, 2020

Wiki-40B: Multilingual Language Model Dataset.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Multilingual Universal Sentence Encoder for Semantic Retrieval.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

2019
Bridging the Gap for Tokenizer-Free Language Models.
CoRR, 2019

Hierarchical Document Encoder for Parallel Corpus Mining.
Proceedings of the Fourth Conference on Machine Translation, 2019

Improving Multilingual Sentence Embedding using Bi-directional Dual Encoder with Additive Margin Softmax.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Character-Level Language Modeling with Deeper Self-Attention.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Effective Parallel Corpus Mining using Bilingual Sentence Embeddings.
Proceedings of the Third Conference on Machine Translation: Research Papers, 2018


  Loading...