Rami Al-Rfou

Affiliations:
  • Google AI, Mountain View, CA, USA
  • Stony Brook University, Department of Computer Science, NY, USA


According to our database1, Rami Al-Rfou authored at least 40 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Let Your Graph Do the Talking: Encoding Structured Data for LLMs.
CoRR, 2024

2023
VN-Transformer: Rotation-Equivariant Attention for Vector Neurons.
Trans. Mach. Learn. Res., 2023

Fine-Tashkeel: Finetuning Byte-Level Models for Accurate Arabic Text Diacritization.
CoRR, 2023

Wayformer: Motion Forecasting via Simple & Efficient Attention Networks.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

MotionLM: Multi-Agent Motion Forecasting as Language Modeling.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
ByT5: Towards a Token-Free Future with Pre-trained Byte-to-Byte Models.
Trans. Assoc. Comput. Linguistics, 2022

Narrowing the Coordinate-frame Gap in Behavior Prediction Models: Distillation for Efficient and Accurate Scene-centric Motion Forecasting.
CoRR, 2022

Narrowing the coordinate-frame gap in behavior prediction models: Distillation for efficient and accurate scene-centric motion forecasting.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

The Power of Scale for Parameter-Efficient Prompt Tuning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

nmT5 - Is parallel data still relevant for pre-training massively multilingual language models?
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Large Scale Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training.
CoRR, 2020

Wiki-40B: Multilingual Language Model Dataset.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

LAReQA: Language-Agnostic Answer Retrieval from a Multilingual Pool.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Bridging the Gap for Tokenizer-Free Language Models.
CoRR, 2019

DDGK: Learning Graph Representations for Deep Divergence Graph Kernels.
Proceedings of the World Wide Web Conference, 2019

Character-Level Language Modeling with Deeper Self-Attention.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
A Tutorial on Network Embeddings.
CoRR, 2018

Watch Your Step: Learning Node Embeddings via Graph Attention.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
Watch Your Step: Learning Graph Embeddings Through Attention.
CoRR, 2017

Creating Virtual Universes Using Generative Adversarial Networks.
CoRR, 2017

Efficient Natural Language Response Suggestion for Smart Reply.
CoRR, 2017

Detecting English Writing Styles For Non Native Speakers.
CoRR, 2017

Learning Edge Representations via Low-Rank Asymmetric Projections.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016
A Growing Long-term Episodic & Semantic Memory.
CoRR, 2016

Visualizing Linguistic Shift.
CoRR, 2016

Conversational Contextual Cues: The Case of Personalization and History for Response Ranking.
CoRR, 2016

Theano: A Python framework for fast computation of mathematical expressions.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2016

2015
Statistically Significant Detection of Linguistic Change.
Proceedings of the 24th International Conference on World Wide Web, 2015

POLYGLOT-NER: Massive Multilingual Named Entity Recognition.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

2014
Exploring the power of GPU's for training Polyglot language models.
CoRR, 2014

DeepWalk: online learning of social representations.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Inducing Language Networks from Continuous Space Word Representations.
Proceedings of the Complex Networks V, 2014

2013
The Expressive Power of Word Embeddings
CoRR, 2013

Polyglot: Distributed Word Representations for Multilingual NLP.
Proceedings of the Seventeenth Conference on Computational Natural Language Learning, 2013

2012
Detecting English Writing Styles For Non-native Speakers
CoRR, 2012

TrackMeNot-so-good-after-all
CoRR, 2012

SpeedRead: A Fast Named Entity Recognition Pipeline.
Proceedings of the COLING 2012, 2012


  Loading...