Rami Al-Rfou

Affiliations:

Google AI, Mountain View, CA, USA
Stony Brook University, Department of Computer Science, NY, USA

According to our database, Rami Al-Rfou authored at least 43 papers between 2012 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

Scaling Laws of Motion Forecasting and Planning - A Technical Report.

Mustafa Baniodeh

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Dragomir Anguelov

CoRR, June, 2025

2024

Let Your Graph Do the Talking: Encoding Structured Data for LLMs.

,

,

,

Anton Tsitsulin

,

Seyed Mehran Kazemi

,

,

Jonathan Halcrow

CoRR, 2024

Scaling Motion Forecasting Models with Ensemble Distillation.

,

,

Avikalp Srivastava

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

MoST: Multi-modality Scene Tokenization for Motion Prediction.

,

,

,

,

,

,

,

,

,

,

,

,

Dragomir Anguelov

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

VN-Transformer: Rotation-Equivariant Attention for Vector Neurons.

,

,

,

Nigamaa Nayakanti

,

Trans. Mach. Learn. Res., 2023

Fine-Tashkeel: Finetuning Byte-Level Models for Accurate Arabic Text Diacritization.

Bashar Al-Rfooh

,

Gheith A. Abandah

,

CoRR, 2023

Wayformer: Motion Forecasting via Simple & Efficient Attention Networks.

Nigamaa Nayakanti

,

,

,

,

Khaled S. Refaat

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

MotionLM: Multi-Agent Motion Forecasting as Language Modeling.

,

,

,

,

,

Nigamaa Nayakanti

,

Khaled S. Refaat

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

ByT5: Towards a Token-Free Future with Pre-trained Byte-to-Byte Models.

,

,

,

,

,

,

,

Trans. Assoc. Comput. Linguistics, 2022

Narrowing the Coordinate-frame Gap in Behavior Prediction Models: Distillation for Efficient and Accurate Scene-centric Motion Forecasting.

,

Bertrand Douillard

,

,

,

CoRR, 2022

Narrowing the coordinate-frame gap in behavior prediction models: Distillation for efficient and accurate scene-centric motion forecasting.

,

Bertrand Douillard

,

,

,

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

SPoT: Better Frozen Model Adaptation through Soft Prompt Transfer.

,

,

,

,

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer.

,

,

,

,

,

Aditya Siddhant

,

,

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training.

,

,

,

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

The Power of Scale for Parameter-Efficient Prompt Tuning.

,

,

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

nmT5 - Is parallel data still relevant for pre-training massively multilingual language models?

,

Aditya Siddhant

,

,

,

,

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Large Scale Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training.

,

,

,

CoRR, 2020

Wiki-40B: Multilingual Language Model Dataset.

,

,

Denny Vrandecic

,

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

LAReQA: Language-Agnostic Answer Retrieval from a Multilingual Pool.

,

,

,

,

,

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019

Bridging the Gap for Tokenizer-Free Language Models.

,

,

,

,

CoRR, 2019

DDGK: Learning Graph Representations for Deep Divergence Graph Kernels.

,

,

Proceedings of the World Wide Web Conference, 2019

Character-Level Language Modeling with Deeper Self-Attention.

,

,

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

A Tutorial on Network Embeddings.

,

,

,

CoRR, 2018

Watch Your Step: Learning Node Embeddings via Graph Attention.

Sami Abu-El-Haija

,

,

,

Alexander A. Alemi

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017

Watch Your Step: Learning Graph Embeddings Through Attention.

Sami Abu-El-Haija

,

,

,

CoRR, 2017

Creating Virtual Universes Using Generative Adversarial Networks.

Mustafa Mustafa

,

,

,

,

CoRR, 2017

Efficient Natural Language Response Suggestion for Smart Reply.

Matthew L. Henderson

,

,

,

,

László Lukács

,

,

,

,

CoRR, 2017

Detecting English Writing Styles For Non Native Speakers.

,

,

CoRR, 2017

Learning Edge Representations via Low-Rank Asymmetric Projections.

Sami Abu-El-Haija

,

,

Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016

A Growing Long-term Episodic & Semantic Memory.

,

,

,

CoRR, 2016

Visualizing Linguistic Shift.

,

,

CoRR, 2016

Conversational Contextual Cues: The Case of Personalization and History for Response Ranking.

,

,

,

,

,

CoRR, 2016

Theano: A Python framework for fast computation of mathematical expressions.

,

Guillaume Alain

,

Amjad Almahairi

,

Christof Angermüller

,

Dzmitry Bahdanau

,

,

Frédéric Bastien

,

,

Anatoly Belikov

,

Alexander Belopolsky

,

,

Arnaud Bergeron

,

,

Valentin Bisson

,

Josh Bleecher Snyder

,

Nicolas Bouchard

,

Nicolas Boulanger-Lewandowski

,

Xavier Bouthillier

,

Alexandre de Brébisson

,

Olivier Breuleux

,

Pierre Luc Carrier

,

,

,

Paul F. Christiano

,

,

Marc-Alexandre Côté

,

,

Aaron C. Courville

,

Yann N. Dauphin

,

Olivier Delalleau

,

,

Guillaume Desjardins

,

Sander Dieleman

,

,

Melanie Ducoffe

,

Vincent Dumoulin

,

Samira Ebrahimi Kahou

,

,

,

,

Mathieu Germain

,

,

Ian J. Goodfellow

,

,

Çaglar Gülçehre

,

,

Iban Harlouchet

,

Jean-Philippe Heng

,

,

,

,

Sébastien Jean

,

,

Mikhail Korobov

,

,

,

,

,

,

,

Simon Lefrançois

,

,

Nicholas Léonard

,

,

Jesse A. Livezey

,

,

,

,

Pierre-Antoine Manzagol

,

Olivier Mastropietro

,

Robert McGibbon

,

Roland Memisevic

,

Bart van Merriënboer

,

Vincent Michalski

,

,

Alberto Orlandi

,

Christopher Joseph Pal

,

,

Mohammad Pezeshki

,

,

,

Matthew Rocklin

,

,

,

,

,

François Savard

,

,

,

Gabriel Schwartz

,

Iulian Vlad Serban

,

Dmitriy Serdyuk

,

Samira Shabanian

,

,

Sigurd Spieckermann

,

S. Ramana Subramanyam

,

Jakub Sygnowski

,

Jérémie Tanguay

,

Gijs van Tulder

,

Joseph P. Turian

,

Sebastian Urban

,

,

Francesco Visin

,

,

David Warde-Farley

,

,

Matthew Willson

,

,

,

,

,

CoRR, 2016

2015

Statistically Significant Detection of Linguistic Change.

,

,

,

Proceedings of the 24th International Conference on World Wide Web, 2015

POLYGLOT-NER: Massive Multilingual Named Entity Recognition.

,

,

,

Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

2014

Exploring the power of GPU's for training Polyglot language models.

,

,

,

CoRR, 2014

DeepWalk: online learning of social representations.

,

,

Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Inducing Language Networks from Continuous Space Word Representations.

,

,

,

Proceedings of the Complex Networks V, 2014

2013

The Expressive Power of Word Embeddings.

,

,

,

CoRR, 2013

Polyglot: Distributed Word Representations for Multilingual NLP.

,

,

Proceedings of the Seventeenth Conference on Computational Natural Language Learning, 2013

2012

Detecting English Writing Styles For Non-native Speakers.

CoRR, 2012

TrackMeNot-so-good-after-all.

,

,

Nikhil Patwardhan

CoRR, 2012

SpeedRead: A Fast Named Entity Recognition Pipeline.

,

Proceedings of the COLING 2012, 2012
