We stand with Ukraine

We stand with Ukraine

Ashish Vaswani

According to our database¹, Ashish Vaswani authored at least 42 papers between 2006 and 2023.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2023

Guest Editorial Introduction to the Special Section on Transformer Models in Vision.

[BibT_eX]

[DOI]

,

Fahad Shahbaz Khan

,

,

,

Ming-Hsuan Yang

,

IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

2022

Scale Efficiently: Insights from Pretraining and Finetuning Transformers.

[BibT_eX]

[DOI]

,

Mostafa Dehghani

,

,

,

,

Hyung Won Chung

,

,

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

The Efficiency Misnomer.

[BibT_eX]

[DOI]

Mostafa Dehghani

,

,

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Efficient Content-Based Sparse Attention with Routing Transformers.

[BibT_eX]

[DOI]

,

Mohammad Saffar

,

,

Trans. Assoc. Comput. Linguistics, 2021

Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers.

[BibT_eX]

[DOI]

,

Mostafa Dehghani

,

,

,

,

Hyung Won Chung

,

,

,

,

CoRR, 2021

Simple and Efficient ways to Improve REALM.

[BibT_eX]

[DOI]

Vidhisha Balachandran

,

,

,

CoRR, 2021

Scaling Local Self-Attention for Parameter Efficient Visual Backbones.

[BibT_eX]

[DOI]

,

Prajit Ramachandran

,

Aravind Srinivas

,

,

Blake A. Hechtman

,

Jonathon Shlens

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Bottleneck Transformers for Visual Recognition.

[BibT_eX]

[DOI]

Aravind Srinivas

,

,

,

Jonathon Shlens

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2019

Stand-Alone Self-Attention in Vision Models.

[BibT_eX]

[DOI]

,

Prajit Ramachandran

,

,

,

Anselm Levskaya

,

Jonathon Shlens

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Music Transformer: Generating Music with Long-Term Structure.

[BibT_eX]

[DOI]

Cheng-Zhi Anna Huang

,

,

Jakob Uszkoreit

,

,

Curtis Hawthorne

,

,

,

Matthew D. Hoffman

,

Monica Dinculescu

,

Proceedings of the 7th International Conference on Learning Representations, 2019

Attention Augmented Convolutional Networks.

[BibT_eX]

[DOI]

,

,

,

,

Jonathon Shlens

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation.

[BibT_eX]

[DOI]

,

Gabriel Magalhães

,

,

,

,

Jason Baldridge

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

An Improved Relative Self-Attention Mechanism for Transformer with Application to Music Generation.

[BibT_eX]

[DOI]

Cheng-Zhi Anna Huang

,

,

Jakob Uszkoreit

,

,

Curtis Hawthorne

,

,

Matthew D. Hoffman

,

CoRR, 2018

Relational inductive biases, deep learning, and graph networks.

[BibT_eX]

[DOI]

CoRR, 2018

Theory and Experiments on Vector Quantized Autoencoders.

[BibT_eX]

[DOI]

,

,

Arvind Neelakantan

,

CoRR, 2018

Image Transformer.

[BibT_eX]

[DOI]

,

,

Jakob Uszkoreit

,

,

,

CoRR, 2018

Mesh-TensorFlow: Deep Learning for Supercomputers.

[BibT_eX]

[DOI]

,

,

,

,

,

Penporn Koanantakool

,

,

,

,

,

,

Blake A. Hechtman

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Self-Attention with Relative Position Representations.

[BibT_eX]

[DOI]

,

Jakob Uszkoreit

,

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Image Transformer.

[BibT_eX]

[DOI]

,

,

Jakob Uszkoreit

,

,

,

,

Proceedings of the 35th International Conference on Machine Learning, 2018

Fast Decoding in Sequence Models Using Discrete Latent Variables.

[BibT_eX]

[DOI]

,

,

,

,

,

Jakob Uszkoreit

,

Proceedings of the 35th International Conference on Machine Learning, 2018

Tensor2Tensor for Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

François Chollet

,

,

,

,

,

Nal Kalchbrenner

,

,

,

,

Jakob Uszkoreit

Proceedings of the 13th Conference of the Association for Machine Translation in the Americas, 2018

The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

Wolfgang Macherey

,

George F. Foster

,

,

,

,

,

,

Jakob Uszkoreit

,

,

,

,

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

One Model To Learn Them All.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Jakob Uszkoreit

CoRR, 2017

Attention is All you Need.

[BibT_eX]

[DOI]

,

,

,

Jakob Uszkoreit

,

,

,

,

Illia Polosukhin

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

2016

Efficient Structured Inference for Transition-Based Parsing with Neural Networks and Error States.

[BibT_eX]

[DOI]

,

Trans. Assoc. Comput. Linguistics, 2016

Simple, Fast Noise-Contrastive Estimation for Large RNN Vocabularies.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the NAACL HLT 2016, 2016

Name Tagging for Low-resource Incident Languages based on Expectation-driven Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the NAACL HLT 2016, 2016

Supertagging With LSTMs.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the NAACL HLT 2016, 2016

Unsupervised Neural Hidden Markov Models.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Workshop on Structured Prediction for NLP@EMNLP 2016, 2016

2015

Model Invertibility Regularization: Sequence Alignment With or Without Parallel Data.

[BibT_eX]

[DOI]

Tomer Levinboim

,

,

Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Unifying Bayesian Inference and Vector Space Models for Improved Decipherment.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014

Aligning context-based statistical models of language with brain activity during reading.

[BibT_eX]

[DOI]

,

,

,

Tom M. Mitchell

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Beyond Parallel Data: Joint Word Alignment and Decipherment Improves Machine Translation.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013

Learning Whom to Trust with MACE.

[BibT_eX]

[DOI]

,

Taylor Berg-Kirkpatrick

,

,

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Decoding with Large-Scale Neural Language Models Improves Translation.

[BibT_eX]

[DOI]

,

,

Victoria Fossum

,

Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

2012

Smaller Alignment Models for Better Translations: Unsupervised Word Alignment with the l0-norm.

[BibT_eX]

[DOI]

,

,

Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011

Rule Markov Models for Fast Tree-to-String Translation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Models and Training for Unsupervised Preposition Sense Disambiguation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010

Fast, Greedy Model Minimization for Unsupervised Tagging.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the COLING 2010, 2010

Efficient Optimization of an MDL-Inspired Objective Function for Unsupervised Part-Of-Speech Tagging.

[BibT_eX]

[DOI]

,

,

Proceedings of the ACL 2010, 2010

2007

Hassan: A Virtual Human for Tactical Questioning.

[BibT_eX]

[DOI]

,

,

,

Panayiotis G. Georgiou

,

,

Bilyana Martinovski

,

Shrikanth Narayanan

,

,

Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, 2007

2006

Radiobot-CFF: a spoken dialogue system for military training.

[BibT_eX]

[DOI]

,

,

Vivek Kumar Rangarajan Sridhar

,

,

,

Shrikanth S. Narayanan

,

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Loading...