Ashish Vaswani

According to our database1, Ashish Vaswani authored at least 42 papers between 2006 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Guest Editorial Introduction to the Special Section on Transformer Models in Vision.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

2022
Scale Efficiently: Insights from Pretraining and Finetuning Transformers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

The Efficiency Misnomer.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Efficient Content-Based Sparse Attention with Routing Transformers.
Trans. Assoc. Comput. Linguistics, 2021

Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers.
CoRR, 2021

Simple and Efficient ways to Improve REALM.
CoRR, 2021

Scaling Local Self-Attention for Parameter Efficient Visual Backbones.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Bottleneck Transformers for Visual Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2019
Stand-Alone Self-Attention in Vision Models.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Music Transformer: Generating Music with Long-Term Structure.
Proceedings of the 7th International Conference on Learning Representations, 2019

Attention Augmented Convolutional Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
An Improved Relative Self-Attention Mechanism for Transformer with Application to Music Generation.
CoRR, 2018

Relational inductive biases, deep learning, and graph networks.
CoRR, 2018

Theory and Experiments on Vector Quantized Autoencoders.
CoRR, 2018

Image Transformer.
CoRR, 2018

Mesh-TensorFlow: Deep Learning for Supercomputers.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Self-Attention with Relative Position Representations.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Image Transformer.
Proceedings of the 35th International Conference on Machine Learning, 2018

Fast Decoding in Sequence Models Using Discrete Latent Variables.
Proceedings of the 35th International Conference on Machine Learning, 2018

Tensor2Tensor for Neural Machine Translation.
Proceedings of the 13th Conference of the Association for Machine Translation in the Americas, 2018

The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
One Model To Learn Them All.
CoRR, 2017

Attention is All you Need.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

2016
Efficient Structured Inference for Transition-Based Parsing with Neural Networks and Error States.
Trans. Assoc. Comput. Linguistics, 2016

Simple, Fast Noise-Contrastive Estimation for Large RNN Vocabularies.
Proceedings of the NAACL HLT 2016, 2016

Name Tagging for Low-resource Incident Languages based on Expectation-driven Learning.
Proceedings of the NAACL HLT 2016, 2016

Supertagging With LSTMs.
Proceedings of the NAACL HLT 2016, 2016

Unsupervised Neural Hidden Markov Models.
Proceedings of the Workshop on Structured Prediction for NLP@EMNLP 2016, 2016

2015
Model Invertibility Regularization: Sequence Alignment With or Without Parallel Data.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Unifying Bayesian Inference and Vector Space Models for Improved Decipherment.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Aligning context-based statistical models of language with brain activity during reading.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Beyond Parallel Data: Joint Word Alignment and Decipherment Improves Machine Translation.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013
Learning Whom to Trust with MACE.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Decoding with Large-Scale Neural Language Models Improves Translation.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

2012
Smaller Alignment Models for Better Translations: Unsupervised Word Alignment with the l0-norm.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Rule Markov Models for Fast Tree-to-String Translation.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Models and Training for Unsupervised Preposition Sense Disambiguation.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
Fast, Greedy Model Minimization for Unsupervised Tagging.
Proceedings of the COLING 2010, 2010

Efficient Optimization of an MDL-Inspired Objective Function for Unsupervised Part-Of-Speech Tagging.
Proceedings of the ACL 2010, 2010

2007
Hassan: A Virtual Human for Tactical Questioning.
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, 2007

2006
Radiobot-CFF: a spoken dialogue system for military training.
Proceedings of the INTERSPEECH 2006, 2006


  Loading...