Ruslan Salakhutdinov

ORCID: 0000-0002-3752-2756

Affiliations:
  • Carnegie Mellon University, Machine Learning Department, Pittsburgh, PA, USA
  • University of Toronto, Departments of Statistics and Computer Science, ON, Canada
  • Massachusetts Institute of Technology, Artificial Intelligence Lab, Cambridge, MA, USA


According to our database, Ruslan Salakhutdinov authored at least 326 papers between 2003 and 2024.

Bibliography

2024
Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation.
CoRR, 2024

Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference.
CoRR, 2024

Automatic Question-Answer Generation for Long-Tail Knowledge.
CoRR, 2024

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web.
CoRR, 2024

VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks.
CoRR, 2024

2023
Probing Predictions on OOD Images via Nearest Categories.
Trans. Mach. Learn. Res., 2023

MultiZoo and MultiBench: A Standardized Toolkit for Multimodal Deep Learning.
J. Mach. Learn. Res., 2023

Manifold Preserving Guided Diffusion.
CoRR, 2023

MMOE: Mixture of Multimodal Interaction Experts.
CoRR, 2023

MultiIoT: Towards Large-scale Multisensory Learning for the Internet of Things.
CoRR, 2023

Contrastive Difference Predictive Coding.
CoRR, 2023

Multimodal Graph Learning for Generative Tasks.
CoRR, 2023

Confronting Reward Model Overoptimization with Constrained RLHF.
CoRR, 2023

Answering Ambiguous Questions with a Database of Questions, Answers, and Revisions.
CoRR, 2023

A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning.
CoRR, 2023

MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning.
CoRR, 2023

Localized Text-to-Image Generation for Free via Cross Attention Control.
CoRR, 2023

Factorized Contrastive Learning: Going Beyond Multi-view Redundancy.
CoRR, 2023

Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications.
CoRR, 2023

Stabilizing Contrastive RL: Techniques for Offline Goal Reaching.
CoRR, 2023

SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning.
CoRR, 2023

Plan, Eliminate, and Track - Language Models are Good Teachers for Embodied Agents.
CoRR, 2023

Quantifying & Modeling Feature Interactions: An Information Decomposition Framework.
CoRR, 2023

Effective Data Augmentation With Diffusion Models.
CoRR, 2023

Grounding Language Models to Images for Multimodal Generation.
CoRR, 2023

SPRING: Studying Papers and Reasoning to play Games.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Factorized Contrastive Learning: Going Beyond Multi-view Redundancy.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Generating Images with Multimodal Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Contrastive Example-Based Control.
Proceedings of the Learning for Dynamics and Control Conference, 2023

Self-Supervised Object Goal Navigation with In-Situ Finetuning.
IROS, 2023

Graph Generative Model for Benchmarking Graph Neural Networks.
Proceedings of the International Conference on Machine Learning, 2023

Grounding Language Models to Images for Multimodal Inputs and Outputs.
Proceedings of the International Conference on Machine Learning, 2023

A Connection between One-Step RL and Critic Regularization in Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Multimodal Fusion Interactions: A Study of Human and Automatic Quantification.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

A Simple Approach for Visual Room Rearrangement: 3D Mapping and Semantic Search.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Scenario-based Question Answering with Interacting Contextual Properties.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

MultiViz: Towards Visualizing and Understanding Multimodal Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Imitating Task and Motion Planning with Visuomotor Transformers.
Proceedings of the Conference on Robot Learning, 2023

MultiViz: Towards User-Centric Visualizations and Interpretations of Multimodal Models.
Proceedings of the Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Cross-modal Attention Congruence Regularization for Vision-Language Relation Alignment.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Object Goal Navigation with End-to-End Self-Supervision.
CoRR, 2022

Paraphrasing Is All You Need for Novel Object Captioning.
CoRR, 2022

Scalable Privacy-enhanced Benchmark Graph Generative Model for Graph Convolutional Networks.
CoRR, 2022

MultiViz: An Analysis Benchmark for Visualizing and Understanding Multimodal Models.
CoRR, 2022

A Simple Approach for Visual Rearrangement: 3D Mapping and Semantic Search.
CoRR, 2022

Reasoning over Logically Interacted Conditions for Question Answering.
CoRR, 2022

Zero-shot Domain Adaptation of Heterogeneous Graphs via Knowledge Transfer Networks.
CoRR, 2022

HighMMT: Towards Modality and Task Generalization for High-Modality Representation Learning.
CoRR, 2022

Feature-Robust Optimal Transport for High-Dimensional Data.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Zero-shot Transfer Learning within a Heterogeneous Graph via Knowledge Transfer Networks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Paraphrasing Is All You Need for Novel Object Captioning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Contrastive Learning as Goal-Conditioned Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Imitating Past Successes can be Very Suboptimal.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Mismatched No More: Joint Model-Policy Optimization for Model-Based RL.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs.
Proceedings of the International Conference on Machine Learning, 2022

C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Conditional Contrastive Learning with Kernel.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning Weakly-supervised Contrastive Representations.
Proceedings of the Tenth International Conference on Learning Representations, 2022

FILM: Following Instructions in Language with Modular Methods.
Proceedings of the Tenth International Conference on Learning Representations, 2022

The Information Geometry of Unsupervised Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Don't Copy the Teacher: Data and Model Challenges in Embodied Dialogue.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

PACS: A Dataset for Physical Audiovisual CommonSense Reasoning.
Proceedings of the Computer Vision - ECCV 2022, 2022

DIME: Fine-grained Interpretations of Multimodal Models via Disentangled Local Explanations.
Proceedings of the AIES '22: AAAI/ACM Conference on AI, Ethics, and Society, Oxford, United Kingdom, May 19, 2022

FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

ConditionalQA: A Complex Reading Comprehension Dataset with Conditional Answers.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Recurrent Model-Free RL is a Strong Baseline for Many POMDPs.
CoRR, 2021

FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding.
CoRR, 2021

Online Sub-Sampling for Reinforcement Learning with General Function Approximation.
CoRR, 2021

Integrating Auxiliary Information in Self-supervised Learning.
CoRR, 2021

Conditional Contrastive Learning: Removing Undesirable Information in Self-Supervised Representations.
CoRR, 2021

End-to-End Multihop Retrieval for Compositional Question Answering over Long Documents.
CoRR, 2021

A Note on Connecting Barlow Twins with Negative-Sample-Free Contrastive Learning.
CoRR, 2021

The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors.
CoRR, 2021

Understanding the Tradeoffs in Client-Side Privacy for Speech Recognition.
CoRR, 2021

LSMI-Sinkhorn: Semi-supervised Mutual Information Estimation with Optimal Transport.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2021

MultiBench: Multiscale Benchmarks for Multimodal Representation Learning.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Robust Predictable Control.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Focused Attention Improves Document-Grounded Generation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Case Study: Deontological Ethics in NLP.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-Alignment.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multimodal Speech Summarization Through Semantic Concept Learning.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Instabilities of Offline RL with Pre-Trained Neural Representation.
Proceedings of the 38th International Conference on Machine Learning, 2021

Reasoning Over Virtual Knowledge Bases With Open Predicate Relations.
Proceedings of the 38th International Conference on Machine Learning, 2021

Information Obfuscation of Graph Neural Networks.
Proceedings of the 38th International Conference on Machine Learning, 2021

Towards Understanding and Mitigating Social Biases in Language Models.
Proceedings of the 38th International Conference on Machine Learning, 2021

On Proximal Policy Optimization's Heavy-tailed Gradients.
Proceedings of the 38th International Conference on Machine Learning, 2021

Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

From Differentiable Reasoning to Self-supervised Embodied Active Learning.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

Self-supervised Representation Learning with Relative Predictive Coding.
Proceedings of the 9th International Conference on Learning Representations, 2021

Self-supervised Learning from a Multi-view Perspective.
Proceedings of the 9th International Conference on Learning Representations, 2021

Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation.
Proceedings of the 9th International Conference on Learning Representations, 2021

C-Learning: Learning to Achieve Goals via Recursive Classification.
Proceedings of the 9th International Conference on Learning Representations, 2021

Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers.
Proceedings of the 9th International Conference on Learning Representations, 2021

Learning to Hallucinate Examples from Extrinsic and Intrinsic Supervision.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Hubert: How Much Can a Bad Teacher Benefit ASR Pre-Training?
Proceedings of the IEEE International Conference on Acoustics, 2021

Understanding the Tradeoffs in Client-side Privacy for Downstream Speech Tasks.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Combining Programmable Potentials and Neural Networks for Materials Problems.
Proceedings of the AAAI 2021 Spring Symposium on Combining Artificial Intelligence and Machine Learning with Physical Sciences, Stanford, CA, USA, March 22nd, 2021

2020
Close Category Generalization.
CoRR, 2020

Unsupervised Domain Adaptation for Visual Navigation.
CoRR, 2020

Planning with Submodular Objective Functions.
CoRR, 2020

Graph Adversarial Networks: Protecting Information against Adversarial Attacks.
CoRR, 2020

Few-Shot Learning with Intra-Class Knowledge Transfer.
CoRR, 2020

Demystifying Self-Supervised Learning: An Information-Theoretical Framework.
CoRR, 2020

Feature Robust Optimal Transport for High-dimensional Data.
CoRR, 2020

Provably Efficient Reinforcement Learning with General Value Function Approximation.
CoRR, 2020

Guaranteeing Reproducibility in Deep Learning Competitions.
CoRR, 2020

Interpretable Multimodal Routing for Human Multimodal Language.
CoRR, 2020

Adversarial Robustness Through Local Lipschitzness.
CoRR, 2020

Learning Not to Learn in the Presence of Noisy Labels.
CoRR, 2020

Think Locally, Act Globally: Federated Learning with Local and Global Representations.
CoRR, 2020

A Closer Look at Accuracy vs. Robustness.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Planning with General Objective Functions: Going Beyond Total Rewards.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Reinforcement Learning with General Value Function Approximation: Provably Efficient Approach via Bounded Eluder Dimension.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On Reward-Free Reinforcement Learning with Linear Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Neural Methods for Point-wise Dependency Estimation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Weakly-Supervised Reinforcement Learning for Controllable Behavior.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Object Goal Navigation using Goal-Oriented Semantic Exploration.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Embodied Multimodal Multitask Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Capsules with Inverted Dot-Product Attention Routing.
Proceedings of the 8th International Conference on Learning Representations, 2020

Differentiable Reasoning over a Virtual Knowledge Base.
Proceedings of the 8th International Conference on Learning Representations, 2020

Learning To Explore Using Active Neural SLAM.
Proceedings of the 8th International Conference on Learning Representations, 2020

Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks.
Proceedings of the 8th International Conference on Learning Representations, 2020

Complex Transformer: A Framework for Modeling Complex-Valued Sequence.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Neural Topological SLAM for Visual Navigation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Exploring Controllable Text Generation Techniques.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

On Emergent Communication in Competitive Multi-Agent Teams.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Topological Sort for Sentence Ordering.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Politeness Transfer: A Tag and Generate Approach.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Towards Debiasing Sentence Representations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
External vs. Internal: An Essay on Machine Learning Agents for Autonomous Database Management Systems.
IEEE Data Eng. Bull., 2019

Geometric Capsule Autoencoders for 3D Point Clouds.
CoRR, 2019

Enhanced Convolutional Neural Tangent Kernels.
CoRR, 2019

On Universal Approximation by Neural Networks with Uniform Guarantees on Approximation of Infinite Dimensional Maps.
CoRR, 2019

LSMI-Sinkhorn: Semi-supervised Squared-Loss Mutual Information Estimation with Optimal Transport.
CoRR, 2019

"My Way of Telling a Story": Persona based Grounded Story Generation.
CoRR, 2019

Efficient Exploration via State Marginal Matching.
CoRR, 2019

The MineRL Competition on Sample Efficient Reinforcement Learning using Human Priors.
CoRR, 2019

Concurrent Meta Reinforcement Learning.
CoRR, 2019

The Omniglot Challenge: A 3-Year Progress Report.
CoRR, 2019

Mixtape: Breaking the Softmax Bottleneck Efficiently.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

XLNet: Generalized Autoregressive Pretraining for Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Multiple Futures Prediction.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Deep Gamblers: Learning to Abstain with Portfolio Theory.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learning Data Manipulation for Augmentation and Weighting.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Search on the Replay Buffer: Bridging Planning and Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On Exact Computation with an Infinitely Wide Neural Net.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learning Neural Networks with Adaptive Regularization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Strong and Simple Baselines for Multimodal Utterance Embeddings.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Integrating Domain-Knowledge into Deep Learning.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

MineRL: A Large-Scale Dataset of Minecraft Demonstrations.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Post Selection Inference with Incomplete Maximum Mean Discrepancy Estimator.
Proceedings of the 7th International Conference on Learning Representations, 2019

AutoLoss: Learning Discrete Schedule for Alternate Optimization.
Proceedings of the 7th International Conference on Learning Representations, 2019

Learning Factorized Multimodal Representations.
Proceedings of the 7th International Conference on Learning Representations, 2019

Connecting the Dots Between MLE and RL for Sequence Generation.
Proceedings of the Deep Reinforcement Learning Meets Structured Prediction, 2019

Point Cloud GAN.
Proceedings of the Deep Generative Models for Highly Structured Data, 2019

Transformer Dissection: An Unified Understanding for Transformer's Attention via the Lens of Kernel.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Video Relationship Reasoning Using Gated Spatio-Temporal Energy Graph.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Worst Cases Policy Gradients.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Deep Neural Networks with Multi-Branch Architectures Are Intrinsically Less Non-Convex.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Multimodal Transformer for Unaligned Multimodal Language Sequences.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Learning Representations from Imperfect Time Series Data via Tensor Rank Regularization.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Transformer-XL: Attentive Language Models beyond a Fixed-Length Context.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Revisiting LSTM Networks for Semi-Supervised Text Classification via Mixed Objective Function.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Stackelberg GAN: Towards Provable Minimax Equilibrium via Multi-Generator Architectures.
CoRR, 2018

On the Complexity of Exploration in Goal-Driven Navigation.
CoRR, 2018

AutoLoss: Learning Discrete Schedules for Alternate Optimization.
CoRR, 2018

Style Transfer Through Multilingual and Feedback-Based Back-Translation.
CoRR, 2018

GLoMo: Unsupervisedly Learned Relational Graphs as Transferable Representations.
CoRR, 2018

Deep Neural Networks with Multi-Branch Architectures Are Less Non-Convex.
CoRR, 2018

How Many Samples are Needed to Learn a Convolutional Neural Network?
CoRR, 2018

"Dependency Bottleneck" in Auto-encoding Architectures: an Empirical Study.
CoRR, 2018

On Characterizing the Capacity of Neural Networks using Algebraic Topology.
CoRR, 2018

Informedia @ TRECVID 2018: Ad-hoc Video Search, Video to Text Description, Activities in Extended video.
Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

GLoMo: Unsupervised Learning of Transferable Relational Graphs.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Deep Generative Models with Learnable Knowledge Constraints.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

How Many Samples are Needed to Estimate a Convolutional Neural Network?
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Neural Models for Reasoning over Multiple Mentions Using Coreference.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Structured Control Nets for Deep Reinforcement Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

Transformation Autoregressive Networks.
Proceedings of the 35th International Conference on Machine Learning, 2018

Gated Path Planning Networks.
Proceedings of the 35th International Conference on Machine Learning, 2018

Breaking the Softmax Bottleneck: A High-Rank RNN Language Model.
Proceedings of the 6th International Conference on Learning Representations, 2018

Selecting the Best in GANs Family: a Post Selection Inference Framework.
Proceedings of the 6th International Conference on Learning Representations, 2018

Neural Map: Structured Memory for Deep Reinforcement Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018

LSTM Iteration Networks: An Exploration of Differentiable Path Finding.
Proceedings of the 6th International Conference on Learning Representations, 2018

On Unifying Deep Generative Models.
Proceedings of the 6th International Conference on Learning Representations, 2018

Active Neural Localization.
Proceedings of the 6th International Conference on Learning Representations, 2018

HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Open Domain Question Answering Using Early Fusion of Knowledge Bases and Text.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Global Pose Estimation With an Attention-Based Recurrent Network.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Investigating the Working of Text Classifiers.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

A Generic Approach for Escaping Saddle points.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

Learning Cognitive Models Using Neural Networks.
Proceedings of the Artificial Intelligence in Education - 19th International Conference, 2018

Style Transfer Through Back-Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Gated-Attention Architectures for Task-Oriented Language Grounding.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Knowledge-based Word Sense Disambiguation using Topic Models.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Discovering Order in Unordered Datasets: Generative Markov Networks.
CoRR, 2017

Improving One-Shot Learning through Fusing Side Information.
CoRR, 2017

Normalized Gradient with Adaptive Stepsize Method for Deep Neural Network Training.
CoRR, 2017

Question Answering from Unstructured Text by Retrieval and Comprehension.
CoRR, 2017

Geometry of Optimization and Implicit Regularization in Deep Learning.
CoRR, 2017

Controllable Text Generation.
CoRR, 2017

Linguistic Knowledge as Memory for Recurrent Neural Networks.
CoRR, 2017

A Comparative Study of Word Embeddings for Reading Comprehension.
CoRR, 2017

Deep Sets.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Good Semi-supervised Learning That Requires a Bad GAN.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Improved Variational Autoencoders for Text Modeling using Dilated Convolutions.
Proceedings of the 34th International Conference on Machine Learning, 2017

Toward Controlled Generation of Text.
Proceedings of the 34th International Conference on Machine Learning, 2017

Transfer Learning for Sequence Tagging with Hierarchical Recurrent Networks.
Proceedings of the 5th International Conference on Learning Representations, 2017

Words or Characters? Fine-grained Gating for Reading Comprehension.
Proceedings of the 5th International Conference on Learning Representations, 2017

On the Quantitative Analysis of Decoder-Based Generative Models.
Proceedings of the 5th International Conference on Learning Representations, 2017

Deep Determinantal Point Process for Large-Scale Multi-label Classification.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Learning Robust Visual-Semantic Embeddings.
Proceedings of the IEEE International Conference on Computer Vision, 2017

The More You Know: Using Knowledge Graphs for Image Classification.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Spatially Adaptive Computation Time for Residual Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Semi-Supervised QA with Generative Domain-Adaptive Nets.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Gated-Attention Readers for Text Comprehension.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Encode, Review, and Decode: Reviewer Module for Caption Generation.
CoRR, 2016

Multi-Task Cross-Lingual Sequence Tagging from Scratch.
CoRR, 2016

Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning.
Proceedings of the 4th International Conference on Learning Representations, 2016

Data-Dependent Path Normalization in Neural Networks.
Proceedings of the 4th International Conference on Learning Representations, 2016

Generating Images from Captions with Attention.
Proceedings of the 4th International Conference on Learning Representations, 2016

Gated-Attention Readers for Text Comprehension.
CoRR, 2016

Importance Weighted Autoencoders.
Proceedings of the 4th International Conference on Learning Representations, 2016

Architectural Complexity Measures of Recurrent Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Review Networks for Caption Generation.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

On Multiplicative Integration with Recurrent Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Stochastic Variational Deep Kernel Learning.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Path-Normalized Optimization of Recurrent Neural Networks with ReLU Activations.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Iterative Refinement of the Approximate Posterior for Directed Belief Networks.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Revisiting Semi-Supervised Learning with Graph Embeddings.
Proceedings of the 33rd International Conference on Machine Learning, 2016

Deep Neural Networks with Massive Learned Knowledge.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Deep Kernel Learning.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015
Action Recognition using Visual Attention.
CoRR, 2015

Initialization Strategies of Spatio-Temporal Convolutional Neural Networks.
CoRR, 2015

Iterative Refinement of Approximate Posterior for Training Directed Belief Networks.
CoRR, 2015

Path-SGD: Path-Normalized Optimization in Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Skip-Thought Vectors.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Learning Wake-Sleep Recurrent Attention Models.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Unsupervised Learning of Video Representations using LSTMs.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Scaling up Natural Gradient by Sparsely Factorizing the Inverse Fisher Matrix.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Predicting Deep Zero-Shot Convolutional Neural Networks Using Textual Descriptions.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

segDeepM: Exploiting segmentation and context in deep neural networks for object detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Exploiting Image-trained CNN Architectures for Unconstrained Video Classification.
Proceedings of the British Machine Vision Conference 2015, 2015

Accurate and conservative estimates of MRF log-likelihood using reverse annealing.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2014
Restricted Boltzmann machines for neuroimaging: An application in identifying intrinsic networks.
NeuroImage, 2014

Multimodal learning with deep Boltzmann machines.
J. Mach. Learn. Res., 2014

Dropout: a simple way to prevent neural networks from overfitting.
J. Mach. Learn. Res., 2014

Deep learning for neuroimaging: a validation study.
Proceedings of the 2nd International Conference on Learning Representations, 2014

Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models.
CoRR, 2014

Multi-task Neural Networks for QSAR Predictions.
CoRR, 2014

BBN VISER TRECVID 2014 Multimedia Event Detection and Multimedia Event Recounting Systems.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Learning Generative Models with Visual Attention.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

A Multiplicative Model for Learning Distributed Text-Based Attribute Representations.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Deep learning.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Multimodal Neural Language Models.
Proceedings of the 31st International Conference on Machine Learning, 2014

2013
Learning with Hierarchical-Deep Models.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Guest Editors' Introduction: Special Section on Learning Deep Architectures.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Modeling Documents with Deep Boltzmann Machines.
Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, 2013

Learning Stochastic Feedforward Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Discriminative Transfer Learning with Tree-based Priors.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

The Power of Asymmetry in Binary Hashing.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

One-shot learning by inverting a compositional causal process.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Annealing between distributions by averaging moments.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Tensor Analyzers.
Proceedings of the 30th International Conference on Machine Learning, 2013

2012
An Efficient Learning Procedure for Deep Boltzmann Machines.
Neural Comput., 2012

One-Shot Learning with a Hierarchical Nonparametric Bayesian Model.
Proceedings of the Unsupervised and Transfer Learning, 2012

Domain Adaptation: A Small Sample Statistical Approach.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Improving neural networks by preventing co-adaptation of feature detectors.
CoRR, 2012

Exploiting compositionality to explore a large space of model structures.
Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Cardinality Restricted Boltzmann Machines.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

A Better Way to Pretrain Deep Boltzmann Machines.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Matrix reconstruction with the local max norm.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Hamming Distance Metric Learning.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Deep Lambertian Networks.
Proceedings of the 29th International Conference on Machine Learning, 2012

Deep Mixtures of Factor Analysers.
Proceedings of the 29th International Conference on Machine Learning, 2012

Resource configurable spoken query detection using Deep Boltzmann Machines.
Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

Robust Boltzmann Machines for recognition and denoising.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Concept learning as motor program induction: A large-scale empirical study.
Proceedings of the 34th Annual Meeting of the Cognitive Science Society, 2012

2011
Discovering Binary Codes for Documents by Learning Deep Generative Models.
Top. Cogn. Sci., 2011

Domain Adaptation: Overfitting and Small Sample Statistics.
CoRR, 2011

Learning to Learn with Compound HD Models.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Transfer Learning by Borrowing Examples for Multiclass Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Learning with the weighted trace-norm under arbitrary sampling distributions.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Learning to share visual appearance for multiclass object detection.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

One shot learning of simple visual concepts.
Proceedings of the 33rd Annual Meeting of the Cognitive Science Society, 2011

2010
Learning Deep Generative Models.
PhD thesis, 2010

Efficient Learning of Deep Boltzmann Machines.
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Collaborative Filtering in a Non-Uniform World: Learning with the Weighted Trace Norm.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Practical Large-Scale Optimization for Max-norm Regularization.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Learning Deep Boltzmann Machines using Adaptive MCMC.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

2009
Deep Boltzmann Machines.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

Semantic hashing.
Int. J. Approx. Reason., 2009

Modelling Relational Data using Bayesian Clustered Tensor Factorization.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Replicated Softmax: an Undirected Topic Model.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Learning in Markov Random Fields using Tempered Transitions.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Workshop summary: Workshop on learning feature hierarchies.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Evaluation methods for topic models.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Learning nonlinear dynamic models.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

2008
Evaluating probabilities under high-dimensional latent variable models.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Bayesian probabilistic matrix factorization using Markov chain Monte Carlo.
Proceedings of the 25th International Conference on Machine Learning, 2008

On the quantitative analysis of deep belief networks.
Proceedings of the 25th International Conference on Machine Learning, 2008

2007
Learning a Nonlinear Embedding by Preserving Class Neighbourhood Structure.
Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics, 2007

Probabilistic Matrix Factorization.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Using Deep Belief Nets to Learn Covariance Kernels for Gaussian Processes.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Restricted Boltzmann machines for collaborative filtering.
Proceedings of the 24th International Conference on Machine Learning, 2007

2004
Neighbourhood Components Analysis.
Proceedings of the Advances in Neural Information Processing Systems 17, 2004

Semi-Supervised Mixture-of-Experts Classification.
Proceedings of the 4th IEEE International Conference on Data Mining (ICDM 2004), 2004

2003
On the Convergence of Bound Optimization Algorithms.
Proceedings of the UAI '03, 2003

Optimization with EM and Expectation-Conjugate-Gradient.
Proceedings of the 20th International Conference on Machine Learning, 2003

Adaptive Overrelaxed Bound Optimization Methods.
Proceedings of the 20th International Conference on Machine Learning, 2003

Simultaneous Localization and Surveying with Multiple Agents.
Proceedings of the Switching and Learning in Feedback Systems, 2003

