Wei Xu

CoRR, 2019

UnOS: Unified Unsupervised Optical-Flow and Stereo-Depth Estimation by Watching Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Zero-Shot Transfer VQA Dataset.

[BibT_eX]

[DOI]

CoRR, 2018

Joint Unsupervised Learning of Optical Flow and Depth by Watching Stereo Videos.

[BibT_eX]

[DOI]

CoRR, 2018

Tracklet Association Tracker: An End-to-End Learning-based Association Approach for Multi-Object Tracking.

[BibT_eX]

[DOI]

CoRR, 2018

DeepTransport: Learning Spatial-Temporal Dependency for Traffic Condition Forecasting.

[BibT_eX]

[DOI]

Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Interactive Grounded Language Acquisition and Generalization in a 2D World.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

Every Pixel Counts: Unsupervised Geometry Learning with Holistic 3D Motion Understanding.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

LEGO: Learning Edge With Geometry All at Once by Watching Videos.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

DeLS-3D: Deep Localization and Segmentation With a 3D Semantic Map.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Occlusion Aware Unsupervised Learning of Optical Flow.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Guided Feature Transformation (GFT): A Neural Language Grounding Module for Embodied Agents.

[BibT_eX]

[DOI]

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Interactive Language Acquisition with One-shot Visual Concept Learning through a Conversational Game.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Unsupervised Learning of Geometry From Videos With Edge-Aware Depth-Normal Consistency.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

State Tracking Networks for Dialog State Tracking.

[BibT_eX]

[DOI]

Proceedings of the Workshops of the The Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Occlusion Aware Unsupervised Learning of Optical Flow.

[BibT_eX]

[DOI]

CoRR, 2017

Unsupervised Learning of Geometry with Edge-aware Depth-Normal Consistency.

[BibT_eX]

[DOI]

CoRR, 2017

Unsupervised Learning Layers for Video Analysis.

[BibT_eX]

[DOI]

CoRR, 2017

Listen, Interact and Talk: Learning to Speak via Interaction.

[BibT_eX]

[DOI]

CoRR, 2017

A Deep Compositional Framework for Human-like Language Acquisition in Virtual Environment.

[BibT_eX]

[DOI]

CoRR, 2017

Dynamic Computational Time for Visual Attention.

[BibT_eX]

[DOI]

CoRR, 2017

Optimal switching for linear quadratic problem of switched systems in discrete time.

[BibT_eX]

[DOI]

Autom., 2017

Dynamic Computational Time for Visual Attention.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

2016

Deep Recurrent Models with Fast-Forward Connections for Neural Machine Translation.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2016

Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

CNN-RNN: A Unified Framework for Multi-label Image Classification.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Attention to Scale: Scale-Aware Semantic Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

CFO: Conditional Focused Neural Question Answering with Large-scale Knowledge Bases.

[BibT_eX]

[DOI]

Zihang Dai

Lei Li

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015

Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN).

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Learning Representations, 2015

Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images.

[BibT_eX]

[DOI]

CoRR, 2015

Bidirectional LSTM-CRF Models for Sequence Tagging.

[BibT_eX]

[DOI]

Zhiheng Huang

Kai Yu

CoRR, 2015

Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering.

[BibT_eX]

[DOI]

CoRR, 2015

ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering.

[BibT_eX]

[DOI]

CoRR, 2015

Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Look and Think Twice: Capturing Top-Down Visual Attention with Feedback Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

End-to-end learning of semantic role labeling using recurrent neural networks.

[BibT_eX]

[DOI]

Jie Zhou

Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014

Explain Images with Multimodal Recurrent Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2014

2008

Fast exact maximum likelihood estimation for mixture of language model.

[BibT_eX]

[DOI]

Yi Zhang

Inf. Process. Manag., 2008

2007

Fast exact maximum likelihood estimation for mixture of language models.

[BibT_eX]

[DOI]

Yi Zhang