Yi Yang

Affiliations:
  • Google DeepMind, London, UK
  • Baidu Research, Institute of Deep Learning, Sunnyvale, CA, USA
  • University of California Irvine, CA, USA (PhD 2013)


According to our database, Yi Yang authored at least 37 papers between 2010 and 2024.

Bibliography

2024
BootsTAP: Bootstrapped Training for Tracking-Any-Point.
CoRR, 2024

2023
Learning from One Continuous Video Stream.
CoRR, 2023

RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation.
CoRR, 2023

Perception Test: A Diagnostic Benchmark for Multimodal Video Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
TAP-Vid: A Benchmark for Tracking Any Point in a Video.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2020
Large-scale multilingual audio visual dubbing.
CoRR, 2020

2019
Feedback Convolutional Neural Network for Visual Localization and Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

A Refined 3D Pose Dataset for Fine-Grained Object Categories.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Recognizing Part Attributes With Insufficient Data.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

UnOS: Unified Unsupervised Optical-Flow and Stereo-Depth Estimation by Watching Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Depth-Based Hand Pose Estimation: Methods, Data, and Challenges.
Int. J. Comput. Vis., 2018

Zero-Shot Transfer VQA Dataset.
CoRR, 2018

Improving Annotation for 3D Pose Dataset of Fine-Grained Object Categories.
CoRR, 2018

Joint Unsupervised Learning of Optical Flow and Depth by Watching Stereo Videos.
CoRR, 2018

3D Pose Estimation for Fine-Grained Object Categories.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Occlusion Aware Unsupervised Learning of Optical Flow.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Occlusion Aware Unsupervised Learning of Optical Flow.
CoRR, 2017

Unsupervised Learning Layers for Video Analysis.
CoRR, 2017

Dynamic Computational Time for Visual Attention.
CoRR, 2017

Dynamic Computational Time for Visual Attention.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

2016
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

CNN-RNN: A Unified Framework for Multi-label Image Classification.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Attention to Scale: Scale-Aware Semantic Image Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN).
Proceedings of the 3rd International Conference on Learning Representations, 2015

Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images.
CoRR, 2015

Depth-Based Hand Pose Estimation: Data, Methods, and Challenges.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Learning Like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Look and Think Twice: Capturing Top-Down Visual Attention with Feedback Convolutional Neural Networks.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
Explain Images with Multimodal Recurrent Neural Networks.
CoRR, 2014

AutoCaption: Automatic caption generation for personal photos.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Parsing Occluded People.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Articulated Human Detection with Flexible Mixtures of Parts.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

2012
Layered Object Models for Image Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Recognizing proxemics in personal photos.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Articulated pose estimation with flexible mixtures-of-parts.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Layered object detection for multi-class segmentation.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010
