Yusuf Aytar

According to our database, Yusuf Aytar authored at least 50 papers between 2007 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

Motion Prompting: Controlling Video Generation with Motion Trajectories.

Tatiana Lopez-Guevara

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation.

Trans. Mach. Learn. Res., 2024

Motion Prompting: Controlling Video Generation with Motion Trajectories.

Tatiana Lopez-Guevara

CoRR, 2024

A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos.

CoRR, 2024

OVR: A Dataset for Open Vocabulary Temporal Repetition Counting in Videos.

CoRR, 2024

FlexCap: Generating Rich, Localized, and Flexible Captions in Images.

CoRR, 2024

Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models.

Sjoerd van Steenkiste

Kelsey R. Allen

Thomas Kipf

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

FlexCap: Describe Anything in Images in Controllable Detail.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Genie: Generative Interactive Environments.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Learning from One Continuous Video Stream.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation.

CoRR, 2023

Perception Test: A Diagnostic Benchmark for Multimodal Video Models.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

TAP-Vid: A Benchmark for Tracking Any Point in a Video.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning transferable motor skills with hierarchical latent mixture policies.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation.

Todor Davchev

Oleg Olegovich Sushkov

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images.

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Manipulator-Independent Representations for Visual Imitation.

Yuxiang Zhou

Yusuf Aytar

Konstantinos Bousmalis

Proceedings of the Robotics: Science and Systems XVII, Virtual Event, July 12-16, 2021, 2021

With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Semi-supervised reward learning for offline reinforcement learning.

CoRR, 2020

Offline Learning from Demonstrations and Unlabeled Experience.

CoRR, 2020

Large-scale multilingual audio visual dubbing.

CoRR, 2020

Scaling data-driven robotics with reward sketching and batch reinforcement learning.

Serkan Cabi

Sergio Gómez Colmenarejo

Proceedings of the Robotics: Science and Systems XVI, 2020

Self-Supervised Sim-to-Real Adaptation for Visual Robotic Manipulation.

Konstantinos Bousmalis

Francesco Nori

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Counting Out Time: Class Agnostic Video Repetition Counting in the Wild.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning rich touch representations through cross-modal self-supervision.

Proceedings of the 4th Conference on Robot Learning, 2020

2019

A Framework for Data-Driven Robotics.

Serkan Cabi

Sergio Gómez Colmenarejo

CoRR, 2019

Temporal Cycle-Consistency Learning.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Cross-Modal Scene Networks.

IEEE Trans. Pattern Anal. Mach. Intell., 2018

One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL.

Tom Le Paine

Sergio Gómez Colmenarejo

CoRR, 2018

Playing hard exploration games by watching YouTube.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017

See, Hear, and Read: Deep Aligned Representations.

Yusuf Aytar

Carl Vondrick

Antonio Torralba

CoRR, 2017

Is Saki #delicious?: The Food Perception Gap on Instagram and Its Relation to Health.

Proceedings of the 26th International Conference on World Wide Web, 2017

Face-to-BMI: Using Computer Vision to Infer Body Mass Index on Social Media.

Proceedings of the Eleventh International Conference on Web and Social Media, 2017

Exploiting Convolution Filter Patterns for Transfer Learning.

Mehmet Aygun

Yusuf Aytar

Hazim Kemal Ekenel

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Learning Cross-Modal Embeddings for Cooking Recipes and Food Images.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

SoundNet: Learning Sound Representations from Unlabeled Video.

Yusuf Aytar

Carl Vondrick

Antonio Torralba

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Learning Aligned Cross-Modal Representations from Weakly Aligned Data.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

How Transferable Are CNN-Based Features for Age and Gender Classification?

Gökhan Özbulak

Yusuf Aytar

Hazim Kemal Ekenel

Proceedings of the 2016 International Conference of the Biometrics Special Interest Group, 2016

2015

Part level transfer regularization for enhancing exemplar SVMs.

Yusuf Aytar

Andrew Zisserman

Comput. Vis. Image Underst., 2015

2014

Transfer learning for object category detection.

Yusuf Aytar

PhD thesis, 2014

Multi-Task Multi-Sample Learning.

Yusuf Aytar

Andrew Zisserman

Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

Immediate, Scalable Object Category Detection.

Yusuf Aytar

Andrew Zisserman

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2012

Enhancing Exemplar SVMs using Part Level Transfer Regularization.

Yusuf Aytar

Andrew Zisserman

Proceedings of the British Machine Vision Conference, 2012

2011

Tabula rasa: Model transfer for object category detection.

Yusuf Aytar

Andrew Zisserman

Proceedings of the IEEE International Conference on Computer Vision, 2011

2008

Utilizing semantic word similarity measures for video retrieval.

Yusuf Aytar

Mubarak Shah

Jiebo Luo

Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007

University of Central Florida at TRECVID 2007 Semantic Video Classification and Automatic Search.

Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Improving Semantic Concept Detection and Retrieval using Contextual Estimates.

Yusuf Aytar

Omer Bilal Orhan

Mubarak Shah

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
