Fabian Caba Heilbron

Orcid: 0000-0002-3129-1985

According to our database, Fabian Caba Heilbron authored at least 56 papers between 2014 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

VRAgent: Self-Refining Agent for Zero-Shot Multimodal Video Retrieval.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition.

Quynh Phung

Long Mai

Fabian David Caba Heilbron

Feng Liu

Jia-Bin Huang

Cusuh Ham

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

2025

EditDuet: A Multi-Agent System for Video Non-Linear Editing.

Marcelo Sandoval-Castañeda

Bryan C. Russell

Josef Sivic

Gregory Shakhnarovich

Fabian Caba Heilbron

Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

ResidualViT for Efficient Temporally Dense Video Encoding.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Discovering Divergent Representations Between Text-To-Image Models.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Improving Personalized Search with Regularized Low-Rank Parameter Updates.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Generative Timelines for Instructed Visual Assembly.

CoRR, 2024

Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval.

Jiacheng Cheng

Hijung Valentina Shin

Nuno Vasconcelos

Bryan C. Russell

Fabian Caba Heilbron

CoRR, 2024

Multi-modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation.

Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets.

Ishan Rajendrakumar Dave

Fabian Caba Heilbron

Mubarak Shah

Simon Jenni

Proceedings of the Computer Vision - ECCV 2024, 2024

Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Automated Movie Trailer Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Scaling Up Video Summarization Pretraining with Large Language Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Videogenic: Identifying Highlight Moments in Videos with Professional Photographs as a Prior.

Proceedings of the 16th Conference on Creativity & Cognition, 2024

VideoMap: Supporting Video Exploration, Brainstorming, and Prototyping in the Latent Space.

Proceedings of the 16th Conference on Creativity & Cognition, 2024

2023

Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Localizing Moments in Long Video Via Multimodal Guidance.

Wayner Barrios

Mattia Soldan

Alberto Mario Ceballos-Arroyo

Fabian Caba Heilbron

Bernard Ghanem

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Long-range Multimodal Pretraining for Movie Understanding.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Meta-Personalizing Vision-Language Models to Find Named Instances in Video.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PIVOT: Prompting for Video Continual Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

OWL (Observe, Watch, Listen): Audiovisual Temporal Context for Localizing Actions in Egocentric Videos.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Videogenic: Video Highlights via Photogenic Moments.

CoRR, 2022

VideoMap: Video Editing in Latent Space.

CoRR, 2022

Video-ReTime: Learning Temporally Varying Speediness for Time Remapping.

Simon Jenni

Markus Woodson

Fabian Caba Heilbron

CoRR, 2022

OWL (Observe, Watch, Listen): Localizing Actions in Egocentric Video via Audiovisual Temporal Context.

CoRR, 2022

Transcript to Video: Efficient Clip Sequencing from Texts.

Yu Xiong

Fabian Caba Heilbron

Dahua Lin

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MovieCuts: A New Dataset and Benchmark for Cut Type Recognition.

Proceedings of the Computer Vision - ECCV 2022, 2022

The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing.

Proceedings of the Computer Vision - ECCV 2022, 2022

vCLIMB: A Novel Video Class Incremental Learning Benchmark.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FitCLIP: Refining Large-Scale Pretrained Image-Text Models for Zero-Shot Video Understanding Tasks.

Santiago Castro

Fabian Caba

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021

Real-Time Semantic Segmentation With Fast Attention.

IEEE Robotics Autom. Lett., 2021

RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Learning Where to Cut from Edited Videos.

Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Learning to Cut by Watching Movies.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

MAAS: Multi-modal Assignation for Active Speaker Detection.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

APES: Audiovisual Person Search in Untrimmed Video.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020

Rethinking Online Action Detection in Untrimmed Videos: A Novel Online Evaluation Protocol.

Marcos Baptista-Ríos

Roberto Javier López-Sastre

Fabian Caba Heilbron

Jan C. van Gemert

Francisco Javier Acevedo-Rodríguez

Saturnino Maldonado-Bascón

IEEE Access, 2020

Temporally Distributed Networks for Fast Video Semantic Segmentation.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Active Speakers in Context.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization.

CoRR, 2019

Clothing Recognition in the Wild using the Amazon Catalog.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

The Instantaneous Accuracy: a Novel Metric for the Problem of Online Human Behaviour Recognition in Untrimmed Videos.

Marcos Baptista-Ríos

Roberto Javier López-Sastre

Fabian Caba Heilbron

Jan van Gemert

Francisco Javier Acevedo-Rodríguez

Saturnino Maldonado-Bascón

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2018

The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary.

CoRR, 2018

What Do I Annotate Next? An Empirical Study of Active Learning for Action Localization.

Proceedings of the Computer Vision - ECCV 2018, 2018

Action Search: Spotting Actions in Videos and Its Application to Temporal Action Localization.

Humam Alwassel

Fabian Caba Heilbron

Bernard Ghanem

Proceedings of the Computer Vision - ECCV 2018, 2018

Diagnosing Error in Temporal Action Detectors.

Proceedings of the Computer Vision - ECCV 2018, 2018

2017

ActivityNet Challenge 2017 Summary.

CoRR, 2017

Action Search: Learning to Search for Human Activities in Untrimmed Videos.

Humam Alwassel

Fabian Caba Heilbron

Bernard Ghanem

CoRR, 2017

SCC: Semantic Context Cascade for Efficient Action Detection.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

DAPs: Deep Action Proposals for Action Understanding.

Proceedings of the Computer Vision - ECCV 2016, 2016

Fast Temporal Activity Proposals for Efficient Detection of Human Actions in Untrimmed Videos.

Fabian Caba Heilbron

Juan Carlos Niebles

Bernard Ghanem

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

ActivityNet: A large-scale video benchmark for human activity understanding.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Robust Manhattan Frame estimation from a single RGB-D image.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014

Collecting and Annotating Human Activities in Web Videos.

Fabian Caba Heilbron

Juan Carlos Niebles

Proceedings of the International Conference on Multimedia Retrieval, 2014

Camera Motion and Surrounding Scene Appearance as Context for Action Recognition.

Proceedings of the Computer Vision - ACCV 2014, 2014
