Fabian Caba Heilbron

Orcid: 0000-0002-3129-1985

According to our database, Fabian Caba Heilbron authored at least 42 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Multi-modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

2023
Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Localizing Moments in Long Video Via Multimodal Guidance.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Long-range Multimodal Pretraining for Movie Understanding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Meta-Personalizing Vision-Language Models to Find Named Instances in Video.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PIVOT: Prompting for Video Continual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

OWL (Observe, Watch, Listen): Audiovisual Temporal Context for Localizing Actions in Egocentric Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Videogenic: Video Highlights via Photogenic Moments.
CoRR, 2022

VideoMap: Video Editing in Latent Space.
CoRR, 2022

Video-ReTime: Learning Temporally Varying Speediness for Time Remapping.
CoRR, 2022

OWL (Observe, Watch, Listen): Localizing Actions in Egocentric Video via Audiovisual Temporal Context.
CoRR, 2022

Transcript to Video: Efficient Clip Sequencing from Texts.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MovieCuts: A New Dataset and Benchmark for Cut Type Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing.
Proceedings of the Computer Vision - ECCV 2022, 2022

vCLIMB: A Novel Video Class Incremental Learning Benchmark.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FitCLIP: Refining Large-Scale Pretrained Image-Text Models for Zero-Shot Video Understanding Tasks.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Real-Time Semantic Segmentation With Fast Attention.
IEEE Robotics Autom. Lett., 2021

RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Learning Where to Cut from Edited Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Learning to Cut by Watching Movies.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

MAAS: Multi-modal Assignation for Active Speaker Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

APES: Audiovisual Person Search in Untrimmed Video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020
Rethinking Online Action Detection in Untrimmed Videos: A Novel Online Evaluation Protocol.
IEEE Access, 2020

Temporally Distributed Networks for Fast Video Semantic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Active Speakers in Context.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization.
CoRR, 2019

Clothing Recognition in the Wild using the Amazon Catalog.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

The Instantaneous Accuracy: a Novel Metric for the Problem of Online Human Behaviour Recognition in Untrimmed Videos.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2018
The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary.
CoRR, 2018

What Do I Annotate Next? An Empirical Study of Active Learning for Action Localization.
Proceedings of the Computer Vision - ECCV 2018, 2018

Action Search: Spotting Actions in Videos and Its Application to Temporal Action Localization.
Proceedings of the Computer Vision - ECCV 2018, 2018

Diagnosing Error in Temporal Action Detectors.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
ActivityNet Challenge 2017 Summary.
CoRR, 2017

Action Search: Learning to Search for Human Activities in Untrimmed Videos.
CoRR, 2017

SCC: Semantic Context Cascade for Efficient Action Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
DAPs: Deep Action Proposals for Action Understanding.
Proceedings of the Computer Vision - ECCV 2016, 2016

Fast Temporal Activity Proposals for Efficient Detection of Human Actions in Untrimmed Videos.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
ActivityNet: A large-scale video benchmark for human activity understanding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Robust Manhattan Frame estimation from a single RGB-D image.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Collecting and Annotating Human Activities in Web Videos.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Camera Motion and Surrounding Scene Appearance as Context for Action Recognition.
Proceedings of the Computer Vision - ACCV 2014, 2014

