Juan-Manuel Pérez-Rúa

Orcid: 0000-0002-5559-9626

Affiliations:
  • Facebook AI, UK


According to our database1, Juan-Manuel Pérez-Rúa authored at least 30 papers between 2015 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Hyper-VolTran: Fast and Generalizable One-Shot Image to 3D Object Structure via HyperNetworks.
CoRR, 2023

GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation.
CoRR, 2023

FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing.
CoRR, 2023

Boundary-Denoising for Video Activity Localization.
CoRR, 2023

Where is my Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation.
CoRR, 2022

Negative Frames Matter in Egocentric Visual Query 2D Localization.
CoRR, 2022

2021
SAIC_Cambridge-HuPBA-FBK Submission to the EPIC-Kitchens-100 Action Recognition Challenge 2021.
CoRR, 2021

Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization.
CoRR, 2021

Low-Fidelity Video Encoder Optimization for Temporal Action Localization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Space-time Mixing Attention for Video Transformer.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Boundary-sensitive Pre-training for Temporal Localization in Videos.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Few-shot Action Recognition with Prototype-centered Attentive Learning.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

TNT: Text-Conditioned Network with Transductive Inference for Few-Shot Video Classification.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Knowing What, Where and When to Look: Video Action modelling with Attention.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
ROAM: A Rich Object Appearance Model with Application to Rotoscoping.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Egocentric Action Recognition by Video Attention and Temporal Context.
CoRR, 2020

Knowing What, Where and When to Look: Efficient Video Action Modeling with Attention.
CoRR, 2020

Incremental Few-Shot Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
MFAS: Multimodal Fusion Architecture Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Learning how to be robust: Deep polynomial regression.
CoRR, 2018

Efficient Progressive Neural Architecture Search.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
Hierarchical motion-based video analysis with applications to video post-production. (Analyse de vidéo par décomposition hiérarchique du mouvement appliquée à la post-production vidéo).
PhD thesis, 2017

Detection and Localization of Anomalous Motion in Video Sequences from Local Histograms of Labeled Affine Flows.
Frontiers ICT, 2017

ROAM: A Rich Object Appearance Model with Application to Rotoscoping.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Object-guided motion estimation.
Comput. Vis. Image Underst., 2016

Hierarchical motion decomposition for dynamic scene parsing.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Determining Occlusions from Space and Time Image Reconstructions.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Discovering motion hierarchies via tree-structured coding of trajectories.
Proceedings of the British Machine Vision Conference 2016, 2016

2015
Background-foreground tracking for video object segmentation.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015


  Loading...