Oswald Lanz

Adrian Bulat

Juan-Manuel Pérez-Rúa

Georgios Tzimiropoulos

CoRR, 2021

Higher Order Recurrent Space-Time Transformer.

[BibT_eX]

[DOI]

CoRR, 2021

2020

A Spatio-Temporal Multi-Scale Binary Descriptor.

[BibT_eX]

[DOI]

Alessio Xompero

IEEE Trans. Image Process., 2020

FBK-HUPBA Submission to the EPIC-Kitchens Action Recognition 2020 Challenge.

[BibT_eX]

[DOI]

CoRR, 2020

Data Augmentation Techniques for the Video Question Answering Task.

[BibT_eX]

[DOI]

Alex Falcon

Giuseppe Serra

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Gate-Shift Networks for Video Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Novel-View Human Action Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019

Multi-Speaker Tracking From an Audio-Visual Sensing Device.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2019

An Analysis of Deep Neural Networks with Attention for Action Recognition from a Neurophysiological Perspective.

[BibT_eX]

[DOI]

CoRR, 2019

FBK-HUPBA Submission to the EPIC-Kitchens 2019 Action Recognition Challenge.

[BibT_eX]

[DOI]

CoRR, 2019

Hierarchical Feature Aggregation Networks for Video Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2019

Learnable Masks for Pose-Guided View Synthesis.

[BibT_eX]

[DOI]

Mohamed Ilyes Lakhal

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

View-LSTM: Novel-View Video Synthesis Through View Decomposition.

[BibT_eX]

[DOI]

Mohamed Ilyes Lakhal

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Accurate Target Annotation in 3D from Multimodal Streams.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

LSTA: Long Short-Term Attention for Egocentric Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Joint Estimation of Human Pose and Conversational Groups from Social Scenes.

[BibT_eX]

[DOI]

Jagannadan Varadarajan

Int. J. Comput. Vis., 2018

MORB: A Multi-Scale Binary Descriptor.

[BibT_eX]

[DOI]

Alessio Xompero

Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

3D Mouth Tracking from a Compact Microphone Array Co-Located with a camera.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multi-Camera Matching of Spatio-Temporal Binary Features.

[BibT_eX]

[DOI]

Alessio Xompero

Proceedings of the 21st International Conference on Information Fusion, 2018

Pose Guided Human Image Synthesis by View Disentanglement and Enhanced Weighting Loss.

[BibT_eX]

[DOI]

Mohamed Ilyes Lakhal

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Residual Stacked RNNs for Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Attention is All We Need: Nailing Down Object-centric Attention for Egocentric Activity Recognition.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2018, 2018

Top-Down Attention Recurrent VLAD Encoding for Action Recognition in Videos.

[BibT_eX]

[DOI]

Proceedings of the AI*IA 2018 - Advances in Artificial Intelligence, 2018

2017

An automatic image-to-DEM alignment approach for annotating mountains pictures on a smartphone.

[BibT_eX]

[DOI]

Mach. Vis. Appl., 2017

Convolutional Long Short-Term Memory Networks for Recognizing First Person Interactions.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Learning to detect violent videos using convolutional long short-term memory.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2017

SALSA: A Multimodal Dataset for the Automated Analysis of Free-Standing Social Interactions.

[BibT_eX]

[DOI]

Xavier Alameda-Pineda

Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

Exploring Multitask and Transfer Learning Algorithms for Head Pose Estimation in Dynamic Multiview Scenarios.

[BibT_eX]

[DOI]

Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

2016

A Multi-Task Learning Framework for Head Pose Estimation under Target Motion.

[BibT_eX]

[DOI]

Gaowen Liu

IEEE Trans. Pattern Anal. Mach. Intell., 2016

SALSA: A Novel Dataset for Multimodal Group Behavior Analysis.

[BibT_eX]

[DOI]

Xavier Alameda-Pineda

Jacopo Staiano

IEEE Trans. Pattern Anal. Mach. Intell., 2016

2015

Dynamic task decomposition for decentralized object tracking in complex scenes.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2015

Jointly Estimating Interactions and Head, Body Pose of Interactors from Distant Social Scenes.

[BibT_eX]

[DOI]

Jagannadan Varadarajan

Stefan Winkler

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Analyzing Free-standing Conversational Groups: A Multimodal Approach.

[BibT_eX]

[DOI]

Xavier Alameda-Pineda

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Uncovering Interactions and Interactors: Joint Estimation of Head, Body Orientation and F-Formations from Surveillance Videos.

[BibT_eX]

[DOI]

Jagannadan Varadarajan

Samuel Rota Bulò

Narendra Ahuja

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014

Exploring Transfer Learning Approaches for Head Pose Classification from Multi-view Surveillance Images.

[BibT_eX]

[DOI]

Kalpathi Ramakrishnan

Int. J. Comput. Vis., 2014

Evaluating Multi-task Learning for Multi-view Head-Pose Classification in Interactive Environments.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Dynamic Task Decomposition for Probabilistic Tracking in Complex Scenes.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Exploiting Color Constancy for Robust Tracking Under Non-uniform Illumination.

[BibT_eX]

[DOI]

Sinan Mutlu

Samuel Rota Bulò

Proceedings of the Image Analysis and Recognition - 11th International Conference, 2014

Learning Contours for Automatic Annotations of Mountains Pictures on a Smartphone.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Distributed Smart Cameras, 2014

Wide-area Multi-camera Multi-object Tracking with Dynamic Task Decomposition.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Distributed Smart Cameras, 2014

Personalizing a smartwatch-based gesture interface with transfer learning.

[BibT_eX]

[DOI]

Proceedings of the 22nd European Signal Processing Conference, 2014

2013

On the relationship between head pose, social attention and personality prediction for unstructured and dynamic group interactions.

[BibT_eX]

[DOI]

Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Multi-scale f-formation discovery for group detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2013

Learning the Scene Illumination for Color-Based People Tracking in Dynamic Environment.

[BibT_eX]

[DOI]

Sinan Mutlu

Proceedings of the Image Analysis and Processing - ICIAP 2013, 2013

Multicamera People Tracking Using a Locus-based Probabilistic Occupancy Map.

[BibT_eX]

[DOI]

Sinan Mutlu

Proceedings of the Image Analysis and Processing - ICIAP 2013, 2013

No Matter Where You Are: Flexible Graph-Guided Multi-task Learning for Multi-view Head Pose Classification under Target Motion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2013

2012

Active transfer learning for multi-view head-pose classification.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Pattern Recognition, 2012

Boosting-based transfer learning for multi-view head-pose classification from surveillance videos.

[BibT_eX]

[DOI]

Kalpathi Ramakrishnan

Proceedings of the 20th European Signal Processing Conference, 2012

An Adaptation Framework for Head-Pose Classification in Dynamic Multi-view Scenarios.

[BibT_eX]

[DOI]

Kalpathi Ramakrishnan

Proceedings of the Computer Vision, 2012

2011

Dynamic resource allocation for probabilistic tracking via attentive sensing and sampling.

[BibT_eX]

[DOI]

Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

2010

Space speaks: towards socially and personality aware visual surveillance.

[BibT_eX]

[DOI]

Proceedings of the 1st ACM international workshop on Multimodal pervasive video analysis, 2010

BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do.

[BibT_eX]

[DOI]

Alexandros Potamianos

Hinrich Schütze

Sabine Schulte im Walde

Luca Surian

Proceedings of the International Conference on Language Resources and Evaluation, 2010

Tracking Multiple People with Illumination Maps.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Pattern Recognition, 2010

A joint particle filter to track the position and head orientation of people using audio visual cues.

[BibT_eX]

[DOI]

Alessio Brutti

Proceedings of the 18th European Signal Processing Conference, 2010

Computers in the Human Interaction Loop.

[BibT_eX]

[DOI]

Proceedings of the Handbook of Ambient Intelligence and Smart Environments, 2010

2009

Estimation of Head Pose.

[BibT_eX]

[DOI]

Michael Voit

Nicolas Gourier

Cristian Canton-Ferrer

Rainer Stiefelhagen

Aristodemos Pnevmatikakis

Proceedings of the Computers in the Human Interaction Loop, 2009

Extracting Interaction Cues: Focus of Attention, Body Pose, and Gestures.

[BibT_eX]

[DOI]

Proceedings of the Computers in the Human Interaction Loop, 2009

Person Tracking.

[BibT_eX]

[DOI]

Keni Bernardin

Rainer Stiefelhagen

Proceedings of the Computers in the Human Interaction Loop, 2009

Multimodal Classification of Activities of Daily Living Inside Smart Homes.

[BibT_eX]

[DOI]

Proceedings of the Distributed Computing, 2009

A HJS filter to track visually interacting targets.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

A Sampling Algorithm for Occlusion Robust Multi Target Detection.

[BibT_eX]

[DOI]

Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

2008

Optimised Meeting Recording and Annotation Using Real-Time Video Analysis.

[BibT_eX]

[DOI]

Paul Chippendale

Proceedings of the Machine Learning for Multimodal Interaction, 5th International Workshop, 2008

2007

Tracking Visitors in a Museum.

[BibT_eX]

[DOI]

Proceedings of the PEACH - Intelligent Interfaces for Museum Visits, 2007

An information theoretic rule for sample size adaptation in particle filtering.

[BibT_eX]

[DOI]

Proceedings of the 14th International Conference on Image Analysis and Processing (ICIAP 2007), 2007

An Appearance-Based Particle Filter for Visual Tracking in Smart Rooms.

[BibT_eX]

[DOI]

Paul Chippendale

Proceedings of the Multimodal Technologies for Perception of Humans, 2007

Joint Bayesian Tracking of Head Location and Pose from Low-Resolution Video.

[BibT_eX]

[DOI]

Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006

Approximate Bayesian Multibody Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2006

Dynamic Head Location and Pose from Video.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, 2006

A Generative Approach to Audio-Visual Person Tracking.

[BibT_eX]

[DOI]

Proceedings of the Multimodal Technologies for Perception of Humans, 2006

2005

Hybrid Joint-Separable Multibody Tracking.

[BibT_eX]

[DOI]

Roberto Manduchi

Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004

Occlusion robust Tracking Ofmultiple Objects.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Computer Vision and Graphics, 2004

Automatic Lens distortion estimation for an Active Camera.

[BibT_eX]

[DOI]