We stand with Ukraine

We stand with Ukraine

Xavier Alameda-Pineda

Orcid: 0000-0002-5354-1084

According to our database¹, Xavier Alameda-Pineda authored at least 136 papers between 2008 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

Online presence:

on orcid.org
on openreview.net

On csauthors.net:

Bibliography

2026

Describe-Then-Act: Proactive Agent Steering via Distilled Language-Action World Models.

[DOI]

Massimiliano Pappa

,

,

Valentino Sacco

,

,

Stéphane Lathuilière

,

,

Xavier Alameda-Pineda

,

CoRR, March, 2026

The Equalizer: Introducing Shape-Gain Decomposition in Neural Audio Codecs.

[DOI]

,

,

Xavier Alameda-Pineda

CoRR, February, 2026

Residual Tokens Enhance Masked Autoencoders for Speech Modeling.

[DOI]

,

Stéphane Lathuilière

,

Xavier Alameda-Pineda

CoRR, January, 2026

Diffusion-based Frameworks for Unsupervised Speech Enhancement.

[DOI]

Jean-Eudes Ayilo

,

Mostafa Sadeghi

,

,

Xavier Alameda-Pineda

CoRR, January, 2026

OpenSocInt: A Multi-modal Training Environment for Human-Aware Social Navigation.

[DOI]

,

,

,

Xavier Alameda-Pineda

CoRR, January, 2026

2025

Socially Pertinent Robots in Gerontological Healthcare.

[DOI]

Int. J. Soc. Robotics, December, 2025

Layover or Direct Flight: Rethinking Audio-Guided Image Segmentation.

[DOI]

Joel Alberto Santos

,

,

Xavier Alameda-Pineda

,

CoRR, November, 2025

Modeling strategies for speech enhancement in the latent space of a neural audio codec.

[DOI]

Sofiene Kammoun

,

Xavier Alameda-Pineda

,

CoRR, October, 2025

Autoregressive GAN for Semantic Unconditional Head Motion Generation.

[DOI]

,

Xavier Alameda-Pineda

,

Stéphane Lathuilière

,

Dominique Vaufreydaz

ACM Trans. Multim. Comput. Commun. Appl., January, 2025

Posterior Transition Modeling for Unsupervised Diffusion-Based Speech Enhancement.

[DOI]

Mostafa Sadeghi

,

Jean-Eudes Ayilo

,

,

Xavier Alameda-Pineda

IEEE Signal Process. Lett., 2025

AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder.

[DOI]

,

,

,

,

Xavier Alameda-Pineda

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Diffusion-based Unsupervised Audio-visual Speech Enhancement.

[DOI]

Jean-Eudes Ayilo

,

Mostafa Sadeghi

,

,

Xavier Alameda-Pineda

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

MEGA: Masked Generative Autoencoder for Human Mesh Recovery.

[DOI]

Guénolé Fiche

,

,

Xavier Alameda-Pineda

,

Francesc Moreno-Noguer

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Unsupervised performance analysis of 3D face alignment with a statistically robust confidence test.

[DOI]

Mostafa Sadeghi

,

Xavier Alameda-Pineda

,

Neurocomputing, January, 2024

Robust Audio-Visual Contrastive Learning for Proposal-Based Self-Supervised Sound Source Localization in Videos.

[DOI]

,

,

,

,

,

Xavier Alameda-Pineda

,

IEEE Trans. Pattern Anal. Mach. Intell., 2024

A multimodal dynamical variational autoencoder for audiovisual speech representation learning.

[DOI]

,

,

,

Xavier Alameda-Pineda

,

Renaud Séguier

Neural Networks, 2024

A Weighted-Variance Variational Autoencoder Model for Speech Enhancement.

[DOI]

,

Mostafa Sadeghi

,

Xavier Alameda-Pineda

,

Proceedings of the IEEE International Conference on Acoustics, 2024

Lost and Found: Overcoming Detector Failures in Online Multi-object Tracking.

[DOI]

Lorenzo Vaquero

,

,

Xavier Alameda-Pineda

,

Víctor M. Brea

,

Manuel Mucientes

Proceedings of the Computer Vision - ECCV 2024, 2024

VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space.

[DOI]

Guénolé Fiche

,

,

Xavier Alameda-Pineda

,

,

Francesc Moreno-Noguer

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Variational meta reinforcement learning for social robotics.

[DOI]

,

Xavier Alameda-Pineda

,

Appl. Intell., November, 2023

TransCenter: Transformers With Dense Representations for Multiple-Object Tracking.

[DOI]

,

,

Guillaume Delorme

,

,

,

Xavier Alameda-Pineda

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Expression-Preserving Face Frontalization Improves Visually Assisted Speech Processing.

[DOI]

,

Mostafa Sadeghi

,

,

Xavier Alameda-Pineda

Int. J. Comput. Vis., May, 2023

Learning and controlling the source-filter representation of speech with a variational autoencoder.

[DOI]

,

,

,

Xavier Alameda-Pineda

,

Renaud Séguier

Speech Commun., March, 2023

Continual Attentive Fusion for Incremental Learning in Semantic Segmentation.

[DOI]

,

,

,

,

,

,

Xavier Alameda-Pineda

,

IEEE Trans. Multim., 2023

Successor Feature Representations.

[DOI]

,

Xavier Alameda-Pineda

Trans. Mach. Learn. Res., 2023

Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation.

[DOI]

,

,

Xavier Alameda-Pineda

Trans. Mach. Learn. Res., 2023

SocialInteractionGAN: Multi-Person Interaction Sequence Generation.

[DOI]

,

Dominique Vaufreydaz

,

Xavier Alameda-Pineda

IEEE Trans. Affect. Comput., 2023

Uncertainty-Aware Contrastive Distillation for Incremental Semantic Segmentation.

[DOI]

,

,

,

,

,

,

Xavier Alameda-Pineda

,

IEEE Trans. Pattern Anal. Mach. Intell., 2023

Univariate Radial Basis Function Layers: Brain-inspired Deep Neural Layers for Low-Dimensional Inputs.

[DOI]

Basavasagar Patil

,

Xavier Alameda-Pineda

,

CoRR, 2023

A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation.

[DOI]

,

Dominique Vaufreydaz

,

Xavier Alameda-Pineda

CoRR, 2023

Back to MLP: A Simple Baseline for Human Motion Prediction.

[DOI]

,

,

,

Vincent Lepetit

,

Xavier Alameda-Pineda

,

Francesc Moreno-Noguer

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Motion-DVAE: Unsupervised learning for fast human motion denoising.

[DOI]

Guénolé Fiche

,

,

Xavier Alameda-Pineda

,

Renaud Séguier

Proceedings of the 16th ACM SIGGRAPH Conference on Motion, Interaction and Games, 2023

Unsupervised speech enhancement with deep dynamical generative speech and noise models.

[DOI]

,

,

,

Xavier Alameda-Pineda

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers.

[DOI]

,

Massimiliano Mancini

,

Karteek Alahari

,

Xavier Alameda-Pineda

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Speech Modeling with a Hierarchical Transformer Dynamical VAE.

[DOI]

,

,

,

,

Xavier Alameda-Pineda

Proceedings of the IEEE International Conference on Acoustics, 2023

Semi-supervised learning made simple with self-supervised clustering.

[DOI]

,

,

Karteek Alahari

,

Xavier Alameda-Pineda

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Variational Inference and Learning of Piecewise Linear Dynamical Systems.

[DOI]

Xavier Alameda-Pineda

,

Vincent Drouard

,

Radu Patrice Horaud

IEEE Trans. Neural Networks Learn. Syst., 2022

Unsupervised Speech Enhancement Using Dynamical Variational Autoencoders.

[DOI]

,

,

Xavier Alameda-Pineda

,

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Probabilistic Graph Attention Network With Conditional Kernels for Pixel-Wise Prediction.

[DOI]

,

Xavier Alameda-Pineda

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Weighted variance variational autoencoder for speech enhancement.

[DOI]

,

Mostafa Sadeghi

,

Xavier Alameda-Pineda

,

CoRR, 2022

Robust Audio-Visual Instance Discrimination via Active Contrastive Set Mining.

[DOI]

,

,

,

,

,

,

Xavier Alameda-Pineda

CoRR, 2022

HiT-DVAE: Human Motion Generation via Hierarchical Transformer Dynamical VAE.

[DOI]

,

,

,

,

Francesc Moreno-Noguer

,

Xavier Alameda-Pineda

CoRR, 2022

Unsupervised Multiple-Object Tracking with a Dynamical Variational Autoencoder.

[DOI]

,

,

Xavier Alameda-Pineda

CoRR, 2022

M4MM '22: 1st International Workshop on Methodologies for Multimedia.

[DOI]

Xavier Alameda-Pineda

,

,

,

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Active Contrastive Set Mining for Robust Audio-Visual Instance Discrimination.

[DOI]

,

,

,

,

,

,

Xavier Alameda-Pineda

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

The Impact of Removing Head Movements on Audio-Visual Speech Enhancement.

[DOI]

,

Mostafa Sadeghi

,

,

Xavier Alameda-Pineda

,

,

Proceedings of the IEEE International Conference on Acoustics, 2022

A Proposal-based Paradigm for Self-supervised Sound Source Localization in Videos.

[DOI]

,

,

,

,

Xavier Alameda-Pineda

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Multi-Person Extreme Motion Prediction.

[DOI]

,

,

Xavier Alameda-Pineda

,

Francesc Moreno-Noguer

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Self-Supervised Models are Continual Learners.

[DOI]

,

Victor G. Turrisi da Costa

,

Xavier Alameda-Pineda

,

,

Karteek Alahari

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

ExPI Dataset.

[DOI]

,

,

Francesc Moreno-Noguer

,

Xavier Alameda-Pineda

Dataset, October, 2021

ExPI Dataset.

[DOI]

,

,

Francesc Moreno-Noguer

,

Xavier Alameda-Pineda

Dataset, October, 2021

Mixture of Inference Networks for VAE-Based Audio-Visual Speech Enhancement.

[DOI]

Mostafa Sadeghi

,

Xavier Alameda-Pineda

IEEE Trans. Signal Process., 2021

Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers.

[DOI]

,

Xavier Alameda-Pineda

,

,

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Dynamical Variational Autoencoders: A Comprehensive Review.

[DOI]

,

,

,

,

,

Xavier Alameda-Pineda

Found. Trends Mach. Learn., 2021

Successor Feature Neural Episodic Control.

[DOI]

,

Xavier Alameda-Pineda

,

CoRR, 2021

Xi-Learning: Successor Feature Transfer Learning for General Reward Functions.

[DOI]

,

Xavier Alameda-Pineda

CoRR, 2021

Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders.

[DOI]

,

,

Xavier Alameda-Pineda

,

CoRR, 2021

Multi-Person Extreme Motion Prediction with Cross-Interaction Attention.

[DOI]

,

,

Xavier Alameda-Pineda

,

Francesc Moreno-Noguer

CoRR, 2021

TransCenter: Transformers with Dense Queries for Multiple-Object Tracking.

[DOI]

,

,

Guillaume Delorme

,

,

,

Xavier Alameda-Pineda

CoRR, 2021

Variational Structured Attention Networks for Deep Visual Representation Learning.

[DOI]

,

,

Xavier Alameda-Pineda

,

,

,

CoRR, 2021

PI-Net: Pose Interacting Network for Multi-Person Monocular 3D Pose Estimation.

[DOI]

,

,

Francesc Moreno-Noguer

,

Xavier Alameda-Pineda

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Deep Variational Generative Models for Audio-Visual Speech Separation.

[DOI]

Viet-Nhat Nguyen

,

Mostafa Sadeghi

,

,

Xavier Alameda-Pineda

Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021

A Benchmark of Dynamical Variational Autoencoders Applied to Speech Spectrogram Modeling.

[DOI]

,

,

,

,

Xavier Alameda-Pineda

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Switching Variational Auto-Encoders for Noise-Agnostic Audio-Visual Speech Enhancement.

[DOI]

Mostafa Sadeghi

,

Xavier Alameda-Pineda

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Learning How to Smile: Expression Video Generation With Conditional Adversarial Recurrent Nets.

[DOI]

,

Xavier Alameda-Pineda

,

,

,

IEEE Trans. Multim., 2020

Audio-Visual Speech Enhancement Using Conditional Variational Auto-Encoders.

[DOI]

Mostafa Sadeghi

,

,

Xavier Alameda-Pineda

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2020

A Comprehensive Analysis of Deep Regression.

[DOI]

Stéphane Lathuilière

,

,

Xavier Alameda-Pineda

,

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Special Issue on Generating Realistic Visual Data of Human Behavior.

[DOI]

Xavier Alameda-Pineda

,

,

Albert Ali Salah

,

,

Int. J. Comput. Vis., 2020

GMM-UNIT: Unsupervised Multi-Domain and Multi-Modal Image-to-Image Translation via Attribute Gaussian Mixture Modeling.

[DOI]

,

,

,

,

,

Xavier Alameda-Pineda

CoRR, 2020

Describe What to Change: A Text-guided Unsupervised Image-to-image Translation Approach.

[DOI]

,

,

,

,

Xavier Alameda-Pineda

,

,

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

FATE/MM 20: 2nd International Workshop on Fairness, Accountability, Transparency and Ethics in MultiMedia.

[DOI]

Xavier Alameda-Pineda

,

,

Jahna Otterbacher

,

,

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

CANU-ReID: A Conditional Adversarial Network for Unsupervised person Re-IDentification.

[DOI]

Guillaume Delorme

,

,

Stéphane Lathuilière

,

,

Xavier Alameda-Pineda

Proceedings of the 25th International Conference on Pattern Recognition, 2020

ODANet: Online Deep Appearance Network for Identity-Consistent Multi-person Tracking.

[DOI]

Guillaume Delorme

,

,

Guillaume Sarrazin

,

Xavier Alameda-Pineda

Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

Robust Unsupervised Audio-Visual Speech Enhancement Using a Mixture of Variational Autoencoders.

[DOI]

Mostafa Sadeghi

,

Xavier Alameda-Pineda

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Recurrent Variational Autoencoder for Speech Enhancement.

[DOI]

,

Xavier Alameda-Pineda

,

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

How to Train Your Deep Multi-Object Tracker.

[DOI]

,

,

,

,

Laura Leal-Taixé

,

Xavier Alameda-Pineda

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Towards Probabilistic Generative Models for Socially Intelligent Robots.

[DOI]

Xavier Alameda-Pineda

, 2020

2019

Increasing Image Memorability with Neural Style Transfer.

[DOI]

Aliaksandr Siarohin

,

,

Cveta Majtanovic

,

Xavier Alameda-Pineda

,

,

ACM Trans. Multim. Comput. Commun. Appl., 2019

Special Section on Multimodal Understanding of Social, Affective, and Subjective Attributes.

[DOI]

Xavier Alameda-Pineda

,

,

Mohammad Soleymani

,

,

,

Samuel D. Gosling

ACM Trans. Multim. Comput. Commun. Appl., 2019

Tracking Multiple Audio Sources With the von Mises Distribution and Variational EM.

[DOI]

,

Xavier Alameda-Pineda

,

Christine Evers

,

IEEE Signal Process. Lett., 2019

Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments.

[DOI]

,

,

,

Xavier Alameda-Pineda

,

IEEE J. Sel. Top. Signal Process., 2019

Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoder.

[DOI]

Mostafa Sadeghi

,

,

Xavier Alameda-Pineda

,

,

CoRR, 2019

DeepMOT: A Differentiable Framework for Training Multiple Object Trackers.

[DOI]

,

,

Xavier Alameda-Pineda

,

CoRR, 2019

Camera Adversarial Transfer for Unsupervised Person Re-Identification.

[DOI]

Guillaume Delorme

,

Xavier Alameda-Pineda

,

Stéphane Lathuilière

,

CoRR, 2019

FAT/MM'19: 1st International Workshop on Fairness, Accountability, and Transparency in MultiMedia.

[DOI]

Xavier Alameda-Pineda

,

,

,

,

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Audio-Visual Variational Fusion for Multi-Person Tracking with Robots.

[DOI]

Xavier Alameda-Pineda

,

,

,

Guillaume Delorme

,

,

,

,

Bastien Mourgue

,

Guillaume Sarrazin

Proceedings of the 27th ACM International Conference on Multimedia, 2019

The Predicting Media Memorability Task at MediaEval 2019.

[DOI]

Mihai Gabriel Constantin

,

,

Claire-Hélène Demarty

,

Ngoc Q. K. Duong

,

Xavier Alameda-Pineda

,

Proceedings of the Working Notes Proceedings of the MediaEval 2019 Workshop, 2019

2018

Cross-Paced Representation Learning With Partial Curricula for Sketch-Based Image Retrieval.

[DOI]

,

Xavier Alameda-Pineda

,

,

,

IEEE Trans. Image Process., 2018

A cascaded multiple-speaker localization and tracking system.

[DOI]

,

,

,

Xavier Alameda-Pineda

,

CoRR, 2018

Every Smile is Unique: Landmark-Guided Diverse Smile Generation.

[DOI]

,

Xavier Alameda-Pineda

,

,

,

CoRR, 2018

EE-USAD: ACM MM 2018Workshop on UnderstandingSubjective Attributes of Data focus on Evoked Emotions.

[DOI]

Xavier Alameda-Pineda

,

,

,

,

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking.

[DOI]

,

,

Xavier Alameda-Pineda

,

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

DeepGUM: Learning Deep Robust Regression with a Gaussian-Uniform Mixture Model.

[DOI]

Stéphane Lathuilière

,

,

Xavier Alameda-Pineda

,

Proceedings of the Computer Vision - ECCV 2018, 2018

Every Smile Is Unique: Landmark-Guided Diverse Smile Generation.

[DOI]

,

Xavier Alameda-Pineda

,

,

,

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multimodal analysis of free-standing conversational groups.

[DOI]

Xavier Alameda-Pineda

,

,

Proceedings of the Frontiers of Multimedia Research, 2018

2017

Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory Mapping.

[DOI]

,

,

Xavier Alameda-Pineda

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Automatic animation of an articulatory tongue model from ultrasound images of the vocal tract.

[DOI]

,

,

,

Xavier Alameda-Pineda

,

Speech Commun., 2017

Exploiting the intermittency of speech for joint separation and diarization.

[DOI]

Dionyssos Kounades-Bastian

,

,

Xavier Alameda-Pineda

,

,

Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Learning Deep Structured Multi-Scale Features using Attention-Gated CRFs for Contour Prediction.

[DOI]

,

,

Xavier Alameda-Pineda

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

MUSA2: First ACM Workshop on Multimodal Understanding of Social, Affective and Subjective Attributes.

[DOI]

Xavier Alameda-Pineda

,

,

Mohammad Soleymani

,

,

,

Samuel D. Gosling

Proceedings of the 2017 ACM on Multimedia Conference, 2017

How to Make an Image More Memorable?: A Deep Style Transfer Approach.

[DOI]

Aliaksandr Siarohin

,

,

Cveta Majtanovic

,

Xavier Alameda-Pineda

,

,

Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Tracking a varying number of people with a visually-controlled robotic head.

[DOI]

,

Xavier Alameda-Pineda

,

,

,

Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Exploiting the Complementarity of Audio and Visual Data in Multi-speaker Tracking.

[DOI]

,

,

Xavier Alameda-Pineda

,

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

An EM algorithm for joint source separation and diarisation of multichannel convolutive speech mixtures.

[DOI]

Dionyssos Kounades-Bastian

,

,

Xavier Alameda-Pineda

,

,

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Adaptation of a Gaussian Mixture Regressor to a New Input Distribution: Extending the C-GMR Framework.

[DOI]

,

,

Xavier Alameda-Pineda

Proceedings of the Latent Variable Analysis and Signal Separation, 2017

Viraliency: Pooling Local Virality.

[DOI]

Xavier Alameda-Pineda

,

,

,

,

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

SALSA: A Multimodal Dataset for the Automated Analysis of Free-Standing Social Interactions.

[DOI]

Xavier Alameda-Pineda

,

Ramanathan Subramanian

,

,

,

Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

2016

A Variational EM Algorithm for the Separation of Time-Varying Convolutive Audio Mixtures.

[DOI]

Dionyssos Kounades-Bastian

,

,

Xavier Alameda-Pineda

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2016

EM Algorithms for Weighted-Data Clustering with Application to Audio-Visual Scene Analysis.

[DOI]

Israel Dejene Gebru

,

Xavier Alameda-Pineda

,

Florence Forbes

,

IEEE Trans. Pattern Anal. Mach. Intell., 2016

SALSA: A Novel Dataset for Multimodal Group Behavior Analysis.

[DOI]

Xavier Alameda-Pineda

,

,

Ramanathan Subramanian

,

Ligia Maria Batrinca

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., 2016

An on-line variational Bayesian model for multi-person tracking from cluttered scenes.

[DOI]

,

Xavier Alameda-Pineda

,

Alessio Xompero

,

Comput. Vis. Image Underst., 2016

Academic Coupled Dictionary Learning for Sketch-based Image Retrieval.

[DOI]

,

Xavier Alameda-Pineda

,

,

,

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Emerging Topics in Learning from Noisy and Missing Data.

[DOI]

Xavier Alameda-Pineda

,

Timothy M. Hospedales

,

,

,

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Multi-Paced Dictionary Learning for cross-domain retrieval and recognition.

[DOI]

,

,

Xavier Alameda-Pineda

,

,

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

An inverse-gamma source variance prior with factorized parameterization for audio source separation.

[DOI]

Dionyssos Kounades-Bastian

,

,

Xavier Alameda-Pineda

,

,

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Tracking Multiple Persons Based on a Variational Bayesian Model.

[DOI]

,

,

Xavier Alameda-Pineda

,

Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Self-Adaptive Matrix Completion for Heart Rate Estimation from Face Videos under Realistic Conditions.

[DOI]

Sergey Tulyakov

,

Xavier Alameda-Pineda

,

,

,

Jeffrey F. Cohn

,

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Recognizing Emotions from Abstract Paintings Using Non-Linear Matrix Completion.

[DOI]

Xavier Alameda-Pineda

,

,

,

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Projective Unsupervised Flexible Embedding with Optimal Graph.

[DOI]

,

,

,

Xavier Alameda-Pineda

,

,

Proceedings of the British Machine Vision Conference 2016, 2016

2015

Speaker-Adaptive Acoustic-Articulatory Inversion Using Cascaded Gaussian Mixture Regression.

[DOI]

,

,

Xavier Alameda-Pineda

,

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Vision-guided robot hearing.

[DOI]

Xavier Alameda-Pineda

,

Int. J. Robotics Res., 2015

A variational EM algorithm for the separation of moving sound sources.

[DOI]

Dionyssos Kounades-Bastian

,

,

Xavier Alameda-Pineda

,

,

Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Analyzing Free-standing Conversational Groups: A Multimodal Approach.

[DOI]

Xavier Alameda-Pineda

,

,

,

,

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

2014

A Geometric Approach to Sound Source Localization from Time-Delay Estimates.

[DOI]

Xavier Alameda-Pineda

,

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Audio-visual speaker localization via weighted clustering.

[DOI]

Israel D. Gebru

,

Xavier Alameda-Pineda

,

,

Florence Forbes

Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2014

Sound representation and classification benchmark for domestic robots.

[DOI]

,

Xavier Alameda-Pineda

,

,

Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

2013

Egocentric Audio-Visual Scene Analysis. A Machine Learning and Signal Processing Approach. (Analyse Égocentrique de Scènes Audio-Visuelles. Une approche par Apprentissage Automatique et Traitement du Signal).

[DOI]

Xavier Alameda-Pineda

PhD thesis, 2013

RAVEL: an annotated corpus for training robots with audiovisual abilities.

[DOI]

Xavier Alameda-Pineda

,

Jordi Sanchez-Riera

,

Johannes Wienke

,

,

,

Kaustubh Kulkarni

,

Antoine Deleforge

,

J. Multimodal User Interfaces, 2013

The geometry of sound-source localization using non-coplanar microphone arrays.

[DOI]

Xavier Alameda-Pineda

,

,

Bernard Mourrain

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Benchmarking methods for audio-visual recognition using tiny training sets.

[DOI]

Xavier Alameda-Pineda

,

Jordi Sanchez-Riera

,

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Audio-visual robot command recognition: D-META'12 grand challenge.

[DOI]

Jordi Sanchez-Riera

,

Xavier Alameda-Pineda

,

Proceedings of the International Conference on Multimodal Interaction, 2012

Online multimodal speaker detection for humanoid robots.

[DOI]

Jordi Sanchez-Riera

,

Xavier Alameda-Pineda

,

Johannes Wienke

,

Antoine Deleforge

,

,

,

Sebastian Wrede

,

Proceedings of the 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012), Osaka, Japan, November 29, 2012

Sound-event recognition with a companion humanoid.

[DOI]

,

Xavier Alameda-Pineda

,

,

Proceedings of the 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012), Osaka, Japan, November 29, 2012

Geometrically-constrained robust time delay estimation using non-coplanar microphone arrays.

[DOI]

Xavier Alameda-Pineda

,

Proceedings of the 20th European Signal Processing Conference, 2012

2011

Finding audio-visual events in informal social gatherings.

[DOI]

Xavier Alameda-Pineda

,

,

,

Florence Forbes

Proceedings of the 13th International Conference on Multimodal Interfaces, 2011

2008

Image compression with Generalized Lifting and partial knowledge of the signal pdf.

[DOI]

Julio C. Rolón

,

Philippe Salembier

,

Xavier Alameda-Pineda

Proceedings of the International Conference on Image Processing, 2008

Loading...