Jordi Luque

ORCID: 0000-0002-4507-4930

According to our database, Jordi Luque authored at least 60 papers between 2006 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

"OK Aura, Be Fair With Me": Demographics-Agnostic Training for Bias Mitigation in Wake-up Word Detection.

[DOI]

Fernando López

,

Paula Delgado-Santos

,

,

,

CoRR, April, 2026

Optimizing Multilingual LLMs via Federated Learning: A Study of Client Language Composition.

[DOI]

,

,

Carlos Escolano

CoRR, March, 2026

2025

Robustness assessment of large audio language models in multiple-choice evaluation.

[DOI]

Fernando López

,

Santosh Kesiraju

,

CoRR, October, 2025

The Eloquence team submission for task 1 of MLC-SLM challenge.

[DOI]

Lorenzo Concina

,

,

,

Marco Matassoni

,

CoRR, July, 2025

2024

Word Sense Disambiguation in Native Spanish: A Comprehensive Lexical Evaluation Resource.

[DOI]

,

,

,

,

Richard Benjamins

CoRR, 2024

On the Relationship of Social Gender Equality and Grammatical Gender in Pre-Trained Large Language Models.

[DOI]

Magdalena Biesialska

,

,

,

Proceedings of the Poster Proceedings of the 40th Annual Conference of the Spanish Association for Natural Language Processing 2024 (SEPLN-P 2024) co-located with the 40th International Conference of the Spanish Society for Natural Language Processing (SEPLN 2024), 2024

2023

Robust Wake-Up Word Detection by Two-stage Multi-resolution Ensembles.

[DOI]

Fernando López

,

,

,

CoRR, 2023

Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning.

[DOI]

Gabriele Castellano

,

Juan-José Nieto

,

,

,

,

,

Flavio Esposito

,

,

CoRR, 2023

2022

Data Augmentation for Low-Resource Quechua ASR Improvement.

[DOI]

Rodolfo Zevallos

,

,

Guillermo Cámbara

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

TID Spanish ASR system for the Albayzin 2022 Speech-to-Text Transcription Challenge.

[DOI]

Fernando López

,

Proceedings of the 6th International Conference, 2022

Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation.

[DOI]

Fernando López

,

Proceedings of the 6th International Conference, 2022

BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge.

[DOI]

,

,

Martin Karafiát

,

,

Fernando López

,

,

,

,

,

,

,

Proceedings of the 6th International Conference, 2022

Recycle Your Wav2Vec2 Codebook: A Speech Perceiver for Keyword Spotting.

[DOI]

Guillermo Cámbara

,

,

Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021

Voice Quality and Pitch Features in Transformer-Based Speech Recognition.

[DOI]

Guillermo Cámbara

,

,

CoRR, 2021

Influence of ASR and Language Model on Alzheimer's Disease Detection.

[DOI]

Joan Codina-Filbà

,

Guillermo Cámbara

,

,

CoRR, 2021

English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System.

[DOI]

Guillermo Cámbara

,

Alex Peiró Lilja

,

,

CoRR, 2021

Speech Enhancement for Wake-Up-Word detection in Voice Assistants.

[DOI]

,

Guillermo Cámbara

,

Fernando López

,

,

,

CoRR, 2021

BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge.

[DOI]

,

Guillermo Cámbara

,

,

,

,

Martin Karafiát

,

,

Proceedings of the Fifth International Conference, 2021

Speech Enhancement for Wake-Up-Word detection in Voice Assistants.

[DOI]

,

Guillermo Cámbara

,

Fernando López

,

,

,

,

Proceedings of the Fifth International Conference, 2021

Efficient Keyword Spotting by Capturing Long-Range Interactions with Temporal Lambda Networks.

[DOI]

,

Santiago Escuder

,

,

,

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Convolutional Speech Recognition with Pitch and Voice Quality Features.

[DOI]

Guillermo Cámbara

,

,

CoRR, 2020

Transcription-Enriched Joint Embeddings for Spoken Descriptions of Images and Videos.

[DOI]

,

,

,

Xavier Giró-i-Nieto

CoRR, 2020

A unifying framework for modeling acoustic/prosodic entrainment: definition and evaluation on two large corpora.

[DOI]

Ramiro H. Gálvez

,

,

,

Agustín Gravano

Proceedings of the 21st Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2020

Input Complexity and Out-of-distribution Detection with Likelihood-based Generative Models.

[DOI]

,

,

,

Olga Slizovskaia

,

José F. Núñez

,

Proceedings of the 8th International Conference on Learning Representations, 2020

Detection of Speech Events and Speaker Characteristics through Photo-Plethysmographic Signal Neural Processing.

[DOI]

Guillermo Cámbara

,

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

How Much Does Audio Matter to Recognize Egocentric Object Interactions?

[DOI]

Alejandro Cartas

,

,

,

,

Mariella Dimiccoli

CoRR, 2019

Seeing and Hearing Egocentric Actions: How Much Can We Learn?

[DOI]

Alejandro Cartas

,

,

,

,

Mariella Dimiccoli

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

A Reality Check on Inference at Mobile Networks Edge.

[DOI]

Alejandro Cartas

,

,

,

Ilias Leontiadis

,

,

Nishanth Sastry

,

José Núñez-Martínez

,

,

Proceedings of the 2nd International Workshop on Edge Systems, Analytics and Networking, 2019

2018

The use of long-term features for GMM- and i-vector-based speaker diarization systems.

[DOI]

Abraham Woubie Zewoudie

,

,

Javier Hernando

EURASIP J. Audio Speech Music. Process., 2018

Chatbol, a Chatbot for the Spanish "La Liga".

[DOI]

,

,

,

Marta R. Costa-jussà

,

Rafael E. Banchs

Proceedings of the 9th International Workshop on Spoken Dialogue System Technology, 2018

Lightly Supervised vs. Semi-supervised Training of Acoustic Model on Luxembourgish for Low-resource Automatic Speech Recognition.

[DOI]

,

,

,

,

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

End-to-End Photoplethysmography (PPG) Based Biometric Authentication by Using Convolutional Neural Networks.

[DOI]

,

,

,

Alexandre Maravilla

,

,

Proceedings of the 26th European Signal Processing Conference, 2018

2017

The Role of Linguistic and Prosodic Cues on the Prediction of Self-Reported Satisfaction in Contact Centre Phone Calls.

[DOI]

,

,

Ariadna Sánchez

,

,

Luis Angel Galindo

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

Emergence of linguistic laws in human voice.

[DOI]

Iván González Torre

,

,

,

,

Antoni Hernández-Fernández

CoRR, 2016

Short- and Long-Term Speech Features for Hybrid HMM-i-Vector based Speaker Diarization System.

[DOI]

Abraham Woubie Zewoudie

,

,

Javier Hernando

Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Improving i-Vector and PLDA Based Speaker Clustering with Long-Term Features.

[DOI]

,

,

Javier Hernando

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automatic Speech Feature Learning for Continuous Prediction of Customer Satisfaction in Contact Center Phone Calls.

[DOI]

,

Daniel Balcells

,

,

,

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

2015

Using voice-quality measurements with prosodic and spectral features for speaker diarization.

[DOI]

,

,

Javier Hernando

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Effect of gender and call duration on customer satisfaction in call center big data.

[DOI]

,

,

,

Zoraida Hidalgo

,

,

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

MASK+: Data-driven regions selection for acoustic fingerprinting.

[DOI]

,

,

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Speech earthquakes: scaling and universality in human voice.

[DOI]

,

,

CoRR, 2014

Audio-to-text alignment for speech recognition with very limited resources.

[DOI]

,

,

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Inferring social relationships in a phone call from a single party's speech.

[DOI]

Sree Harsha Yella

,

,

Proceedings of the IEEE International Conference on Acoustics, 2014

Sentiment retrieval on web reviews using spontaneous natural speech.

[DOI]

José Costa Pereira

,

,

Proceedings of the IEEE International Conference on Acoustics, 2014

Phoneme-Lattice to Phoneme-Sequence Matching Algorithm Based on Dynamic Programming.

[DOI]

,

,

,

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

Flexible Stand-Alone Keyword Recognition Application Using Dynamic Time Warping.

[DOI]

Miquel Ferrarons

,

,

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

On the modeling of natural vocal emotion expressions through binary key.

[DOI]

,

Proceedings of the 22nd European Signal Processing Conference, 2014

2013

The Telefonica Research Spoken Web Search System for MediaEval 2013.

[DOI]

,

Miroslav Skácel

,

,

Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

2012

Simultaneous Speech Detection With Spatial Features for Speaker Diarization.

[DOI]

Martin Zelenák

,

,

,

Javier Hernando

IEEE Trans. Speech Audio Process., 2012

On the use of agglomerative and spectral clustering in speaker diarization of meetings.

[DOI]

,

Javier Hernando

Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Consistent association between image features of fetal lungs from different ultrasound equipments and fetal lung maturity from amniocentesis.

[DOI]

Elisenda Bonet-Carne

,

,

,

Mónica Martinez-Terron

,

Alvaro Perez-Moreno

,

,

Eduard Gratacós

,

Ivan Amat-Roldan

Proceedings of the 9th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 2012

2011

Parallel Transformation Network features for speaker recognition.

[DOI]

,

,

Isabel Trancoso

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Connectionist Transformation Network Features for Speaker Recognition.

[DOI]

,

Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

2008

Multimodal identification and localization of users in a smart environment.

[DOI]

Albert Ali Salah

,

,

,

,

Javier Hernando

,

,

Ben A. M. Schouten

,

Eric J. Pauwels

J. Multimodal User Interfaces, 2008

Clustering initialization based on spatial information for speaker diarization of meetings.

[DOI]

,

,

Javier Hernando

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007

Robust Speaker Identification for Meetings: UPC CLEAR'07 Meeting Room Evaluation System.

[DOI]

,

Javier Hernando

Proceedings of the Multimodal Technologies for Perception of Humans, 2007

Speaker Diarization for Conference Room: The UPC RT07s Evaluation System.

[DOI]

,

,

,

Javier Hernando

Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006

Person Verification by Fusion of Prosodic, Voice Spectral and Facial Parameters.

Javier Hernando

,

,

Pascual Ejarque

,

,

Proceedings of the SECRYPT 2006, 2006

On the fusion of prosody, voice spectrum and face features for multimodal person verification.

[DOI]

,

,

Pascual Ejarque

,

,

Javier Hernando

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Audio, Video and Multimodal Person Identification in a Smart Room.

[DOI]

,

,

,

,

,

,

Ferran Marqués

,

Claudi Martinez

,

Verónica Vilaplana

,

Javier Hernando

Proceedings of the Multimodal Technologies for Perception of Humans, 2006
