Jordi Luque

Orcid: 0000-0002-4507-4930

According to our database1, Jordi Luque authored at least 52 papers between 2006 and 2023.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Robust Wake-Up Word Detection by Two-stage Multi-resolution Ensembles.
CoRR, 2023

Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning.
CoRR, 2023

2022
Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation.
CoRR, 2022

Data Augmentation for Low-Resource Quechua ASR Improvement.
Proceedings of the Interspeech 2022, 2022

Recycle Your Wav2Vec2 Codebook: A Speech Perceiver for Keyword Spotting.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Voice Quality and Pitch Features in Transformer-Based Speech Recognition.
CoRR, 2021

Influence of ASR and Language Model on Alzheimer's Disease Detection.
CoRR, 2021

English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System.
CoRR, 2021

Speech Enhancement for Wake-Up-Word detection in Voice Assistants.
CoRR, 2021

BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge.
Proceedings of the Fifth International Conference, 2021

Speech Enhancement for Wake-Up-Word detection in Voice Assistants.
Proceedings of the Fifth International Conference, 2021

Efficient Keyword Spotting by Capturing Long-Range Interactions with Temporal Lambda Networks.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Convolutional Speech Recognition with Pitch and Voice Quality Features.
CoRR, 2020

Transcription-Enriched Joint Embeddings for Spoken Descriptions of Images and Videos.
CoRR, 2020

A unifying framework for modeling acoustic/prosodic entrainment: definition and evaluation on two large corpora.
Proceedings of the 21th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2020

Input Complexity and Out-of-distribution Detection with Likelihood-based Generative Models.
Proceedings of the 8th International Conference on Learning Representations, 2020

Detection of Speech Events and Speaker Characteristics through Photo-Plethysmographic Signal Neural Processing.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
How Much Does Audio Matter to Recognize Egocentric Object Interactions?
CoRR, 2019

Seeing and Hearing Egocentric Actions: How Much Can We Learn?
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

A Reality Check on Inference at Mobile Networks Edge.
Proceedings of the 2nd International Workshop on Edge Systems, Analytics and Networking, 2019

2018
The use of long-term features for GMM- and i-vector-based speaker diarization systems.
EURASIP J. Audio Speech Music. Process., 2018

Chatbol, a Chatbot for the Spanish "La Liga".
Proceedings of the 9th International Workshop on Spoken Dialogue System Technology, 2018

Lightly Supervised vs. Semi-supervised Training of Acoustic Model on Luxembourgish for Low-resource Automatic Speech Recognition.
Proceedings of the Interspeech 2018, 2018

END-to-END Photopleth YsmographY (PPG) Based Biometric Authentication by Using Convolutional Neural Networks.
Proceedings of the 26th European Signal Processing Conference, 2018

2017
The Role of Linguistic and Prosodic Cues on the Prediction of Self-Reported Satisfaction in Contact Centre Phone Calls.
Proceedings of the Interspeech 2017, 2017

2016
Emergence of linguistic laws in human voice.
CoRR, 2016

Short- and Long-Term Speech Features for Hybrid HMM-i-Vector based Speaker Diarization System.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Improving i-Vector and PLDA Based Speaker Clustering with Long-Term Features.
Proceedings of the Interspeech 2016, 2016

Automatic Speech Feature Learning for Continuous Prediction of Customer Satisfaction in Contact Center Phone Calls.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

2015
Using voice-quality measurements with prosodic and spectral features for speaker diarization.
Proceedings of the INTERSPEECH 2015, 2015

Effect of gender and call duration on customer satisfaction in call center big data.
Proceedings of the INTERSPEECH 2015, 2015

MASK+: Data-driven regions selection for acoustic fingerprinting.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Speech earthquakes: scaling and universality in human voice.
CoRR, 2014

Audio-to-text alignment for speech recognition with very limited resources.
Proceedings of the INTERSPEECH 2014, 2014

Inferring social relationships in a phone call from a single party's speech.
Proceedings of the IEEE International Conference on Acoustics, 2014

Sentiment retrieval on web reviews using spontaneous natural speech.
Proceedings of the IEEE International Conference on Acoustics, 2014

Phoneme-Lattice to Phoneme-Sequence Matching Algorithm Based on Dynamic Programming.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

Flexible Stand-Alone Keyword Recognition Application Using Dynamic Time Warping.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

On the modeling of natural vocal emotion expressions through binary key.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
The Telefonica Research Spoken Web Search System for MediaEval 2013.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

2012
Simultaneous Speech Detection With Spatial Features for Speaker Diarization.
IEEE Trans. Speech Audio Process., 2012

On the use of agglomerative and spectral clustering in speaker diarization of meetings.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Consistent association between image features of fetal lungs from different ultrasound equipments and fetal lung maturity from amniocentesis.
Proceedings of the 9th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 2012

2011
Parallel Transformation Network features for speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Connectionist Transformation Network Features for Speaker Recognition.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

2008
Multimodal identification and localization of users in a smart environment.
J. Multimodal User Interfaces, 2008

Clustering initialization based on spatial information for speaker diarization of meetings.
Proceedings of the INTERSPEECH 2008, 2008

2007
Robust Speaker Identification for Meetings: UPC CLEAR'07 Meeting Room Evaluation System.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

Speaker Diarization for Conference Room: The UPC RT07s Evaluation System.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006
Person Verification by Fusion of Prosodic, Voice Spectral and Facial Parameters.
Proceedings of the SECRYPT 2006, 2006

On the fusion of prosody, voice spectrum and face features for multimodal person verification.
Proceedings of the INTERSPEECH 2006, 2006

Audio, Video and Multimodal Person Identification in a Smart Room.
Proceedings of the Multimodal Technologies for Perception of Humans, 2006


  Loading...