Doroteo T. Toledano

Orcid: 0000-0003-1159-6455

Affiliations:
  • Autonomous University of Madrid, AUDIAS, Spain


According to our database1, Doroteo T. Toledano authored at least 76 papers between 1997 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Whisper-based spoken term detection systems for search on speech ALBAYZIN evaluation challenge.
EURASIP J. Audio Speech Music. Process., December, 2024

Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices.
CoRR, 2024

2022
Source Separation for Sound Event Detection in Domestic Environments using Jointly Trained Models.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

2021
BiosecurID: a multimodal biometric database.
CoRR, 2021

A Multi-Resolution CRNN-Based Approach for Semi-Supervised Sound Event Detection in DCASE 2020 Challenge.
IEEE Access, 2021

A study of data augmentation for increased ASR robustness against packet losses.
Proceedings of the Fifth International Conference, 2021

An analysis of Sound Event Detection under acoustic degradation using multi-resolution systems.
Proceedings of the Fifth International Conference, 2021

Query-by-Example Spoken Term Detection using Attentive Pooling Networks at ALBAYZIN 2020 Evaluation: The AUDIAS-UAM System.
Proceedings of the Fifth International Conference, 2021

Multiple Feature Resolutions for Different Polyphonic Sound Detection Score Scenarios in DCASE 2021 Task 4.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020
A Multi-Resolution Approach to Sound Event Detection in DCASE 2020 Task4.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019
Estudio sobre documentos reutilizables como recursos lingüísticos en el marco del desarrollo del Plan de Impulso de las Tecnologías del Lenguaje.
Proces. del Leng. Natural, 2019

Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation.
EURASIP J. Audio Speech Music. Process., 2019

ALBAYZIN 2018 spoken term detection evaluation: a multi-domain international evaluation in Spanish.
EURASIP J. Audio Speech Music. Process., 2019

Exploring convolutional, recurrent, and hybrid deep neural networks for speech and music detection in a large audio dataset.
EURASIP J. Audio Speech Music. Process., 2019

2018
ALBAYZIN Query-by-example Spoken Term Detection 2016 evaluation.
EURASIP J. Audio Speech Music. Process., 2018

DNN-based Embeddings for Speaker Diarization in the AuDIaS-UAM System for the Albayzin 2018 IberSPEECH-RTVE Evaluation.
Proceedings of the Fourth International Conference, 2018

Audio event detection on Google's Audio Set database: Preliminary results using different types of DNNs.
Proceedings of the Fourth International Conference, 2018

AUDIAS-CEU: A Language-independent approach for the Query-by-Example Spoken Term Detection task of the Search on Speech ALBAYZIN 2018 evaluation.
Proceedings of the Fourth International Conference, 2018

2017
ALBAYZIN 2016 spoken term detection evaluation: an international open competitive evaluation in Spanish.
EURASIP J. Audio Speech Music. Process., 2017

2016
Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluations.
EURASIP J. Audio Speech Music. Process., 2016

Detection of Publicity Mentions in Broadcast Radio: Preliminary Results.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

2015
Speech Analysis.
Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015

Publisher's Erratum to: Voice Device.
Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015

Voice Device.
Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015

Speaker Features.
Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015

Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion.
EURASIP J. Audio Speech Music. Process., 2015

Speech Signal and Facial Image Processing for Obstructive Sleep Apnea Assessment.
Comput. Math. Methods Medicine, 2015

An end-to-end approach to language identification in short utterances using convolutional neural networks.
Proceedings of the INTERSPEECH 2015, 2015

2014
Feature analysis for discriminative confidence estimation in spoken term detection.
Comput. Speech Lang., 2014

Analysis of voice features related to obstructive sleep apnoea and their application in diagnosis support.
Comput. Speech Lang., 2014

ATVS-CSLT-HCTLab System for NIST 2013 Open Keyword Search Evaluation.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

2013
Query-by-Example Spoken Term Detection ALBAYZIN 2012 evaluation: overview, systems, results, and discussion.
EURASIP J. Audio Speech Music. Process., 2013

2012
Mejorando el acceso, el análisis y la visibilidad de la Información y los contenidos Multilingues y Multimedia en Red para la Comunidad de Madrid.
Proces. del Leng. Natural, 2012

Preliminary Results of Alignment of Text and Audio in News and Songs.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

Using HMM to Detect Speakers with Severe Obstructive Sleep Apnoea Syndrome.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

2011
Analyzing Training Dependencies and Posterior Fusion in Discriminant Classification of Apnea Patients Based on Sustained and Connected Speech.
Proceedings of the INTERSPEECH 2011, 2011

2010
BiosecurID: a multimodal biometric database.
Pattern Anal. Appl., 2010

Multilevel and Session Variability Compensated Language Recognition: ATVS-UAM Systems at NIST LRE 2009.
IEEE J. Sel. Top. Signal Process., 2010

A Study of the Influence of Speech Type on Automatic Language Recognition Performance.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Augmented set of features for confidence estimation in spoken term detection.
Proceedings of the INTERSPEECH 2010, 2010

Phone-Conditioned Suboptimal Wiener Filtering.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

2009
Speech Analysis.
Proceedings of the Encyclopedia of Biometrics, 2009

Voice Device.
Proceedings of the Encyclopedia of Biometrics, 2009

Speaker Features.
Proceedings of the Encyclopedia of Biometrics, 2009

Feature Compensation Techniques for ASR on Band-Limited Speech.
IEEE Trans. Speech Audio Process., 2009

Assessment of Severe Apnoea through Voice Analysis, Automatic Speech, and Speaker Recognition Techniques.
EURASIP J. Adv. Signal Process., 2009

Severe Apnoea Detection using Speaker Recognition Techniques.
Proceedings of the BIOSIGNALS 2009, 2009

2008
Herramientas de anotación de corpus de habla espontánea del Laboratorio de Lingística Informática de la UAM.
Proces. del Leng. Natural, 2008

Phoneme and sub-phoneme t-normalization for text-dependent speaker recognition.
Proceedings of the Odyssey 2008: The Speaker and Language Recognition Workshop, 2008

BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Design of a Multimodal Database for Research on Automatic Detection of Severe Apnoea Cases.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Developing a Phonemic and Syllabic Frequency Inventory for Spontaneous Spoken Castilian Spanish and their Comparison to Text-Based Inventories.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

rre STC-TIMIT: Generation of a Single-channel Telephone Corpus.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

MAP and sub-word level t-norm for text-dependent speaker recognition.
Proceedings of the INTERSPEECH 2008, 2008

Anchor-model fusion for language recognition.
Proceedings of the INTERSPEECH 2008, 2008

2007
Emulating DNA: Rigorous Quantification of Evidential Weight in Transparent and Testable Forensic Speaker Recognition.
IEEE Trans. Speech Audio Process., 2007

Blind Feature Compensation for Time-Variant Band-Limited Speech Recognition.
IEEE Signal Process. Lett., 2007

Biosec baseline corpus: A multimodal biometric database.
Pattern Recognit., 2007

Beyond objective performance evaluation in multimodal biometric systems.
Ann. des Télécommunications, 2007

Multivariate Cepstral Feature Compensation on Band-limited Data for Robust Speech Recognition.
Proceedings of the 16th Nordic Conference of Computational Linguistics, 2007

Improved language recognition using better phonetic decoders and fusion with MFCC and SDC features.
Proceedings of the INTERSPEECH 2007, 2007

2006
Initialization, training, and context-dependency in HMM-based formant tracking.
IEEE Trans. Speech Audio Process., 2006

Usability evaluation of multi-modal biometric verification systems.
Interact. Comput., 2006

Exploring PPRLM performance for NIST 2005 Language Recognition Evaluation.
Proceedings of the Odyssey 2006, 2006

Using Data-driven and Phonetic Units for Speaker Verification.
Proceedings of the Odyssey 2006, 2006

Unsupervised Class-Based Feature Compensation for Time-Variable Bandwidth-Limited Speech.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
On the relationship between phonetic modeling precision and phonetic speaker recognition accuracy.
Proceedings of the INTERSPEECH 2005, 2005

Statistical class-based MFCC enhancement of filtered and band-limited speech for robust ASR.
Proceedings of the INTERSPEECH 2005, 2005

MFCC Compensation for Improved Recognition of Filtered and Band-Limited Speech.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Acoustic-phonetic decoding of different types of spontaneous speech in Spanish.
Proceedings of the ISCA Tutorial and Research Workshop (ITRW) on Disfluency in Spontaneous Speech, 2005

2003
Automatic phonetic segmentation.
IEEE Trans. Speech Audio Process., 2003

2002
HMMs for Automatic Phonetic Segmentation.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

2001
Local refinement of phonetic boundaries: a general framework and its application using different transition models.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Neural network boundary refining for automatic speech segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2000

1998
Trying to mimic human segmentation of speech using HMM and fuzzy logic post-correction rules.
Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998

1997
Automatic alternative transcription generation and vocabulary selection for flexible word recognizers.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997


  Loading...