Francesc Alías

ORCID: 0000-0002-1921-2375

According to our database, Francesc Alías authored at least 73 papers between 2001 and 2022.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2022
Conversión de texto en habla multidominio basada en selección de unidades con ajuste subjetivo de pesos y marcado robusto de pitch
PhD thesis, 2022

2021
GENIOVOX Project: Computational generation of expressive voice.
Proceedings of the Fifth International Conference, 2021

Contribution of vocal tract and glottal source spectral cues in the generation of happy and aggressive [a] vowels.
Proceedings of the Fifth International Conference, 2021

2020
WASN-Based Day-Night Characterization of Urban Anomalous Noise Events in Narrow and Wide Streets.
Sensors, 2020

Aggregate Impact of Anomalous Noise Events on the WASN-Based Computation of Road Traffic Noise Levels in Urban and Suburban Environments.
Sensors, 2020

Parallel hierarchical architectures for efficient consensus clustering on big multimedia cluster ensembles.
Inf. Sci., 2020

2019
A WASN-Based Suburban Dataset for Anomalous Noise Event Detection on Dynamic Road-Traffic Noise Mapping.
Sensors, 2019

Review of Wireless Acoustic Sensor Networks for Environmental Noise Monitoring in Smart Cities.
J. Sensors, 2019

2018
Detection of Anomalous Noise Events on Low-Capacity Acoustic Nodes for Dynamic Road Traffic Noise Mapping within an Hybrid WASN.
Sensors, 2018

Influence of tense, modal and lax phonation on the three-dimensional finite element synthesis of vowel [A].
Proceedings of the Fourth International Conference, 2018

2017
The role of prosody and voice quality in indirect storytelling speech: A cross-narrator perspective in four European languages.
Speech Commun., 2017

An Anomalous Noise Events Detector for Dynamic Road Traffic Noise Mapping in Real-Life Urban and Suburban Environments.
Sensors, 2017

Design of a Mobile Low-Cost Sensor Network Using Urban Buses for Real-Time Ubiquitous Noise Monitoring.
Sensors, 2017

An FPGA-Based WASN for Remote Real-Time Monitoring of Endangered Species: A Case Study on the Birdsong Recognition of Botaurus stellaris.
Sensors, 2017

homeSound: Real-Time Audio Event Detection Based on High Performance Computing for Behaviour and Surveillance Remote Monitoring.
Sensors, 2017

Remote Acoustic Monitoring System for Noise Sensing.
Proceedings of the Online Engineering & Internet of Things, 2017

2016
The role of prosody and voice quality in indirect storytelling speech: Annotation methodology and expressive categories.
Speech Commun., 2016

Adding Singing Capabilities to Unit Selection TTS Through HNM-Based Conversion.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

2015
Look, listen and find: A purely audiovisual approach to online videos geotagging.
Inf. Sci., 2015

The role of prosody and voice quality in text-dependent categories of storytelling across languages.
Proceedings of the INTERSPEECH 2015, 2015

2014
Gesture synthesis adapted to speech emphasis.
Speech Commun., 2014

A one-shot domain-independent robust multimedia clustering methodology based on hybrid multimodal fusion.
Multim. Tools Appl., 2014

2013
Sentence-Based Sentiment Analysis for Expressive Text-to-Speech.
IEEE Trans. Speech Audio Process., 2013

Prosodic analysis of storytelling discourse modes and narrative situations oriented to text-to-speech synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Unified numerical simulation of the physics of voice. The EUNISON project.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Two-step detection of water sound events for the diagnostic and monitoring of dementia.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

2012
Gammatone Cepstral Coefficients: Biologically Inspired Features for Non-Speech Audio Classification.
IEEE Trans. Multim., 2012

Positional and confidence voting-based consensus functions for fuzzy cluster ensembles.
Fuzzy Sets Syst., 2012

Classification of audio scenes using Narrow-Band Autocorrelation features.
Proceedings of the 20th European Signal Processing Conference, 2012

Gammatone Wavelet features for sound classification in surveillance applications.
Proceedings of the 20th European Signal Processing Conference, 2012

Audio and video cues for geo-tagging online videos in the absence of metadata.
Proceedings of the 10th International Workshop on Content-Based Multimedia Indexing, 2012

2011
Efficient and reliable perceptual weight tuning for unit-selection text-to-speech synthesis based on active interactive genetic algorithms: A proof-of-concept.
Speech Commun., 2011

2010
Reliable Pitch Marking of Affective Speech at Peaks or Valleys Using Restricted Dynamic Programming.
IEEE Trans. Multim., 2010

Evolutionary process indicators for active IGAs applied to weight tuning in unit selection TTS synthesis.
Proceedings of the IEEE Congress on Evolutionary Computation, 2010

2009
Emulating Subjective Criteria in Corpus Validation.
Proceedings of the Encyclopedia of Artificial Intelligence (3 Volumes), 2009

GTM User Modeling for aIGA Weight Tuning in TTS Synthesis.
Proceedings of the Encyclopedia of Artificial Intelligence (3 Volumes), 2009

Automatic refinement of an expressive speech corpus assembling subjective perception and automatic classification.
Speech Commun., 2009

Combinación de clusterizadores difusos mediante voto posicional para clustering robusto de documentos.
Proces. del Leng. Natural, 2009

Sentiment classification in English from sentence-level annotations of emotions regarding models of affect.
Proceedings of the INTERSPEECH 2009, 2009

2008
Towards High-Quality Next-Generation Text-to-Speech Synthesis: A Multidomain Approach by Automatic Domain Classification.
IEEE Trans. Speech Audio Process., 2008

Predicción estadística de las discontinuidades espectrales del habla para síntesis concatenativa.
Proces. del Leng. Natural, 2008

Identificación de emociones a partir de texto usando desambiguación semántica.
Proces. del Leng. Natural, 2008

2007
BordaConsensus: a new consensus function for soft cluster ensembles.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Objective and Subjective Evaluation of an Expressive Speech Corpus.
Proceedings of the Advances in Nonlinear Speech Processing, 2007

Mixing HMM-Based Spanish Speech Synthesis with a CBR for Prosody Estimation.
Proceedings of the Advances in Nonlinear Speech Processing, 2007

Validation of an Expressive Speech Corpus by Mapping Automatic Classification to Subjective Evaluation.
Proceedings of the Computational and Ambient Intelligence, 2007

Assessing Students' Teamwork Performance by Means of Fuzzy Logic.
Proceedings of the Computational and Ambient Intelligence, 2007

Extracting User Preferences by GTM for aiGA Weight Tuning in Unit Selection Text-to-Speech Synthesis.
Proceedings of the Computational and Ambient Intelligence, 2007

Prosody Modelling of Spanish for Expressive Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2007

Text Clustering on Latent Thematic Spaces: Variants, Strengths and Weaknesses.
Proceedings of the Independent Component Analysis and Signal Separation, 2007

A Hierarchical Consensus Architecture for Robust Document Clustering.
Proceedings of the Advances in Information Retrieval, 2007

2006
Robust Document Clustering by Exploiting Feature Diversity in Cluster Ensembles.
Proces. del Leng. Natural, 2006

Transcripción fonética de acrónimos en castellano utilizando el algoritmo C4.5.
Proces. del Leng. Natural, 2006

Técnicas de representación de textos para clasificación no supervisada de documentos.
Proces. del Leng. Natural, 2006

Clasificación de textos adaptada para Conversión de Texto en Habla Multidominio.
Proces. del Leng. Natural, 2006

Feature diversity in cluster ensembles for robust document clustering.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

Multi-domain text-to-speech synthesis by automatic text classification.
Proceedings of the INTERSPEECH 2006, 2006

A pitch marks filtering algorithm based on restricted dynamic programming.
Proceedings of the INTERSPEECH 2006, 2006

Efficient Interactive Weight Tuning For TTS Synthesis: Reducing User Fatigue By Improving User Consistency.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Analyzing active interactive genetic algorithms using visual analytics.
Proceedings of the Genetic and Evolutionary Computation Conference, 2006

2005
High quality Spanish restricted-domain TTS oriented to a weather forecast application.
Proceedings of the INTERSPEECH 2005, 2005

2004
Perception-guided and phonetic clustering weight tuning based on diphone pairs for unit selection TTS.
Proceedings of the INTERSPEECH 2004, 2004

ICA-based hierarchical text classification for multi-domain text-to-speech synthesis.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Reliability in ICA-Based Text Classification.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004

Modeling and Synthesizing Emotional Speech for Catalan Text-to-Speech Synthesis.
Proceedings of the Affective Dialogue Systems, Tutorial and Research Workshop, 2004

2003
Arquitectura para conversión texto-habla multidominio.
Proces. del Leng. Natural, 2003

Ajuste subjetivo de pesos para selección de unidades a través de algoritmos genéticos interactivos.
Proces. del Leng. Natural, 2003

A hybrid method oriented to concatenative text-to-speech synthesis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Text to visual synthesis with appearance models.
Proceedings of the 2003 International Conference on Image Processing, 2003

2002
Un modelo híbrido orientado a la síntesis multimodal del habla.
Proces. del Leng. Natural, 2002

Previs: a person-specific realistic virtual speaker.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

2001
Asignación automática de marcas de pitch basada en programación dinámica.
Proces. del Leng. Natural, 2001

