Fernando Fernández Martínez

  • Technical University of Madrid (UPM), Information Processing and Telecomunications Center, Spain (PhD 2008)

According to our database1, Fernando Fernández Martínez authored at least 64 papers between 2003 and 2021.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.



In proceedings 
PhD thesis 


Online presence:

On csauthors.net:


Multimodal Emotion Recognition on RAVDESS Dataset Using Transfer Learning.
Sensors, 2021

Time Analysis in Human Activity Recognition.
Neural Process. Lett., 2021

A dynamic term discovery strategy for automatic speech recognizers with evolving dictionaries.
Expert Syst. Appl., 2021

GTH-UPM at DETOXIS-IberLEF 2021: Automatic Detection of Toxic Comments in Social Networks.
Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2021) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2021), 2021

A proposal for emotion recognition using speech features, transfer learning and convolutional neural networks.
Proceedings of the Fifth International Conference, 2021

An approach to intent detection and classification based on attentive recurrent neural networks.
Proceedings of the Fifth International Conference, 2021

GTH-UPM System for Albayzin Multimodal Diarization Challenge 2020.
Proceedings of the Fifth International Conference, 2021

Improving physical activity recognition using a new deep learning architecture and post-processing techniques.
Eng. Appl. Artif. Intell., 2020

Human activity recognition adapted to the type of movement.
Comput. Electr. Eng., 2020

Spotting celebrities among peers in a TV show: how to exploit web querying for weakly supervised visual diarization.
Proceedings of the IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, 2020

Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention and LSTM Models.
Proceedings of the Working Notes Proceedings of the MediaEval 2020 Workshop, 2020

Emotion and attention: Audiovisual models for group-level skin response recognition in short movies.
Web Intell., 2019

Project CAVIAR CApturing VIewers' Affective Response.
Proces. del Leng. Natural, 2019

A multi-threshold approach and a realistic error measure for vanishing point detection in natural landscapes.
Eng. Appl. Artif. Intell., 2019

Predicting Group-Level Skin Attention to Short Movies from Audio-Based LSTM-Mixture of Experts Models.
Proceedings of the Interspeech 2019, 2019

Attention-Based Word Vector Prediction with LSTMs and its Application to the OOV Problem in ASR.
Proceedings of the Interspeech 2019, 2019

Exploiting visual saliency for assessing the impact of car commercials upon viewers.
Multim. Tools Appl., 2018

Emotion and attention: predicting electrodermal activity through video visual descriptors.
Proceedings of the International Conference on Web Intelligence, 2017

Comparing visual descriptors and automatic rating strategies for video aesthetics prediction.
Signal Process. Image Commun., 2016

Feature extraction from smartphone inertial signals for human activity segmentation.
Signal Process., 2016

Succeeding metadata based annotation scheme and visual tips for the automatic assessment of video aesthetic quality in car commercials.
Expert Syst. Appl., 2015

Towards a robust affect recognition: Automatic facial expression recognition in 3D faces.
Expert Syst. Appl., 2015

Combining audio-visual features for viewers' perception classification of Youtube car commercials.
Proceedings of the 2nd International Workshop on Speech, Language and Audio in Multimedia, 2014

A web-based application for the management and evaluation of tutoring requests in PBL-based massive laboratories.
Proceedings of the IEEE Frontiers in Education Conference, 2014

A satisfaction-based model for affect recognition from conversational features in spoken dialog systems.
Speech Commun., 2013

I <i>Feel</i> You: The Design and Evaluation of a Domotic Affect-Sensitive Spoken Conversational Agent.
Sensors, 2013

On the dynamic adaptation of language models based on dialogue information.
Expert Syst. Appl., 2013

NEMOHIFI: an affective HiFi agent.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Design, development and field evaluation of a Spanish into sign language translation system.
Pattern Anal. Appl., 2012

Towards building intelligent speech interfaces through the use of more flexible, robust and natural dialogue management solutions.
Interact. Comput., 2012

Text categorization methods for automatic estimation of verbal intelligence.
Expert Syst. Appl., 2012

Assessing User Bias in Affect Detection within Context-Based Spoken Dialog Systems.
Proceedings of the 2012 International Conference on Privacy, 2012

Estimating Adaptation of Dialogue Partners with Different Verbal Intelligence.
Proceedings of the SIGDIAL 2012 Conference, 2012

Relating Dominance of Dialogue Participants with their Verbal Intelligence Scores.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Investigating Verbal Intelligence Using the TF-IDF Approach.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

I Feel You: Towards Affect-Sensitive Domotic Spoken Conversational Agents.
Proceedings of the Ambient Assisted Living and Home Care - 4th International Workshop, 2012

Mutual Information and Perplexity Based Clustering of Dialogue Information for Dynamic Adaptation of Language Models.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2012

Automatic Understanding of ATC Speech: Study of Prospectives and Field Experiments for Several Controller Positions.
IEEE Trans. Aerosp. Electron. Syst., 2011

Evaluation of a User-adapted Spoken Language Dialogue System - Measuring the Relevance of the Contextual Information Sources.
Proceedings of the ICAART 2011 - Proceedings of the 3rd International Conference on Agents and Artificial Intelligence, Volume 1, 2011

Clustering of syntactic and discursive information for the dynamic adaptation of Language Models.
Proces. del Leng. Natural, 2010

HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Flexible, Robust and Dynamic Dialogue Modeling with a Speech Dialogue Interface for Controlling a Hi-Fi Audio System.
Proceedings of the Database and Expert Systems Applications, 2010

Gestión de la identidad de locutor y perfiles de usuario en un sistema de diálogo.
Proces. del Leng. Natural, 2009

Speech Technology at Home: Enhanced Interfaces for People with Disabilities.
Intell. Autom. Soft Comput., 2009

Novel Applications of Neural Networks in Speech Technology Systems: Search Space Reduction and Prosodic Modeling.
Intell. Autom. Soft Comput., 2009

Using dialogue-based dynamic language models for improving speech recognition.
Proceedings of the INTERSPEECH 2009, 2009

Acoustic emotion recognition using dynamic Bayesian networks and multi-space distributions.
Proceedings of the INTERSPEECH 2009, 2009

A Bayesian NETWORKS approach for dialog modeling: The fusion BN.
Proceedings of the IEEE International Conference on Acoustics, 2009

Speech to sign language translation system for Spanish.
Speech Commun., 2008

Desarrollo de un Robot-Guía con Integración de un Sistema de Diálogo y Expresión de Emociones: Proyecto ROBINT.
Proces. del Leng. Natural, 2008

Aplicación de métodos estadísticos para la traducción de voz a Lengua de Signos.
Proces. del Leng. Natural, 2008

Evaluation of a spoken dialogue system for controlling a Hifi audio system.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Language identification using several sources of information with a multiple-Gaussian classifier.
Proceedings of the INTERSPEECH 2007, 2007

Language identification based on n-gram frequency ranking.
Proceedings of the INTERSPEECH 2007, 2007

Utilización de medidas de confianza en sistemas de comprensión del habla.
Proces. del Leng. Natural, 2005

Demostración de una interfaz vocal para el control de un sistema de alta fidelidad.
Proces. del Leng. Natural, 2005

Topic Identification based on Bayesian Belief Networks in the context of an Air Traffic Control Task.
Proces. del Leng. Natural, 2005

New word-level and sentence-level confidence scoring using graph theory calculus and its evaluation on speech understanding.
Proceedings of the INTERSPEECH 2005, 2005

Speech interface for controlling an hi-fi audio system based on a Bayesian belief networks approach for dialog modeling.
Proceedings of the INTERSPEECH 2005, 2005

Realización de sistemas de diálogo en una plataforma compatible con VoiceXML: proyecto GEMINI.
Proces. del Leng. Natural, 2004

Implementation of dialog applications in an open-source voiceXML platform.
Proceedings of the INTERSPEECH 2004, 2004

Language identification techniques based on full recognition in an air traffic control task.
Proceedings of the INTERSPEECH 2004, 2004

Sistema de comprensión de comunicaciones habladas para el control de tráfico aéreo del proyecto INVOCA.
Proces. del Leng. Natural, 2003

Demostración del sistema de comprensión de comunicaciones habladas para control de tráfico aéreo del proyecto INVOCA.
Proces. del Leng. Natural, 2003