We stand with Ukraine

We stand with Ukraine

Florian Eyben

Orcid: 0009-0003-0330-8545

According to our database¹, Florian Eyben authored at least 143 papers between 2007 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

Testing Correctness, Fairness, and Robustness of Speech Emotion Recognition Models.

[DOI]

,

Hagen Wierstorf

,

Ali Gürcan Özkil

,

,

Felix Burkhardt

,

Björn W. Schuller

IEEE Trans. Affect. Comput., 2025

Am I Blue or Is My Hobby Counting the Teardrops? Expression Leakage in Large Language Models as a Symptom of Irrelevancy Disruption.

[DOI]

,

,

,

,

Maximilian Schmitt

,

,

Felix Burkhardt

,

,

Björn W. Schuller

Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing, 2025

EmoDB 2.0: A Database of Emotional Speech in a World that is not Black or White but Grey.

[DOI]

Felix Burkhardt

,

Oliver Schrüfer

,

,

Hagen Wierstorf

,

,

,

Björn W. Schuller

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Domain adaptation and question-answer pooling for Aphasia modelling.

[DOI]

,

Monica González Machorro

,

Lisa Maria Ehlen

,

,

,

Cornelius J. Werner

,

Felix Burkhardt

,

Christian Kohlschein

,

,

Björn W. Schuller

Proceedings of the 8th International Conference on Natural Language and Speech Processing, 2025

Speech-Based Depressive Mood Detection in the Presence of Multiple Sclerosis: A Cross-Corpus and Cross-Lingual Study.

[DOI]

Monica González Machorro

,

,

,

Helly N. Hammer

,

,

,

,

Björn W. Schuller

Proceedings of the 8th International Conference on Natural Language and Speech Processing, 2025

Mental Wellbeing at Sea: A Prototype to Collect Speech Data in Maritime Settings.

[DOI]

,

Monica González Machorro

,

,

,

Matthias Kahlau

,

,

Björn W. Schuller

,

Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies, 2025

2024

VocDoc, what happened to my voice? Towards automatically capturing vocal fatigue in the wild.

[DOI]

Florian B. Pokorny

,

,

,

,

Claus Gerstenberger

,

,

,

,

Martin Hagmüller

,

Barbara Schuppler

,

,

Markus Gugatschka

Biomed. Signal Process. Control., February, 2024

Using voice analysis as an early indicator of risk for depression in young adults.

[DOI]

Klaus R. Scherer

,

Felix Burkhardt

,

,

,

Björn W. Schuller

CoRR, 2024

Wav2Small: Distilling Wav2Vec2 to 72K parameters for Low-Resource Speech emotion recognition.

[DOI]

Dionyssos Kounadis-Bastian

,

Oliver Schrüfer

,

,

Hagen Wierstorf

,

,

Felix Burkhardt

,

Björn W. Schuller

CoRR, 2024

Check Your Audio Data: Nkululeko for Bias Detection.

[DOI]

Felix Burkhardt

,

Bagus Tris Atmaja

,

,

,

Björn W. Schuller

Proceedings of the 27th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2024

Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition.

[DOI]

Oliver Schrüfer

,

,

Felix Burkhardt

,

,

Björn W. Schuller

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

A Comparative Analysis of Federated Learning for Speech-Based Cognitive Decline Detection.

[DOI]

Stefan Kalabakov

,

Monica González Machorro

,

,

Björn W. Schuller

,

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023

Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap.

[DOI]

Johannes Wagner

,

Andreas Triantafyllopoulos

,

Hagen Wierstorf

,

Maximilian Schmitt

,

Felix Burkhardt

,

,

Björn W. Schuller

IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

Multistage linguistic conditioning of convolutional layers for speech emotion recognition.

[DOI]

Andreas Triantafyllopoulos

,

,

,

,

,

Björn W. Schuller

Frontiers Comput. Sci., 2023

Testing Speech Emotion Recognition Machine Learning Models.

[DOI]

,

Hagen Wierstorf

,

Ali Gürcan Özkil

,

,

Felix Burkhardt

,

Björn W. Schuller

CoRR, 2023

Going Retro: Astonishingly Simple Yet Effective Rule-based Prosody Modelling for Speech Synthesis Simulating Emotion Dimensions.

[DOI]

Felix Burkhardt

,

,

,

Björn W. Schuller

CoRR, 2023

Speech-based Age and Gender Prediction with Transformers.

[DOI]

Felix Burkhardt

,

Johannes Wagner

,

Hagen Wierstorf

,

,

Björn W. Schuller

CoRR, 2023

audb - Sharing and Versioning of Audio and Annotation Data in Python.

[DOI]

Hagen Wierstorf

,

Johannes Wagner

,

,

Felix Burkhardt

,

Björn W. Schuller

CoRR, 2023

Towards Supporting an Early Diagnosis of Multiple Sclerosis using Vocal Features.

[DOI]

Monica González Machorro

,

,

,

Helly N. Hammer

,

,

,

,

,

,

,

Dagmar M. Schuller

,

Björn W. Schuller

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Nkululeko: Machine Learning Experiments on Speaker Characteristics Without Programming.

[DOI]

Felix Burkhardt

,

,

Björn W. Schuller

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Masking Speech Contents by Random Splicing: is Emotional Expression Preserved?

[DOI]

Felix Burkhardt

,

,

Matthias Kahlau

,

Klaus R. Scherer

,

,

Björn W. Schuller

Proceedings of the IEEE International Conference on Acoustics, 2023

Multimodal Recognition of Valence, Arousal and Dominance via Late-Fusion of Text, Audio and Facial Expressions.

[DOI]

Fabrizio Nunnari

,

,

,

Chirag Bhuvaneshwara

,

Panagiotis Paraskevas Filntisis

,

,

Felix Burkhardt

,

,

Björn W. Schuller

,

Proceedings of the 31st European Symposium on Artificial Neural Networks, 2023

2022

Voice Analysis for Neurological Disorder Recognition-A Systematic Review and Perspective on Emerging Trends.

[DOI]

,

,

,

Björn W. Schuller

,

Frontiers Digit. Health, 2022

A Comparative Cross Language View On Acted Databases Portraying Basic Emotions Utilising Machine Learning.

[DOI]

Felix Burkhardt

,

,

,

Hagen Wierstorf

,

,

Björn W. Schuller

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Nkululeko: A Tool For Rapid Speaker Characteristics Detection.

[DOI]

Felix Burkhardt

,

Johannes Wagner

,

Hagen Wierstorf

,

,

Björn W. Schuller

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Probing speech emotion recognition transformers for linguistic knowledge.

[DOI]

Andreas Triantafyllopoulos

,

Johannes Wagner

,

Hagen Wierstorf

,

Maximilian Schmitt

,

,

,

Felix Burkhardt

,

Björn W. Schuller

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Quantifying Cognitive Load from Voice using Transformer-Based Models and a Cross-Dataset Evaluation.

[DOI]

,

Arpita Kappattanavar

,

Maximilian Schmitt

,

Sidratul Moontaha

,

Johannes Wagner

,

,

Björn W. Schuller

,

Proceedings of the 21st IEEE International Conference on Machine Learning and Applications, 2022

2021

Speaking Corona? Human and Machine Recognition of COVID-19 from Voice.

[DOI]

,

Florian B. Pokorny

,

Katrin D. Bartl-Pokorny

,

,

,

,

,

Dagmar M. Schuller

,

,

Björn W. Schuller

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

Exploiting time-frequency patterns with LSTM-RNNs for low-bitrate audio restoration.

[DOI]

,

Björn W. Schuller

,

,

Dagmar Schuller

,

,

,

Neural Comput. Appl., 2020

The voice of COVID-19: Acoustic correlates of infection.

[DOI]

Katrin D. Bartl-Pokorny

,

Florian B. Pokorny

,

,

Shahin Amiriparian

,

Anastasia Semertzidou

,

,

,

Florian Schmidt

,

Rainer Schönweiler

,

,

Björn W. Schuller

CoRR, 2020

2019

Affective and behavioural computing: Lessons learnt from the First Computational Paralinguistics Challenge.

[DOI]

Björn W. Schuller

,

,

,

Fabien Ringeval

,

,

,

,

,

Alessandro Vinciarelli

,

Klaus R. Scherer

,

Mohamed Chetouani

,

Marcello Mortillaro

Comput. Speech Lang., 2019

On Laughter and Speech-Laugh, Based on Observations of Child-Robot Interaction.

[DOI]

,

,

,

Björn W. Schuller

CoRR, 2019

2018

audEERING's approach to the One-Minute-Gradual Emotion Challenge.

[DOI]

Andreas Triantafyllopoulos

,

,

,

Björn W. Schuller

CoRR, 2018

Emotion-Awareness for Intelligent Vehicle Assistants: A Research Agenda.

[DOI]

Proceedings of the 1st IEEE/ACM International Workshop on Software Engineering for AI in Autonomous Systems, 2018

Robust Laughter Detection for Wearable Wellbeing Sensing.

[DOI]

Gerhard Hagerer

,

Nicholas Cummins

,

,

Björn W. Schuller

Proceedings of the 2018 International Conference on Digital Health, 2018

2017

Enhancing LSTM RNN-Based Speech Overlap Detection by Artificially Mixed Data.

[DOI]

Gerhard Hagerer

,

,

,

Björn W. Schuller

Proceedings of the AES International Conference Semantic Audio 2017, 2017

A Paralinguistic Approach To Speaker Diarisation: Using Age, Gender, Voice Likability and Personality Traits.

[DOI]

,

,

,

Maximilian Schmitt

,

,

Björn W. Schuller

Proceedings of the 2017 ACM on Multimedia Conference, 2017

"Did you laugh enough today?" - Deep Neural Networks for Mobile and Wearable Laughter Trackers.

[DOI]

Gerhard Hagerer

,

Nicholas Cummins

,

,

Björn W. Schuller

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Seeking the SuperStar: Automatic assessment of perceived singing quality.

[DOI]

,

,

Maximilian Schmitt

,

,

Björn W. Schuller

Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Automatic multi-lingual arousal detection from voice applied to real product testing applications.

[DOI]

,

Matthias Unfried

,

Gerhard Hagerer

,

Björn W. Schuller

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Detecting Vocal Irony.

[DOI]

Felix Burkhardt

,

,

,

,

Björn W. Schuller

Proceedings of the Language Technologies for the Challenges of the Digital Age, 2017

VoicePlay - An affective sports game operated by speech emotion recognition based on the component process model.

[DOI]

Gerhard Hagerer

,

,

Dagmar Schuller

,

Klaus R. Scherer

,

Björn W. Schuller

Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017

Deep neural networks for anger detection from real life speech data.

[DOI]

,

,

Björn W. Schuller

,

Felix Burkhardt

Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017

2016

The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing.

[DOI]

,

Klaus R. Scherer

,

Björn W. Schuller

,

,

Elisabeth André

,

,

Laurence Y. Devillers

,

,

,

Shrikanth S. Narayanan

,

Khiet P. Truong

IEEE Trans. Affect. Comput., 2016

Real-Time Tracking of Speakers' Emotions, States, and Traits on Mobile Platforms.

[DOI]

,

,

Gerhard Hagerer

,

Björn W. Schuller

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Prediction of asynchronous dimensional emotion ratings from audiovisual and physiological data.

[DOI]

Fabien Ringeval

,

,

,

,

Jean-Philippe Thiran

,

Touradj Ebrahimi

,

,

Björn W. Schuller

Pattern Recognit. Lett., 2015

Emotion in the singing voice - a deeperlook at acoustic features in the light ofautomatic classification.

[DOI]

,

Gláucia L. Salomão

,

,

Klaus R. Scherer

,

Björn W. Schuller

EURASIP J. Audio Speech Music. Process., 2015

A Survey on perceived speaker traits: Personality, likability, pathology, and the first challenge.

[DOI]

Björn W. Schuller

,

,

,

,

Alessandro Vinciarelli

,

Felix Burkhardt

,

,

,

,

,

Gelareh Mohammadi

,

Comput. Speech Lang., 2015

Does my speech rock? automatic assessment of public speaking skills.

[DOI]

,

,

,

Guillaume Vidal

,

,

Eduardo Coutinho

,

,

Björn W. Schuller

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Non-linear prediction with LSTM recurrent neural networks for acoustic novelty detection.

[DOI]

,

Fabio Vesperini

,

,

,

Stefano Squartini

,

Björn W. Schuller

Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

A novel approach for automatic acoustic novelty detection using a denoising autoencoder with bidirectional LSTM neural networks.

[DOI]

,

Fabio Vesperini

,

,

Stefano Squartini

,

Björn W. Schuller

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Cross-corpus acoustic emotion recognition: Variances and strategies (Extended abstract).

[DOI]

Björn W. Schuller

,

Bogdan Vlasenko

,

,

Martin Wöllmer

,

André Stuhlsatz

,

Andreas Wendemuth

,

Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Building autonomous sensitive artificial listeners (Extended abstract).

[DOI]

,

Elisabetta Bevacqua

,

,

,

,

,

,

,

,

,

Catherine Pelachaud

,

Björn W. Schuller

,

Etienne de Sevin

,

Michel F. Valstar

,

Martin Wöllmer

Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Context-sensitive learning for enhanced audiovisual emotion classification (Extended abstract).

[DOI]

Angeliki Metallinou

,

Athanasios Katsamanis

,

Martin Wöllmer

,

,

Björn W. Schuller

,

Shrikanth S. Narayanan

Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

iHEARu-PLAY: Introducing a game for crowdsourced data collection for affective computing.

[DOI]

,

,

,

Björn W. Schuller

Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Real-time robust recognition of speakers' emotions and characteristics on mobile platforms.

[DOI]

,

,

,

Dagmar Schuller

,

Björn W. Schuller

Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014

Autoencoder-based Unsupervised Domain Adaptation for Speech Emotion Recognition.

[DOI]

,

,

,

Björn W. Schuller

IEEE Signal Process. Lett., 2014

Medium-term speaker states - A review on intoxication, sleepiness and the first challenge.

[DOI]

Björn W. Schuller

,

,

,

,

Jarek Krajewski

,

,

Comput. Speech Lang., 2014

A Broadcast News Corpus for Evaluation and Tuning of German LVCSR Systems.

[DOI]

,

Björn W. Schuller

,

,

Martin Wöllmer

,

CoRR, 2014

AVEC 2014: 3D Dimensional Affect and Depression Recognition Challenge.

[DOI]

Michel F. Valstar

,

Björn W. Schuller

,

,

Timur R. Almaev

,

,

Jarek Krajewski

,

,

Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge, 2014

Emotional Analysis of Music: A Comparison of Methods.

[DOI]

Mohammad Soleymani

,

,

,

Michael N. Caro

,

,

Konstantin Markov

,

Björn W. Schuller

,

Remco C. Veltkamp

,

,

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production.

[DOI]

Björn W. Schuller

,

Felix Friedmann

,

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load.

[DOI]

Björn W. Schuller

,

,

,

,

,

Fabien Ringeval

,

,

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Audio onset detection: A wavelet packet based approach with recurrent neural networks.

[DOI]

,

Giacomo Ferroni

,

,

Stefano Squartini

,

Björn W. Schuller

Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Emotion Recognition in the Wild: Incorporating Voice and Lip Activity in Multimodal Decision-Level Fusion.

[DOI]

Fabien Ringeval

,

Shahin Amiriparian

,

,

Klaus R. Scherer

,

Björn W. Schuller

Proceedings of the 16th International Conference on Multimodal Interaction, 2014

MAPTRAITS 2014: The First Audio/Visual Mapping Personality Traits Challenge.

[DOI]

Oya Çeliktutan

,

,

Evangelos Sariyanidi

,

,

Björn W. Schuller

Proceedings of the 2014 Workshop on Mapping Personality Traits Challenge and Workshop, 2014

MAPTRAITS 2014 - The First Audio/Visual Mapping Personality Traits Challenge - An Introduction: Perceived Personality and Social Dimensions.

[DOI]

Oya Çeliktutan

,

,

Evangelos Sariyanidi

,

,

Björn W. Schuller

Proceedings of the 16th International Conference on Multimodal Interaction, 2014

On-line continuous-time music mood regression with deep recurrent neural networks.

[DOI]

,

,

Björn W. Schuller

Proceedings of the IEEE International Conference on Acoustics, 2014

Single-channel speech separation with memory-enhanced recurrent neural networks.

[DOI]

,

,

Björn W. Schuller

Proceedings of the IEEE International Conference on Acoustics, 2014

Multi-resolution linear prediction based features for audio onset detection with bidirectional LSTM neural networks.

[DOI]

,

Giacomo Ferroni

,

,

Leonardo Gabrielli

,

Stefano Squartini

,

Björn W. Schuller

Proceedings of the IEEE International Conference on Acoustics, 2014

CCA based feature selection with application to continuous depression recognition from acoustic speech features.

[DOI]

,

,

Albert Ali Salah

,

Björn W. Schuller

Proceedings of the IEEE International Conference on Acoustics, 2014

A frequency-weighted post-filtering transform for compensation of the over-smoothing effect in HMM-based speech synthesis.

[DOI]

,

Yannis Agiomyrgiannakis

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

LSTM-Modeling of continuous emotions in an audiovisual affect recognition framework.

[DOI]

Martin Wöllmer

,

,

,

Björn W. Schuller

,

Image Vis. Comput., 2013

Likability of human voices: A feature analysis and a neural network regression approach to automatic likability estimation.

[DOI]

,

,

,

Björn W. Schuller

Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

AVEC 2013: the continuous audio/visual emotion and depression recognition challenge.

[DOI]

Michel F. Valstar

,

Björn W. Schuller

,

,

,

,

Sanjay Bilakhia

,

Sebastian Schnieder

,

,

Proceedings of the 3rd ACM international workshop on Audio/visual emotion challenge, 2013

Recent developments in openSMILE, the munich open-source multimedia feature extractor.

[DOI]

,

,

,

Björn W. Schuller

Proceedings of the ACM Multimedia Conference, 2013

The TUM Approach to the MediaEval Music Emotion Task Using Generic Affective Audio Features.

[DOI]

,

,

Björn W. Schuller

Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism.

[DOI]

Björn W. Schuller

,

,

,

Alessandro Vinciarelli

,

Klaus R. Scherer

,

Fabien Ringeval

,

Mohamed Chetouani

,

,

,

,

Marcello Mortillaro

,

,

Anna Polychroniou

,

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Detecting overlapping speech with long short-term memory recurrent neural networks.

[DOI]

Jürgen T. Geiger

,

,

Björn W. Schuller

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Using linguistic information to detect overlapping speech.

[DOI]

Jürgen T. Geiger

,

,

Nicholas W. D. Evans

,

Björn W. Schuller

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Affect recognition in real-life acoustic conditions - a new perspective on feature selection.

[DOI]

,

,

Björn W. Schuller

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

The acoustics of eye contact: detecting visual attention from conversational audio cues.

[DOI]

,

,

,

Björn W. Schuller

Proceedings of the 6th workshop on Eye gaze in intelligent human machine interaction: gaze in multimodal interaction, 2013

Automatic recognition of physiological parameters in the human voice: Heart rate and skin conductance.

[DOI]

Björn W. Schuller

,

Felix Friedmann

,

Proceedings of the IEEE International Conference on Acoustics, 2013

Real-life voice activity detection with LSTM Recurrent Neural Networks and an application to Hollywood movies.

[DOI]

,

,

Stefano Squartini

,

Björn W. Schuller

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

A multitask approach to continuous five-dimensional affect sensing in natural speech.

[DOI]

,

Martin Wöllmer

,

Björn W. Schuller

ACM Trans. Interact. Intell. Syst., 2012

Building Autonomous Sensitive Artificial Listeners.

[DOI]

,

Elisabetta Bevacqua

,

,

,

,

,

,

,

,

,

Catherine Pelachaud

,

Björn W. Schuller

,

Etienne de Sevin

,

Michel François Valstar

,

Martin Wöllmer

IEEE Trans. Affect. Comput., 2012

Context-Sensitive Learning for Enhanced Audiovisual Emotion Classification.

[DOI]

Angeliki Metallinou

,

Martin Wöllmer

,

Athanasios Katsamanis

,

,

Björn W. Schuller

,

Shrikanth S. Narayanan

IEEE Trans. Affect. Comput., 2012

Real-Time Activity Detection in a Multi-Talker Reverberated Environment.

[DOI]

Emanuele Principi

,

,

Martin Wöllmer

,

,

Stefano Squartini

,

Björn W. Schuller

Cogn. Comput., 2012

Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature Sets.

[DOI]

,

,

Nicolas H. Lehment

,

,

Björn W. Schuller

Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Temporal and Situational Context Modeling for Improved Dominance Recognition in Meetings.

[DOI]

Martin Wöllmer

,

,

Björn W. Schuller

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

The INTERSPEECH 2012 Speaker Trait Challenge.

[DOI]

Björn W. Schuller

,

,

,

,

Alessandro Vinciarelli

,

Felix Burkhardt

,

,

,

,

,

Gelareh Mohammadi

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

AVEC 2012: the continuous audio/visual emotion challenge.

[DOI]

Björn W. Schuller

,

Michel F. Valstar

,

,

,

Proceedings of the International Conference on Multimodal Interaction, 2012

Preserving actual dynamic trend of emotion in dimensional speech emotion recognition.

[DOI]

,

,

,

,

,

Björn W. Schuller

Proceedings of the International Conference on Multimodal Interaction, 2012

Improving generalisation and robustness of acoustic affect recognition.

[DOI]

,

Björn W. Schuller

,

Proceedings of the International Conference on Multimodal Interaction, 2012

Robust feature extraction for automatic recognition of vibrato singing in recorded polyphonic music.

[DOI]

,

,

,

,

,

Björn W. Schuller

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Audiovisual vocal outburst classification in noisy acoustic conditions.

[DOI]

,

Stavros Petridis

,

Björn W. Schuller

,

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Unsupervised clustering of emotion and voice styles for expressive TTS.

[DOI]

,

Sabine Buchholz

,

Norbert Braunschweiler

,

,

,

Mark J. F. Gales

,

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Real-Time Speech Separation by Semi-supervised Nonnegative Matrix Factorization.

[DOI]

,

,

,

,

Björn W. Schuller

Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Fully Automatic Audiovisual Emotion Recognition: Voice, Words, and the Face.

[DOI]

Martin Wöllmer

,

,

,

,

Björn W. Schuller

,

Proceedings of the 10th ITG Conference on Speech Communication, 2012

2011

Computational Assessment of Interest in Speech - Facing the Real-Life Challenge.

[DOI]

Martin Wöllmer

,

,

,

Björn W. Schuller

Künstliche Intell., 2011

Semantic Speech Tagging: Towards Combined Analysis of Speaker Traits.

[DOI]

Björn W. Schuller

,

Martin Wöllmer

,

,

,

Proceedings of the AES International Conference Semantic Audio 2011, 2011

Interacting with Emotional Virtual Agents.

[DOI]

Elisabetta Bevacqua

,

,

,

,

,

Catherine Pelachaud

,

,

Björn W. Schuller

,

Etienne de Sevin

,

Martin Wöllmer

Proceedings of the Intelligent Technologies for Interactive Entertainment, 2011

Acoustic-Linguistic Recognition of Interest in Speech with Bottleneck-BLSTM Nets.

[DOI]

Martin Wöllmer

,

,

,

Björn W. Schuller

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A multi-stream ASR framework for BLSTM modeling of conversational speech.

[DOI]

Martin Wöllmer

,

,

Björn W. Schuller

,

Proceedings of the IEEE International Conference on Acoustics, 2011

Combining monaural source separation with Long Short-Term Memory for increased robustness in vocalist gender recognition.

[DOI]

,

Jean-Louis Durrieu

,

,

,

Björn W. Schuller

Proceedings of the IEEE International Conference on Acoustics, 2011

Deep neural networks for acoustic emotion recognition: Raising the benchmarks.

[DOI]

André Stuhlsatz

,

Christine Meyer

,

,

,

Hans-Günter Meier

,

Björn W. Schuller

Proceedings of the IEEE International Conference on Acoustics, 2011

Syllabification of conversational speech using Bidirectional Long-Short-Term Memory Neural Networks.

[DOI]

Christian Landsiedel

,

,

,

,

Björn W. Schuller

Proceedings of the IEEE International Conference on Acoustics, 2011

Audiovisual classification of vocal outbursts in human conversation using Long-Short-Term Memory networks.

[DOI]

,

Stavros Petridis

,

Björn W. Schuller

,

George Tzimiropoulos

,

Stefanos Zafeiriou

,

Proceedings of the IEEE International Conference on Acoustics, 2011

Come and have an emotional workout with sensitive artificial listeners!

[DOI]

,

,

,

,

Michel François Valstar

,

,

,

,

,

,

Björn W. Schuller

,

Martin Wöllmer

,

Elisabetta Bevacqua

,

Catherine Pelachaud

,

Etienne de Sevin

Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

String-based audiovisual fusion of behavioural events for the assessment of dimensional affect.

[DOI]

,

Martin Wöllmer

,

Michel François Valstar

,

,

Björn W. Schuller

,

Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

AVEC 2011-The First International Audio/Visual Emotion Challenge.

[DOI]

Björn W. Schuller

,

Michel François Valstar

,

,

,

,

Proceedings of the Affective Computing and Intelligent Interaction, 2011

2010

Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies.

[DOI]

Björn W. Schuller

,

Bogdan Vlasenko

,

,

Martin Wöllmer

,

André Stuhlsatz

,

Andreas Wendemuth

,

IEEE Trans. Affect. Comput., 2010

Combining Long Short-Term Memory and Dynamic Bayesian Networks for Incremental Emotion-Sensitive Artificial Listening.

[DOI]

Martin Wöllmer

,

Björn W. Schuller

,

,

IEEE J. Sel. Top. Signal Process., 2010

On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues.

[DOI]

,

Martin Wöllmer

,

,

Björn W. Schuller

,

Ellen Douglas-Cowie

,

J. Multimodal User Interfaces, 2010

Bidirectional LSTM Networks for Context-Sensitive Keyword Detection in a Cognitive Virtual Agent Framework.

[DOI]

Martin Wöllmer

,

,

,

Björn W. Schuller

,

Cogn. Comput., 2010

Emotion on the Road - Necessity, Acceptance, and Feasibility of Affective Computing in the Car.

[DOI]

,

Martin Wöllmer

,

,

Björn W. Schuller

,

Christoph Blaschke

,

Berthold Färber

,

Nhu Nguyen-Thien

Adv. Hum. Comput. Interact., 2010

Opensmile: the munich versatile and fast open-source audio feature extractor.

[DOI]

,

Martin Wöllmer

,

Björn W. Schuller

Proceedings of the 18th International Conference on Multimedia 2010, 2010

3d gesture recognition applying long short-term memory and contextual knowledge in a CAVE.

[DOI]

,

,

Martin Wöllmer

,

,

Björn W. Schuller

,

,

,

Proceedings of the 1st ACM international workshop on Multimodal pervasive video analysis, 2010

Vocalist Gender Recognition in Recorded Popular Music.

[DOI]

Björn W. Schuller

,

Christoph Kozielski

,

,

,

Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Universal Onset Detection with Bidirectional Long Short-Term Memory Neural Networks.

[DOI]

,

Sebastian Böck

,

Björn W. Schuller

,

Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Long short-term memory networks for noise robust speech recognition.

[DOI]

Martin Wöllmer

,

,

,

Björn W. Schuller

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling.

[DOI]

Martin Wöllmer

,

Angeliki Metallinou

,

,

Björn W. Schuller

,

Shrikanth S. Narayanan

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Recognition of spontaneous conversational speech using long short-term memory phoneme predictions.

[DOI]

Martin Wöllmer

,

,

Björn W. Schuller

,

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Emotion recognition using imperfect speech recognition.

[DOI]

,

,

,

,

Björn W. Schuller

,

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Spoken term detection with Connectionist Temporal Classification: A novel hybrid CTC-DBN decoder.

[DOI]

Martin Wöllmer

,

,

Björn W. Schuller

,

Proceedings of the IEEE International Conference on Acoustics, 2010

Late fusion of individual engines for improved recognition of negative emotion in speech - learning vs. democratic vote.

[DOI]

Björn W. Schuller

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Being bored? Recognising natural interest by extensive audiovisual integration for real-life application.

[DOI]

Björn W. Schuller

,

,

,

,

Benedikt Hörnler

,

Martin Wöllmer

,

,

,

Image Vis. Comput., 2009

A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams.

[DOI]

Martin Wöllmer

,

Marc A. Al-Hames

,

,

Björn W. Schuller

,

Neurocomputing, 2009

Improving Keyword Spotting with a Tandem BLSTM-DBN Architecture.

[DOI]

Martin Wöllmer

,

,

,

Björn W. Schuller

,

Proceedings of the Advances in Nonlinear Speech Processing, 2009

Robust in-car spelling recognition - a tandem BLSTM-HMM approach.

[DOI]

Martin Wöllmer

,

,

Björn W. Schuller

,

,

Tobias Moosmayr

,

Nhu Nguyen-Thien

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks.

[DOI]

Martin Wöllmer

,

,

Björn W. Schuller

,

Ellen Douglas-Cowie

,

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks.

[DOI]

Martin Wöllmer

,

,

,

,

Björn W. Schuller

,

Proceedings of the IEEE International Conference on Acoustics, 2009

Robust vocabulary independent keyword spotting with graphical models.

[DOI]

Martin Wöllmer

,

,

Björn W. Schuller

,

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Acoustic emotion recognition: A benchmark comparison of performances.

[DOI]

Björn W. Schuller

,

Bogdan Vlasenko

,

,

,

Andreas Wendemuth

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

From speech to letters - using a novel neural network architecture for grapheme based ASR.

[DOI]

,

Martin Wöllmer

,

Björn W. Schuller

,

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

A demonstration of audiovisual sensitive artificial listeners.

[DOI]

,

Elisabetta Bevacqua

,

,

,

,

,

,

,

Catherine Pelachaud

,

Björn W. Schuller

,

Etienne de Sevin

,

Michel F. Valstar

,

Martin Wöllmer

Proceedings of the Affective Computing and Intelligent Interaction, 2009

OpenEAR - Introducing the munich open-source emotion and affect recognition toolkit.

[DOI]

,

Martin Wöllmer

,

Björn W. Schuller

Proceedings of the Affective Computing and Intelligent Interaction, 2009

2008

Tango or Waltz?: Putting Ballroom Dance Style into Tempo Detection.

[DOI]

Björn W. Schuller

,

,

EURASIP J. Audio Speech Music. Process., 2008

Static and Dynamic Modelling for the Recognition of Non-verbal Vocalisations in Conversational Speech.

[DOI]

Björn W. Schuller

,

,

Proceedings of the Perception in Multimodal Dialogue Systems, 2008

Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies.

[DOI]

Martin Wöllmer

,

,

,

Björn W. Schuller

,

,

Ellen Douglas-Cowie

,

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Music Thumbnailing Incorporating Harmony- and Rhythm Structure.

[DOI]

Björn W. Schuller

,

Florian Dibiasi

,

,

Proceedings of the Adaptive Multimedia Retrieval. Identifying, 2008

2007

Wearable Assistance for the Ballroom-Dance Hobbyist - Holistic Rhythm Analysis and Dance-Style Classification.

[DOI]

,

Björn W. Schuller

,

,

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Fast and Robust Meter and Tempo Recognition for the Automatic Discrimination of Ballroom Dance Styles.

[DOI]

Björn W. Schuller

,

,

Proceedings of the IEEE International Conference on Acoustics, 2007

Loading...