Björn W. Schuller

According to our database, Björn W. Schuller authored at least 530 papers between 2002 and 2019.

Bibliography

2019
Affective and behavioural computing: Lessons learnt from the First Computational Paralinguistics Challenge.
Computer Speech & Language, 2019

2018
Introduction to the Special Section on Multimedia Computing and Applications of Socio-Affective Behaviors in the Wild.
TOMCCAP, 2018

MixedEmotions: An Open-Source Toolbox for Multimodal Emotion Analysis.
IEEE Trans. Multimedia, 2018

Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments.
ACM TIST, 2018

Guest Editorial Special Issue on Computational Intelligence for End-to-End Audio Processing.
IEEE Trans. Emerging Topics in Comput. Intellig., 2018

Semisupervised Autoencoders for Speech Emotion Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2018

Editorial: Transactions on Affective Computing - Good Reasons for Joy and Excitement.
IEEE Trans. Affective Computing, 2018

Personalized machine learning for robot perception of affect and engagement in autism therapy.
Science Robotics, 2018

Deep Canonical Time Warping for Simultaneous Alignment and Representation Learning of Sequences.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Three recent trends in Paralinguistics on the way to omniscient machine intelligence.
J. Multimodal User Interfaces, 2018

Scaling Speech Enhancement in Unseen Environments with Noise Embeddings.
CoRR, 2018

Dynamic Difficulty Awareness Training for Continuous Emotion Prediction.
CoRR, 2018

Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives.
CoRR, 2018

Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification.
CoRR, 2018

audEERING's approach to the One-Minute-Gradual Emotion Challenge.
CoRR, 2018

Deep Affect Prediction in-the-wild: Aff-Wild Database and Challenge, Deep Architectures, and Beyond.
CoRR, 2018

Calibrated Prediction Intervals for Neural Network Regressors.
CoRR, 2018

Applying Cooperative Machine Learning to Speed Up the Annotation of Social Signals in Large Multi-modal Corpora.
CoRR, 2018

End2You - The Imperial Toolkit for Multimodal Profiling by End-to-End Learning.
CoRR, 2018

Weakly Supervised One-Shot Detection with Attention Siamese Networks.
CoRR, 2018

The Age of Artificial Emotional Intelligence.
IEEE Computer, 2018

What Affective Computing Reveals about Autistic Children's Facial Expressions of Joy or Fear.
IEEE Computer, 2018

Snoring classified: The Munich-Passau Snore Sound Corpus.
Comp. in Bio. and Med., 2018

Speech emotion recognition: two decades in a nutshell, benchmarks, and ongoing trends.
Commun. ACM, 2018

Leveraging Unlabeled Data for Emotion Recognition With Enhanced Collaborative Semi-Supervised Learning.
IEEE Access, 2018

Calibrated Prediction Intervals for Neural Network Regressors.
IEEE Access, 2018

Trustability-Based Dynamic Active Learning for Crowdsourced Labelling of Emotional Audio Data.
IEEE Access, 2018

How Good Is Your Model 'Really'? On 'Wildness' of the In-the-Wild Speech-Based Affect Recognisers.
Proceedings of the Speech and Computer - 20th International Conference, 2018

You Sound Like Your Counterpart: Interpersonal Speech Analysis.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Summary for AVEC 2018: Bipolar Disorder and Cross-Cultural Affect Recognition.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

ASMMC-MMAC 2018: The Joint Workshop of the 4th Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data Workshop.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Passive monitoring and geo-based prediction of mobile network vehicle-to-server communication.
Proceedings of the 14th International Wireless Communications & Mobile Computing Conference, 2018

Automated Classification of Children's Linguistic versus Non-Linguistic Vocalisations.
Proceedings of the Interspeech 2018, 2018

The INTERSPEECH 2018 Computational Paralinguistics Challenge: Atypical & Self-Assessed Affect, Crying & Heart Beats.
Proceedings of the Interspeech 2018, 2018

State of Mind: Classification through Self-reported Affect and Word Use in Speech.
Proceedings of the Interspeech 2018, 2018

How Did You like 2017? Detection of Language Markers of Depression and Narcissism in Personal Narratives.
Proceedings of the Interspeech 2018, 2018

Categorical vs Dimensional Perception of Italian Emotional Speech.
Proceedings of the Interspeech 2018, 2018

Annotator Trustability-based Cooperative Learning Solutions for Intelligent Audio Analysis.
Proceedings of the Interspeech 2018, 2018

Towards Temporal Modelling of Categorical Speech Emotion Recognition.
Proceedings of the Interspeech 2018, 2018

The Perception and Analysis of the Likeability and Human Likeness of Synthesized Speech.
Proceedings of the Interspeech 2018, 2018

Recognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks.
Proceedings of the Interspeech 2018, 2018

Bags in Bag: Generating Context-Aware Bags for Tracking Emotions from Speech.
Proceedings of the Interspeech 2018, 2018

Evolving Learning for Analysing Mood-Related Infant Vocalisation.
Proceedings of the Interspeech 2018, 2018

Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Bag-of-Deep-Features: Noise-Robust Deep Feature Representations for Audio Analysis.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Affective Image Content Analysis: A Comprehensive Survey.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Emotion-Awareness for Intelligent Vehicle Assistants: A Research Agenda.
Proceedings of the 1st IEEE/ACM International Workshop on Software Engineering for AI in Autonomous Systems, 2018

Deep End-to-End Representation Learning for Food Type Recognition from Speech.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

EAT: The ICMI 2018 Eating Analysis and Tracking Challenge.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Exploring A New Method for Food Likability Rating Based on DT-CWT Theory.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Introducing an Emotion-Driven Assistance System for Cognitively Impaired Individuals.
Proceedings of the Computers Helping People with Special Needs, 2018

End-to-End Speech Emotion Recognition Using Deep Neural Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

What is my Dog Trying to Tell Me? The Automatic Recognition of the Context and Perceived Emotion of Dog Barks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multimodal Bag-of-Words for Cross Domains Sentiment Analysis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Towards Conditional Adversarial Training for Predicting Emotions from Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Low Level Texture Features for Snore Sound Discrimination.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Deep Unsupervised Representation Learning for Abnormal Heart Sound Classification.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Learning Image-based Representations for Heart Sound Classification.
Proceedings of the 2018 International Conference on Digital Health, 2018

Robust Laughter Detection for Wearable Wellbeing Sensing.
Proceedings of the 2018 International Conference on Digital Health, 2018

2017
Acquisition of Affect.
Proceedings of the Emotions and Personality in Personalized Services, 2017

Stacked denoising autoencoders for sentiment analysis: a review.
Wiley Interdiscip. Rev. Data Min. Knowl. Discov., 2017

Classification of the Excitation Location of Snore Sounds in the Upper Airway by Acoustic Multifeature Analysis.
IEEE Trans. Biomed. Engineering, 2017

A Two-Dimensional Framework of Multiple Kernel Subspace Learning for Recognizing Emotion in Speech.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2017

Editorial: IEEE Transactions on Affective Computing - Challenges and Chances.
IEEE Trans. Affective Computing, 2017

Continuous Estimation of Emotions in Speech by Dynamic Cooperative Speaker Models.
IEEE Trans. Affective Computing, 2017

Advanced Data Exploitation in Speech Analysis: An overview.
IEEE Signal Process. Mag., 2017

Universum Autoencoder-Based Domain Adaptation for Speech Emotion Recognition.
IEEE Signal Process. Lett., 2017

A Deep Matrix Factorization Method for Learning Attribute Representations.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

End-to-End Multimodal Emotion Recognition Using Deep Neural Networks.
J. Sel. Topics Signal Processing, 2017

openXBOW - Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit.
Journal of Machine Learning Research, 2017

auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks.
Journal of Machine Learning Research, 2017

Guest editorial: Multimodal sentiment analysis and mining in the wild.
Image Vision Comput., 2017

A survey of multimodal sentiment analysis.
Image Vision Comput., 2017

Strength modelling for real-world automatic continuous affect recognition from audiovisual signals.
Image Vision Comput., 2017

Measuring Engagement in Robot-Assisted Autism Therapy: A Cross-Cultural Study.
Front. Robotics and AI, 2017

auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks.
CoRR, 2017

Learning Audio Sequence Representations for Acoustic Event Classification.
CoRR, 2017

Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments.
CoRR, 2017

Deep Structured Learning for Facial Action Unit Intensity Estimation.
CoRR, 2017

End-to-End Multimodal Emotion Recognition using Deep Neural Networks.
CoRR, 2017

DeepCoder: Semi-parametric Variational Autoencoders for Facial Action Unit Intensity Estimation.
CoRR, 2017

Fast Single-Class Classification and the Principle of Logit Separation.
CoRR, 2017

Can Affective Computing Save Lives? Meet Mobile Health.
IEEE Computer, 2017

A Novel Graphical Technique for Combinational Logic Representation and Optimization.
Complexity, 2017

Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection.
Comp. Int. and Neurosc., 2017

Recognizing Emotions From Whispered Speech Based on Acoustic Feature Transfer Learning.
IEEE Access, 2017

Automatic speaker analysis 2.0: Hearing the bigger picture.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2017

Big Data, Deep Learning - At the Edge of X-Ray Speaker Analysis.
Proceedings of the Speech and Computer - 19th International Conference, 2017

Enhancing LSTM RNN-Based Speech Overlap Detection by Artificially Mixed Data.
Proceedings of the AES International Conference Semantic Audio 2017, 2017

AVEC 2017: Real-life Depression, and Affect Recognition Workshop and Challenge.
Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA, USA, October 23, 2017

Summary for AVEC 2017: Real-life Depression and Affect Challenge and Workshop.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

From Hard to Soft: Towards more Human-like Emotion Recognition by Modelling the Perception Uncertainty.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

An Image-based Deep Spectrum Feature Representation for the Recognition of Emotional Speech.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

A Paralinguistic Approach To Speaker Diarisation: Using Age, Gender, Voice Likability and Personality Traits.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

The Perception of Emotion in the Singing Voice: The Understanding of Music Mood for Music Organisation.
Proceedings of the 4th International Workshop on Digital Libraries for Musicology, 2017

The SEILS Dataset: Symbolically Encoded Scores in Modern-Early Notation for Computational Musicology.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Implementing Gender-Dependent Vowel-Level Analysis for Boosting Speech-Based Depression Recognition.
Proceedings of the Interspeech 2017, 2017


Discussion.
Proceedings of the Interspeech 2017, 2017

Earlier Identification of Children with Autism Spectrum Disorder: An Automatic Vocalisation-Based Approach.
Proceedings of the Interspeech 2017, 2017

The Perception of Emotions in Noisified Nonsense Speech.
Proceedings of the Interspeech 2017, 2017

Emotional Speech of Mentally and Physically Disabled Individuals: Introducing the EmotAsS Database and First Findings.
Proceedings of the Interspeech 2017, 2017

Towards Intelligent Crowdsourcing for Audio Data Annotation: Integrating Active Learning in the Real World.
Proceedings of the Interspeech 2017, 2017

"Did you laugh enough today?" - Deep Neural Networks for Mobile and Wearable Laughter Trackers.
Proceedings of the Interspeech 2017, 2017

An 'End-to-Evolution' Hybrid Approach for Snore Sound Classification.
Proceedings of the Interspeech 2017, 2017

Spotting Social Signals in Conversational Speech over IP: A Deep Learning Perspective.
Proceedings of the Interspeech 2017, 2017

Automatic Classification of Autistic Child Vocalisations: A Novel Database and Results.
Proceedings of the Interspeech 2017, 2017

Snore Sound Classification Using Image-Based Deep Spectrum Features.
Proceedings of the Interspeech 2017, 2017

Cross-Domain Classification of Drowsiness in Speech: The Case of Alcohol Intoxication and Sleep Deprivation.
Proceedings of the Interspeech 2017, 2017

Towards intoxicated speech recognition.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Deep recurrent music writer: Memory-enhanced variational autoencoder-based musical score composition and an objective measure.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Seeking the SuperStar: Automatic assessment of perceived singing quality.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Stimulation of psychological listener experiences by semi-automatically composed electroacoustic environments.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

End-to-end learning for dimensional emotion recognition from physiological signals.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Prediction-based learning for continuous emotion recognition in speech.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Reconstruction-error-based learning for continuous emotion recognition in speech.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Automatic multi-lingual arousal detection from voice applied to real product testing applications.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multi-task deep neural network with shared hidden layers: Breaking down the wall between emotion representations.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A machine learning based system for the automatic evaluation of aphasia speech.
Proceedings of the 19th IEEE International Conference on e-Health Networking, 2017

Detecting Vocal Irony.
Proceedings of the Language Technologies for the Challenges of the Digital Age, 2017

Recognising Guitar Effects - Which Acoustic Features Really Matter?
Proceedings of the 47. Jahrestagung der Gesellschaft für Informatik, 2017

Automatic Guitar String Detection by String-Inverse Frequency Estimation.
Proceedings of the 47. Jahrestagung der Gesellschaft für Informatik, 2017

Snore sound recognition: On wavelets and classifiers from deep nets to kernels.
Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2017

"You sound ill, take the day off": Automatic recognition of speech affected by upper respiratory tract infection.
Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2017

Speech-based Diagnosis of Autism Spectrum Condition by Generative Adversarial Network Representations.
Proceedings of the 2017 International Conference on Digital Health, 2017

Contextual Bidirectional Long Short-Term Memory Recurrent Neural Network Language Models: A Generative Approach to Sentiment Analysis.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Deep Structured Learning for Facial Action Unit Intensity Estimation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Reading the Author and Speaker: Towards a Holistic and Deep Approach on Automatic Assessment of What is in One's Words.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2017

Perception of Paralinguistic Traits in Synthesized Voices.
Proceedings of the 12th International Audio Mostly Conference on Augmented and Participatory Sound and Music Experiences, 2017

Enhancing Speech-Based Depression Detection Through Gender Dependent Vowel-Level Formant Features.
Proceedings of the Artificial Intelligence in Medicine, 2017

Emotion-augmented machine learning: Overview of an emerging domain.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

Multimodal multimodel emotion analysis as linked data.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017

The effect of personality trait, age, and gender on the performance of automatic speech valence recognition.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

VoicePlay - An affective sports game operated by speech emotion recognition based on the component process model.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017

Deep neural networks for anger detection from real life speech data.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017

CAST a database: Rapid targeted large-scale big data acquisition via small-world modelling of social media platforms.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

Feature selection in multimodal continuous emotion prediction.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017

Sentiment analysis using image-based deep spectrum features.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017

Tunable Sensitivity to Large Errors in Neural Network Training.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Computational Analysis of Vocal Expression of Affect: Trends and Challenges.
Proceedings of the Social Signal Processing, 2017

Automatic Analysis of Social Emotions.
Proceedings of the Social Signal Processing, 2017

Automatic Analysis of Aesthetics: Human Beauty, Attractiveness, and Likability.
Proceedings of the Social Signal Processing, 2017

2016
Route and Stopping Intent Prediction at Intersections From Car Fleet Data.
IEEE Trans. Intelligent Vehicles, 2016

Editorial: Transactions on Affective Computing - Changes and Continuance.
IEEE Trans. Affective Computing, 2016

The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing.
IEEE Trans. Affective Computing, 2016

New avenues in knowledge bases for natural language processing.
Knowl.-Based Syst., 2016

Stream fusion for multi-stream automatic speech recognition.
I. J. Speech Technology, 2016

AVEC 2016 - Depression, Mood, and Emotion Recognition Workshop and Challenge.
CoRR, 2016

openXBOW - Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit.
CoRR, 2016

Tunable Sensitivity to Large Errors in Neural Network Training.
CoRR, 2016

Convolutional RNN: an Enhanced Model for Extracting Features from Sequential Data.
CoRR, 2016

Using Computer Intelligence for Depression Diagnosis and Crowdsourcing.
IEEE Computer, 2016

The Effect of Narrow-Band Transmission on Recognition of Paralinguistic Information From Human Vocalizations.
IEEE Access, 2016

Exploitation of Phase-Based Features for Whispered Speech Emotion Recognition.
IEEE Access, 2016

AVEC 2016: Depression, Mood, and Emotion Recognition Workshop and Challenge.
Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, 2016

Summary for AVEC 2016: Depression, Mood, and Emotion Recognition Workshop and Challenge.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Spectral and Cepstral Audio Noise Reduction Techniques in Speech Emotion Recognition.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Social and Affective Robotics Tutorial.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Tendencies regarding the effect of emotional intensity in inter corpus phoneme-level speech emotion modelling.
Proceedings of the 26th IEEE International Workshop on Machine Learning for Signal Processing, 2016

Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Fisher Kernels on Phase-Based Features for Speech Emotion Recognition.
Proceedings of the Dialogues with Social Robots, 2016

Facing Realism in Spontaneous Emotion Recognition from Speech: Feature Enhancement by Autoencoder with LSTM Neural Networks.
Proceedings of the Interspeech 2016, 2016


The INTERSPEECH 2016 Computational Paralinguistics Challenge: A Summary of Results.
Proceedings of the Interspeech 2016, 2016

The Native Language Sub-Challenge: The Data.
Proceedings of the Interspeech 2016, 2016

The Sincerity Sub-Challenge: The Data.
Proceedings of the Interspeech 2016, 2016

The Deception Sub-Challenge: The Data.
Proceedings of the Interspeech 2016, 2016

The INTERSPEECH 2016 Computational Paralinguistics Challenge: Deception, Sincerity & Native Language.
Proceedings of the Interspeech 2016, 2016

At the Border of Acoustics and Linguistics: Bag-of-Audio-Words for the Recognition of Emotions in Speech.
Proceedings of the Interspeech 2016, 2016

Enhancing Multilingual Recognition of Emotion in Speech by Language Identification.
Proceedings of the Interspeech 2016, 2016

Automatic Analysis of Typical and Atypical Encoding of Spontaneous Emotion in the Voice of Children.
Proceedings of the Interspeech 2016, 2016

Manual versus Automated: The Challenging Routine of Infant Vocalisation Segmentation in Home Videos to Study Neuro(mal)development.
Proceedings of the Interspeech 2016, 2016

Does She Speak RTT? Towards an Earlier Identification of Rett Syndrome Through Intelligent Pre-Linguistic Vocalisation Analysis.
Proceedings of the Interspeech 2016, 2016

Deep Bidirectional Long Short-Term Memory Recurrent Neural Networks for Grapheme-to-Phoneme Conversion Utilizing Complex Many-to-Many Alignments.
Proceedings of the Interspeech 2016, 2016

Real-Time Tracking of Speakers' Emotions, States, and Traits on Mobile Platforms.
Proceedings of the Interspeech 2016, 2016

Convolutional Neural Networks with Data Augmentation for Classifying Speakers' Native Language.
Proceedings of the Interspeech 2016, 2016

Is Deception Emotional? An Emotion-Driven Predictive Approach.
Proceedings of the Interspeech 2016, 2016

Sincerity and Deception in Speech: Two Sides of the Same Coin? A Transfer- and Multi-Task Learning Perspective.
Proceedings of the Interspeech 2016, 2016

Convolutional RNN: An enhanced model for extracting features from sequential data.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

Discriminatively Trained Recurrent Neural Networks for Continuous Dimensional Emotion Recognition from Audio.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Driver Frustration Detection from Audio and Video in the Wild.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Detecting road surface wetness from audio: A deep learning approach.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Multiscale kernel locally penalised discriminant analysis exemplified by emotion recognition in speech.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Ask Alice: an artificial retrieval of information agent.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Language proficiency assessment of English L2 speakers based on joint analysis of prosody and native language.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Semi-autonomous data enrichment based on cross-task labelling of missing targets for holistic speech analysis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Enhanced semi-supervised learning for multimodal emotion recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Adieu features? End-to-end speech emotion recognition using a deep convolutional recurrent network.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Audio watermarking based on empirical mode decomposition and beat detection.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Cross lingual speech emotion recognition using canonical correlation analysis on principal component subspace.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Wavelet features for classification of VOTE snore sounds.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

GPU-based fast signal processing for large amounts of snore sound data.
Proceedings of the IEEE 5th Global Conference on Consumer Electronics, 2016

Deep Canonical Time Warping.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

SenticNet 4: A Semantic Resource for Sentiment Analysis Based on Conceptual Primitives.
Proceedings of the COLING 2016, 2016

MEC 2016: The Multimodal Emotion Recognition Challenge of CCPR 2016.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016

The University of Passau Open Emotion Recognition System for the Multimodal Emotion Challenge.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016

Towards Cross-lingual Automatic Diagnosis of Autism Spectrum Condition in Children's Voices.
Proceedings of the 12. ITG Symposium on Speech Communication, 2016

A Bag-of-Audio-Words Approach for Snore Sounds' Excitation Localisation.
Proceedings of the 12. ITG Symposium on Speech Communication, 2016

2015
Sentiment analysis and opinion mining: on optimal parameters and performances.
Wiley Interdiscip. Rev. Data Min. Knowl. Discov., 2015

Cooperative Learning and its Application to Emotion Recognition from Speech.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2015

Prediction of asynchronous dimensional emotion ratings from audiovisual and physiological data.
Pattern Recognition Letters, 2015

Introducing CURRENNT: the Munich open-source CUDA recurrent neural network toolkit.
Journal of Machine Learning Research, 2015

Emotion in the singing voice - a deeper look at acoustic features in the light of automatic classification.
EURASIP J. Audio, Speech and Music Processing, 2015

Introduction.
Computer Speech & Language, 2015

A Survey on perceived speaker traits: Personality, likability, pathology, and the first challenge.
Computer Speech & Language, 2015

A deep matrix factorization method for learning attribute representations.
CoRR, 2015

The ICSTM+TUM+UP Approach to the 3rd CHIME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Shaping Dereverberation and LSTM Language Models.
CoRR, 2015

Do Computers Have Personality?
IEEE Computer, 2015

Speech Analysis in the Big Data Era.
Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Exploring the Importance of Individual Differences to the Automatic Estimation of Emotions Induced by Music.
Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, 2015

AV+EC 2015: The First Affect Recognition Challenge Bridging Across Audio, Video, and Physiological Data.
Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, 2015

AVEC 2015: The 5th International Audio/Visual Emotion Challenge and Workshop.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

The ICL-TUM-PASSAU Approach for the MediaEval 2015 "Affective Impact of Movies" Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Automatically Estimating Emotion in Music with Deep Long-Short Term Memory Recurrent Neural Networks.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Modelling User Affect and Sentiment in Intelligent User Interfaces: A Tutorial Overview.
Proceedings of the 20th International Conference on Intelligent User Interfaces, 2015

IDGEI 2015: 3rd International Workshop on Intelligent Digital Games for Empowerment and Inclusion.
Proceedings of the 20th International Conference on Intelligent User Interfaces, 2015

Dimensionality reduction for speech emotion features by multiscale kernels.
Proceedings of the INTERSPEECH 2015, 2015

The INTERSPEECH 2015 computational paralinguistics challenge: nativeness, Parkinson's & eating condition.
Proceedings of the INTERSPEECH 2015, 2015

Face reading from speech - predicting facial action units from audio cues.
Proceedings of the INTERSPEECH 2015, 2015

Typicality and emotion in the voice of children with autism spectrum condition: evidence across three languages.
Proceedings of the INTERSPEECH 2015, 2015

Does my speech rock? Automatic assessment of public speaking skills.
Proceedings of the INTERSPEECH 2015, 2015

Non-linear prediction with LSTM recurrent neural networks for acoustic novelty detection.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Dynamic Active Learning Based on Agreement and Applied to Emotion Recognition in Spoken Interactions.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

ERM4CT 2015: Workshop on Emotion Representations and Modelling for Companion Systems.
Proceedings of the International Workshop on Emotion Representations and Modelling for Companion Technologies, 2015

A novel approach for automatic acoustic novelty detection using a denoising autoencoder with bidirectional LSTM neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015

Bird sounds classification by large scale acoustic features and extreme learning machine.
Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

On rater reliability and agreement based dynamic active learning.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Cross-corpus acoustic emotion recognition: Variances and strategies (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Building autonomous sensitive artificial listeners (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Detection of negative emotions in speech signals using bags-of-audio-words.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Context-sensitive learning for enhanced audiovisual emotion classification (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

iHEARu-PLAY: Introducing a game for crowdsourced data collection for affective computing.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Cross-language acoustic emotion recognition: An overview and some tendencies.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Real-time robust recognition of speakers' emotions and characteristics on mobile platforms.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Intelligent user interfaces in digital games for empowerment and inclusion.
Proceedings of the 12th International Conference on Advances in Computer Entertainment Technology, 2015

2014
Channel mapping using bidirectional long short-term memory for dereverberation in hands-free voice controlled devices.
IEEE Trans. Consumer Electronics, 2014

Memory-Enhanced Neural Networks and NMF for Robust ASR.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2014

Distributing Recognition in Computational Paralinguistics.
IEEE Trans. Affective Computing, 2014

Autoencoder-based Unsupervised Domain Adaptation for Speech Emotion Recognition.
IEEE Signal Process. Lett., 2014

Affective neural networks and cognitive learning systems for big data analysis.
Neural Networks, 2014

The TUM Gait from Audio, Image and Depth (GAID) database: Multimodal recognition of subjects and traits.
J. Visual Communication and Image Representation, 2014

Probabilistic speech feature extraction with context-sensitive Bottleneck neural networks.
Neurocomputing, 2014

Feature enhancement by deep LSTM networks for ASR in reverberant multisource environments.
Computer Speech & Language, 2014

Medium-term speaker states - A review on intoxication, sleepiness and the first challenge.
Computer Speech & Language, 2014

Introduction to the Special Issue on Broadening the View on Speaker Analysis.
Computer Speech & Language, 2014

A Broadcast News Corpus for Evaluation and Tuning of German LVCSR Systems.
CoRR, 2014

The state of play of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions.
CoRR, 2014

Acoustic Gait-based Person Identification using Hidden Markov Models.
CoRR, 2014

On-Line NMF-Based Stereo Up-Mixing of Speech Improves Perceived Reduction of Non-Stationary Noise.
Proceedings of the AES International Conference on Semantic Audio 2014, 2014

On the Influence of Alcohol Intoxication on Speaker Recognition.
Proceedings of the AES International Conference on Semantic Audio 2014, 2014

AVEC 2014: 3D Dimensional Affect and Depression Recognition Challenge.
Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge, 2014

AVEC 2014: the 4th international audio/visual emotion challenge and workshop.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Emotional Analysis of Music: A Comparison of Methods.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

The Munich LSTM-RNN Approach to the MediaEval 2014 "Emotion in Music" Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

IDGEI 2014: 2nd international workshop on intelligent digital games for empowerment and inclusion.
Proceedings of the 19th International Conference on Intelligent User Interfaces, 2014

The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load.
Proceedings of the INTERSPEECH 2014, 2014

Robust speech recognition using long short-term memory recurrent neural networks for hybrid acoustic modelling.
Proceedings of the INTERSPEECH 2014, 2014

Investigating NMF speech enhancement for neural network based acoustic models.
Proceedings of the INTERSPEECH 2014, 2014

Audio onset detection: A wavelet packet based approach with recurrent neural networks.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Transfer learning emotion manifestation across music and speech.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Linked Source and Target Domain Subspace Feature Transfer Learning - Exemplified by Speech Emotion Recognition.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

A Deep Semi-NMF Model for Learning Hidden Representations.
Proceedings of the 31st International Conference on Machine Learning, 2014

Emotion Recognition in the Wild: Incorporating Voice and Lip Activity in Multimodal Decision-Level Fusion.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

ERM4HCI 2014: The 2nd Workshop on Emotion Representation and Modelling in Human-Computer-Interaction-Systems.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Acoustic Gait-based Person Identification using Hidden Markov Models.
Proceedings of the 2014 Workshop on Mapping Personality Traits Challenge and Workshop, 2014

MAPTRAITS 2014: The First Audio/Visual Mapping Personality Traits Challenge.
Proceedings of the 2014 Workshop on Mapping Personality Traits Challenge and Workshop, 2014

MAPTRAITS 2014 - The First Audio/Visual Mapping Personality Traits Challenge - An Introduction: Perceived Personality and Social Dimensions.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Modeling gender information for emotion recognition using Denoising autoencoder.
Proceedings of the IEEE International Conference on Acoustics, 2014

Deep recurrent de-noising auto-encoder and blind de-reverberation for reverberated speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

On-line continuous-time music mood regression with deep recurrent neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Single-channel speech separation with memory-enhanced recurrent neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Multi-resolution linear prediction based features for audio onset detection with bidirectional LSTM neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

CCA based feature selection with application to continuous depression recognition from acoustic speech features.
Proceedings of the IEEE International Conference on Acoustics, 2014

Introducing shared-hidden-layer autoencoders for transfer learning and their application in acoustic emotion recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Social signal classification using deep BLSTM recurrent neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Discriminatively trained recurrent neural networks for single-channel speech separation.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

2013
Keyword spotting exploiting Long Short-Term Memory.
Speech Communication, 2013

Serious Gaming for Behavior Change: The State of Play.
IEEE Pervasive Computing, 2013

LSTM-Modeling of continuous emotions in an audiovisual affect recognition framework.
Image Vision Comput., 2013

Categorical and dimensional affect analysis in continuous input: Current trends and future directions.
Image Vision Comput., 2013

Introduction To The Special Issue On Affect Analysis In Continuous Input.
Image Vision Comput., 2013

Words that Fascinate the Listener: Predicting Affective Ratings of On-Line Lectures.
IJDET, 2013

YouTube Movie Reviews: Sentiment Analysis in an Audio-Visual Context.
IEEE Intelligent Systems, 2013

New Avenues in Opinion Mining and Sentiment Analysis.
IEEE Intelligent Systems, 2013

Statistical Approaches to Concept-Level Sentiment Analysis.
IEEE Intelligent Systems, 2013

Knowledge-Based Approaches to Concept-Level Sentiment Analysis.
IEEE Intelligent Systems, 2013

Computational Audio Analysis (Dagstuhl Seminar 13451).
Dagstuhl Reports, 2013

Noise robust ASR in reverberated multisource environments applying convolutive NMF and Long Short-Term Memory.
Computer Speech & Language, 2013

Paralinguistics in speech and language - State-of-the-art and the challenge.
Computer Speech & Language, 2013

Introduction to the special issue on Paralinguistics in Naturalistic Speech and Language.
Computer Speech & Language, 2013

6th International Symposium on Attention in Cognitive Systems 2013.
CoRR, 2013

A Real-Time Speech Enhancement Framework in Noisy and Reverberated Acoustic Scenarios.
Cognitive Computation, 2013

Likability of human voices: A feature analysis and a neural network regression approach to automatic likability estimation.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

Large-scale audio feature extraction and SVM for acoustic scene classification.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

AVEC 2013: the continuous audio/visual emotion and depression recognition challenge.
Proceedings of the 3rd ACM international workshop on Audio/visual emotion challenge, 2013

Workshop summary for the 3rd international audio/visual emotion challenge and workshop (AVEC'13).
Proceedings of the ACM Multimedia Conference, 2013

Recent developments in openSMILE, the Munich open-source multimedia feature extractor.
Proceedings of the ACM Multimedia Conference, 2013

The TUM Approach to the MediaEval Music Emotion Task Using Generic Affective Audio Features.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Active learning by label uncertainty for acoustic emotion recognition.
Proceedings of the INTERSPEECH 2013, 2013

The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism.
Proceedings of the INTERSPEECH 2013, 2013

Active learning for dimensional speech emotion recognition.
Proceedings of the INTERSPEECH 2013, 2013

Detecting overlapping speech with long short-term memory recurrent neural networks.
Proceedings of the INTERSPEECH 2013, 2013

Using linguistic information to detect overlapping speech.
Proceedings of the INTERSPEECH 2013, 2013

Affect recognition in real-life acoustic conditions - a new perspective on feature selection.
Proceedings of the INTERSPEECH 2013, 2013

Influence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classification.
Proceedings of the Man-Machine Interactions 3, 2013

ERM4HCI 2013: the 1st workshop on emotion representation and modelling in human-computer-interaction-systems.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

The acoustics of eye contact: detecting visual attention from conversational audio cues.
Proceedings of the 6th workshop on Eye gaze in intelligent human machine interaction: gaze in multimodal interaction, 2013

Co-training succeeds in Computational Paralinguistics.
Proceedings of the IEEE International Conference on Acoustics, 2013

Feature enhancement by bidirectional LSTM networks for conversational speech recognition in highly non-stationary noise.
Proceedings of the IEEE International Conference on Acoustics, 2013

Probabilistic ASR feature extraction applying context-sensitive connectionist temporal classification networks.
Proceedings of the IEEE International Conference on Acoustics, 2013

Speaker trait characterization in web videos: Uniting speech, language, and facial features.
Proceedings of the IEEE International Conference on Acoustics, 2013

A discriminative approach to polyphonic piano note transcription using supervised non-negative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2013

Acoustic Geo-Sensing: Recognising cyclists' route, route direction, and route progress from cell-phone audio.
Proceedings of the IEEE International Conference on Acoustics, 2013

Automatic recognition of physiological parameters in the human voice: Heart rate and skin conductance.
Proceedings of the IEEE International Conference on Acoustics, 2013

A comparative study on sparsity penalties for NMF-based speech separation: Beyond LP-norms.
Proceedings of the IEEE International Conference on Acoustics, 2013

Integrating noise estimation and factorization-based speech separation: A novel hybrid approach.
Proceedings of the IEEE International Conference on Acoustics, 2013

Off-line refinement of audio-to-score alignment by observation template adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Gait-based person identification by spectral, cepstral and energy-related audio features.
Proceedings of the IEEE International Conference on Acoustics, 2013

Real-life voice activity detection with LSTM Recurrent Neural Networks and an application to Hollywood movies.
Proceedings of the IEEE International Conference on Acoustics, 2013

Hierarchical neural networks and enhanced class posteriors for social signal classification.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Sparse Autoencoder-Based Feature Transfer Learning for Speech Emotion Recognition.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

Intelligent Audio Analysis.
Signals and communication technology, Springer, ISBN: 978-3-642-36805-9, 2013

2012
Optimization and Parallelization of Monaural Source Separation Algorithms in the openBliSSART Toolkit.
Signal Processing Systems, 2012

Neural Networks and Learning Systems Come Together.
IEEE Trans. Neural Netw. Learning Syst., 2012

A multitask approach to continuous five-dimensional affect sensing in natural speech.
TiiS, 2012

The Voice of Leadership: Models and Performances of Automatic Analysis in Online Speeches.
IEEE Trans. Affective Computing, 2012

Guest Editorial: Special Section on Naturalistic Affect Resources for System Building and Evaluation.
IEEE Trans. Affective Computing, 2012

Building Autonomous Sensitive Artificial Listeners.
IEEE Trans. Affective Computing, 2012

Context-Sensitive Learning for Enhanced Audiovisual Emotion Classification.
IEEE Trans. Affective Computing, 2012

The Computational Paralinguistics Challenge [Social Sciences].
IEEE Signal Process. Mag., 2012

Synthesized speech for model training in cross-corpus recognition of human emotion.
I. J. Speech Technology, 2012

Applying multiple classifiers and non-linear dynamics features for detecting sleepiness from speech.
Neurocomputing, 2012

Emotion and mental state recognition from speech.
EURASIP J. Adv. Sig. Proc., 2012

Cognitive and Emotional Information Processing for Human-Machine Interaction.
Cognitive Computation, 2012

Real-Time Activity Detection in a Multi-Talker Reverberated Environment.
Cognitive Computation, 2012

Emotion in the speech of children with autism spectrum conditions: prosody and everything else.
Proceedings of the Third Workshop on Child, Computer and Interaction, 2012

Speech, Emotion, Age, Language, Task, and Typicality: Trying to Disentangle Performance and Feature Relevance.
Proceedings of the 2012 International Conference on Privacy, 2012

Dimensional and continuous analysis of emotions for multimedia applications: a tutorial overview.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature Sets.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Dominance Detection in a Reverberated Acoustic Scenario.
Proceedings of the Advances in Neural Networks - ISNN 2012, 2012

Score-Informed Leading Voice Separation from Monaural Audio.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Towards distributed recognition of emotion from speech.
Proceedings of the 5th International Symposium on Communications, 2012

Active Learning by Sparse Instance Tracking and Classifier Confidence in Acoustic Emotion Recognition.
Proceedings of the INTERSPEECH 2012, 2012

Temporal and Situational Context Modeling for Improved Dominance Recognition in Meetings.
Proceedings of the INTERSPEECH 2012, 2012

Combining Bottleneck-BLSTM and Semi-Supervised Sparse NMF for Recognition of Conversational Speech in Highly Instationary Noise.
Proceedings of the INTERSPEECH 2012, 2012

Discrimination of Linguistic and Non-Linguistic Vocalizations in Spontaneous Speech: Intra- and Inter-Corpus Perspectives.
Proceedings of the INTERSPEECH 2012, 2012

Improving Recognition of Speaker States and Traits by Cumulative Evidence: Intoxication, Sleepiness, Age and Gender.
Proceedings of the INTERSPEECH 2012, 2012

Novel Metrics of Speech Rhythm for the Assessment of Emotion.
Proceedings of the INTERSPEECH 2012, 2012

Convolutive Non-Negative Sparse Coding and New Features for Speech Overlap Handling in Speaker Diarization.
Proceedings of the INTERSPEECH 2012, 2012

Confidence Measures in Speech Emotion Recognition Based on Semi-supervised Learning.
Proceedings of the INTERSPEECH 2012, 2012

Likability Classification - A Not so Deep Neural Network Approach.
Proceedings of the INTERSPEECH 2012, 2012

AVEC 2012: the continuous audio/visual emotion challenge.
Proceedings of the International Conference on Multimodal Interaction, 2012

AVEC 2012: the continuous audio/visual emotion challenge - an introduction.
Proceedings of the International Conference on Multimodal Interaction, 2012

Preserving actual dynamic trend of emotion in dimensional speech emotion recognition.
Proceedings of the International Conference on Multimodal Interaction, 2012

Improving generalisation and robustness of acoustic affect recognition.
Proceedings of the International Conference on Multimodal Interaction, 2012

Semi-supervised learning helps in sound event classification.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Analyzing the memory of BLSTM Neural Networks for enhanced emotion classification in dyadic spoken interactions.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize?
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Supervised and semi-supervised suppression of background music in monaural speech recordings.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Robust feature extraction for automatic recognition of vibrato singing in recorded polyphonic music.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Speech overlap detection and attribution using convolutive non-negative sparse coding.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Automatic recognition of emotion evoked by general sound events.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Fine-tuning HMMs for nonverbal vocalizations in spontaneous speech: A multicorpus perspective.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Audiovisual vocal outburst classification in noisy acoustic conditions.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Real-Time Speech Separation by Semi-supervised Nonnegative Matrix Factorization.
Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Speech overlap detection using convolutive non-negative sparse coding: New improvements and insights.
Proceedings of the 20th European Signal Processing Conference, 2012

Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines.
Proceedings of the Multimodal Music Processing, 2012

Towards Automatic Intoxication Detection from Speech in Real-Life Acoustic Environments.
Proceedings of the 10. ITG Conference on Speech Communication, 2012

Fully Automatic Audiovisual Emotion Recognition: Voice, Words, and the Face.
Proceedings of the 10. ITG Conference on Speech Communication, 2012

Sparse, Hierarchical and Semi-Supervised Base Learning for Monaural Enhancement of Conversational Speech.
Proceedings of the 10. ITG Conference on Speech Communication, 2012

Exploring Nonnegative Matrix Factorization for Audio Classification: Application to Speaker Recognition.
Proceedings of the 10. ITG Conference on Speech Communication, 2012

Confidence Measures for Speech Emotion Recognition: A Start.
Proceedings of the 10. ITG Conference on Speech Communication, 2012

2011
Tandem decoding of children's speech for keyword detection in a child-robot interaction scenario.
TSLP, 2011

Online Driver Distraction Detection Using Long Short-Term Memory.
IEEE Trans. Intelligent Transportation Systems, 2011

Recognizing Affect from Linguistic Information in 3D Continuous Space.
IEEE Trans. Affective Computing, 2011

Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge.
Speech Communication, 2011

Introduction to the special issue on sensing emotion and affect - Facing realism in speech processing.
Speech Communication, 2011

Computational Assessment of Interest in Speech - Facing the Real-Life Challenge.
KI, 2011

Affective speaker state analysis in the presence of reverberation.
I. J. Speech Technology, 2011

Recognition of Nonprototypical Emotions in Reverberated and Noisy Speech by Nonnegative Matrix Factorization.
EURASIP J. Adv. Sig. Proc., 2011

Whodunnit - Searching for the most important feature types signalling emotion-related user states in speech.
Computer Speech & Language, 2011

Semantic Speech Tagging: Towards Combined Analysis of Speaker Traits.
Proceedings of the AES International Conference Semantic Audio 2011, 2011

Enhancing Spontaneous Speech Recognition with BLSTM Features.
Proceedings of the Advances in Nonlinear Speech Processing, 2011

A Real-Time Speech Enhancement Framework for Multi-party Meetings.
Proceedings of the Advances in Nonlinear Speech Processing, 2011

Robust Multi-stream Keyword and Non-linguistic Vocalization Detection for Computationally Intelligent Virtual Agents.
Proceedings of the Advances in Neural Networks - ISNN 2011, 2011

Automatic Assessment of Singer Traits in Popular Music: Gender, Age, Height and Race.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Multi-Modal Non-Prototypical Music Mood Analysis in Continuous Space: Reliability and Performances.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Interacting with Emotional Virtual Agents.
Proceedings of the Intelligent Technologies for Interactive Entertainment, 2011

Speech-Based Non-Prototypical Affect Recognition for Child-Robot Interaction in Reverberated Environments.
Proceedings of the INTERSPEECH 2011, 2011

Acoustic-Linguistic Recognition of Interest in Speech with Bottleneck-BLSTM Nets.
Proceedings of the INTERSPEECH 2011, 2011

Feature Frame Stacking in RNN-Based Tandem ASR Systems - Learned vs. Predefined Context.
Proceedings of the INTERSPEECH 2011, 2011

Using Multiple Databases for Training in Emotion Recognition: To Unite or to Vote?
Proceedings of the INTERSPEECH 2011, 2011

The INTERSPEECH 2011 Speaker State Challenge.
Proceedings of the INTERSPEECH 2011, 2011

Learning New Acoustic Events in an HMM-Based System Using MAP Adaptation.
Proceedings of the INTERSPEECH 2011, 2011

"Would You Buy a Car from Me?" - On the Likability of Telephone Voices.
Proceedings of the INTERSPEECH 2011, 2011

Real-Time Speech Recognition in a Multi-talker Reverberated Acoustic Scenario.
Proceedings of the Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence, 2011

A multi-stream ASR framework for BLSTM modeling of conversational speech.
Proceedings of the IEEE International Conference on Acoustics, 2011

Localization of non-linguistic events in spontaneous speech by Non-Negative Matrix Factorization and Long Short-Term Memory.
Proceedings of the IEEE International Conference on Acoustics, 2011

Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations.
Proceedings of the IEEE International Conference on Acoustics, 2011

OpenBliSSART: Design and evaluation of a research toolkit for Blind Source Separation in Audio Recognition Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2011

Combining monaural source separation with Long Short-Term Memory for increased robustness in vocalist gender recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Deep neural networks for acoustic emotion recognition: Raising the benchmarks.
Proceedings of the IEEE International Conference on Acoustics, 2011

Syllabification of conversational speech using Bidirectional Long-Short-Term Memory Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2011

Audiovisual classification of vocal outbursts in human conversation using Long-Short-Term Memory networks.
Proceedings of the IEEE International Conference on Acoustics, 2011

Come and have an emotional workout with sensitive artificial listeners!
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

Emotion representation, analysis and synthesis in continuous space: A survey.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

String-based audiovisual fusion of behavioural events for the assessment of dimensional affect.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

Ten Recent Trends in Computational Paralinguistics.
Proceedings of the Cognitive Behavioural Systems, 2011

Conversational Speech Recognition in Non-stationary Reverberated Environments.
Proceedings of the Cognitive Behavioural Systems, 2011

Unsupervised learning in cross-corpus acoustic emotion recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

A novel bottleneck-BLSTM front-end for feature-level context modeling in conversational speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

AVEC 2011-The First International Audio/Visual Emotion Challenge.
Proceedings of the Affective Computing and Intelligent Interaction, 2011

The First Audio/Visual Emotion Challenge and Workshop - An Introduction.
Proceedings of the Affective Computing and Intelligent Interaction, 2011

Voice and Speech Analysis in Search of States and Traits.
Proceedings of the Computer Analysis of Human Behavior, 2011

2010
Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies.
IEEE Trans. Affective Computing, 2010

Combining Long Short-Term Memory and Dynamic Bayesian Networks for Incremental Emotion-Sensitive Artificial Listening.
J. Sel. Topics Signal Processing, 2010

On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues.
J. Multimodal User Interfaces, 2010

On the Impact of Children's Emotional Speech on Acoustic and Language Models.
EURASIP J. Audio, Speech and Music Processing, 2010

Determination of Nonprototypical Valence and Arousal in Popular Music: Features and Performances.
EURASIP J. Audio, Speech and Music Processing, 2010

Bidirectional LSTM Networks for Context-Sensitive Keyword Detection in a Cognitive Virtual Agent Framework.
Cognitive Computation, 2010

Emotion on the Road - Necessity, Acceptance, and Feasibility of Affective Computing in the Car.
Adv. Human-Computer Interaction, 2010

Segmenting into Adequate Units for Automatic Recognition of Emotion-Related Episodes: A Speech-Based Approach.
Adv. Human-Computer Interaction, 2010

openSMILE: the Munich versatile and fast open-source audio feature extractor.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

3D gesture recognition applying long short-term memory and contextual knowledge in a CAVE.
Proceedings of the 1st ACM international workshop on Multimodal pervasive video analysis, 2010

CINEMO - A French Spoken Language Resource for Complex Emotions: Facts and Baselines.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Vocalist Gender Recognition in Recorded Popular Music.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Universal Onset Detection with Bidirectional Long Short-Term Memory Neural Networks.
Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Long short-term memory networks for noise robust speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling.
Proceedings of the INTERSPEECH 2010, 2010

Recognition of spontaneous conversational speech using long short-term memory phoneme predictions.
Proceedings of the INTERSPEECH 2010, 2010

The INTERSPEECH 2010 paralinguistic challenge.
Proceedings of the INTERSPEECH 2010, 2010

Incremental acoustic valence recognition: an inter-corpus perspective on features, matching, and performance in a gating paradigm.
Proceedings of the INTERSPEECH 2010, 2010

Emotion recognition using imperfect speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Spoken term detection with Connectionist Temporal Classification: A novel hybrid CTC-DBN decoder.
Proceedings of the IEEE International Conference on Acoustics, 2010

Non-negative matrix factorization as noise-robust feature extractor for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Discrimination of speech and non-linguistic vocalizations by Non-Negative Matrix Factorization.
Proceedings of the IEEE International Conference on Acoustics, 2010

Late fusion of individual engines for improved recognition of negative emotion in speech - learning vs. democratic vote.
Proceedings of the IEEE International Conference on Acoustics, 2010

Learning with synthesized speech for automatic emotion recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Learning and Knowledge-Based Sentiment Analysis in Movie Review Key Excerpts.
Proceedings of the Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces. Theoretical and Practical Issues, 2010

Real Time Person Tracking and Behavior Interpretation in Multi Camera Scenarios Applying Homography and Coupled HMMs.
Proceedings of the Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues, 2010

2009
Being bored? Recognising natural interest by extensive audiovisual integration for real-life application.
Image Vision Comput., 2009

A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams.
Neurocomputing, 2009

Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement.
EURASIP J. Audio, Speech and Music Processing, 2009

Applying Bayes Markov chains for the detection of ATM related scenarios.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2009), 2009

Improving Keyword Spotting with a Tandem BLSTM-DBN Architecture.
Proceedings of the Advances in Nonlinear Speech Processing, 2009

Robust in-car spelling recognition - a tandem BLSTM-HMM approach.
Proceedings of the INTERSPEECH 2009, 2009

Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks.
Proceedings of the INTERSPEECH 2009, 2009

The INTERSPEECH 2009 emotion challenge.
Proceedings of the INTERSPEECH 2009, 2009

Recognising interest in conversational speech - comparing bag of frames and supra-segmental features.
Proceedings of the INTERSPEECH 2009, 2009

Audio chord labeling by musiological modeling and beat-synchronization.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Speech control in surgery: A field analysis and strategies.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Boosting multi-modal camera selection with semantic features.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

"The Godfather" vs. "Chaos": Comparing Linguistic Analysis Based on On-line Knowledge Sources and Bags-of-N-Grams for Movie Review Valence Estimation.
Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009

GMs in On-Line Handwritten Whiteboard Note Recognition: The Influence of Implementation and Modeling.
Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009

Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks.
Proceedings of the IEEE International Conference on Acoustics, 2009

Emotion recognition from speech: Putting ASR in the loop.
Proceedings of the IEEE International Conference on Acoustics, 2009

Robust vocabulary independent keyword spotting with graphical models.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Acoustic emotion recognition: A benchmark comparison of performances.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

From speech to letters - using a novel neural network architecture for grapheme based ASR.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

The hinterland of emotions: Facing the open-microphone challenge.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

A demonstration of audiovisual sensitive artificial listeners.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

OpenEAR - Introducing the Munich open-source emotion and affect recognition toolkit.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

2008
Tango or Waltz?: Putting Ballroom Dance Style into Tempo Detection.
EURASIP J. Audio, Speech and Music Processing, 2008

Detecting problems in spoken child-computer interaction.
Proceedings of the 1st Workshop on Child, Computer and Interaction, 2008

Does affect affect automatic recognition of children's speech?
Proceedings of the 1st Workshop on Child, Computer and Interaction, 2008

Low-Level Fusion of Audio, Video Feature for Multi-Modal Emotion Recognition.
Proceedings of the VISAPP 2008: Third International Conference on Computer Vision Theory and Applications, Funchal, Madeira, Portugal, January 22-25, 2008, 2008

Emotion sensitive speech control for human-robot interaction in minimal invasive surgery.
Proceedings of the 17th IEEE International Symposium on Robot and Human Interactive Communication, 2008

On the Influence of Phonetic Content Variation for Acoustic Emotion Recognition.
Proceedings of the Perception in Multimodal Dialogue Systems, 2008

Static and Dynamic Modelling for the Recognition of Non-verbal Vocalisations in Conversational Speech.
Proceedings of the Perception in Multimodal Dialogue Systems, 2008

Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies.
Proceedings of the INTERSPEECH 2008, 2008

Balancing spoken content adaptation and unit length in the recognition of emotion and interest.
Proceedings of the INTERSPEECH 2008, 2008

Patterns, prototypes, performance: classifying emotional user states.
Proceedings of the INTERSPEECH 2008, 2008

Prosodic and spectral features within segment-based acoustic modeling.
Proceedings of the INTERSPEECH 2008, 2008

Speech recognition in noisy environments using a switching linear dynamic model for feature enhancement.
Proceedings of the INTERSPEECH 2008, 2008

Detection of security related affect and behaviour in passenger transport.
Proceedings of the INTERSPEECH 2008, 2008

Combining speech recognition and acoustic word emotion models for robust text-independent emotion recognition.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Applying multi layer homography for multi camera person tracking.
Proceedings of the 2008 Second ACM/IEEE International Conference on Distributed Smart Cameras, 2008

Brute-forcing hierarchical functionals for paralinguistics: A waste of feature space?
Proceedings of the IEEE International Conference on Acoustics, 2008

Mothers, adults, children, pets - towards the acoustics of intimacy.
Proceedings of the IEEE International Conference on Acoustics, 2008

Switching Linear Dynamic Models for Noise Robust In-Car Speech Recognition.
Proceedings of the Pattern Recognition, 2008

Music Thumbnailing Incorporating Harmony- and Rhythm Structure.
Proceedings of the Adaptive Multimedia Retrieval. Identifying, 2008

2007
Mensch, Maschine, Emotion: Erkennung aus sprachlicher und manueller Interaktion.
VDM, ISBN: 978-3-8364-1522-4, 2007

Combining frame and turn-level information for robust recognition of emotions within speech.
Proceedings of the INTERSPEECH 2007, 2007

The relevance of feature type for the automatic classification of emotional user states: low level descriptors and functionals.
Proceedings of the INTERSPEECH 2007, 2007

Audiovisual recognition of spontaneous interest within conversations.
Proceedings of the 9th International Conference on Multimodal Interfaces, 2007

Hidden Conditional Random Fields for Meeting Segmentation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Wearable Assistance for the Ballroom-Dance Hobbyist - Holistic Rhythm Analysis and Dance-Style Classification.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Suspicious Behavior Detection in Public Transport by Fusion of Low-Level Video Descriptors.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Towards More Reality in the Recognition of Emotional Speech.
Proceedings of the IEEE International Conference on Acoustics, 2007

Fast and Robust Meter and Tempo Recognition for the Automatic Discrimination of Ballroom Dance Styles.
Proceedings of the IEEE International Conference on Acoustics, 2007

Audiovisual Behavior Modeling by Combined Feature Spaces.
Proceedings of the IEEE International Conference on Acoustics, 2007

Comparing one and two-stage acoustic modeling in the recognition of emotion in speech.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Frame vs. Turn-Level: Emotion Recognition from Speech Considering Static and Dynamic Processing.
Proceedings of the Affective Computing and Intelligent Interaction, 2007

What Should a Generic Emotion Markup Language Be Able to Represent?
Proceedings of the Affective Computing and Intelligent Interaction, 2007

On the Necessity and Feasibility of Detecting a Driver's Emotional State While Driving.
Proceedings of the Affective Computing and Intelligent Interaction, 2007

2006
Timing levels in segment-based speech emotion recognition.
Proceedings of the INTERSPEECH 2006, 2006

Recognition of interest in human conversational speech.
Proceedings of the INTERSPEECH 2006, 2006

Efficient Recognition of Authentic Dynamic Facial Expressions on the Feedtum Database.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Musical Signal Type Discrimination based on Large Open Feature Sets.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Evolutionary Feature Generation in Speech Emotion Recognition.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Segmentation and Recognition of Meeting Events using a Two-Layered HMM and a Combined MLP-HMM Approach.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

A Two-Layer Graphical Model for Combined Video Shot and Scene Boundary Detection.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Submotions for Hidden Markov Model Based Dynamic Facial Action Recognition.
Proceedings of the International Conference on Image Processing, 2006

A Combined LSTM-RNN - HMM - Approach for Meeting Event Segmentation and Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Automatische Emotionserkennung aus sprachlicher und manueller Interaktion.
PhD thesis, 2005

Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles.
Proceedings of the INTERSPEECH 2005, 2005

Feature Selection and Stacking for Robust Discrimination of Speech, Monophonic Singing, and Polyphonic Music.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Speaker Independent Speech Emotion Recognition by Ensemble Classification.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Video Based Online Behavior Detection Using Probabilistic Multi Stream Fusion.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Video based online behavior detection using probabilistic multi stream fusion.
Proceedings of the 2005 International Conference on Image Processing, 2005

Meta-Classifiers in Acoustic and Linguistic Feature Fusion-Based Affect Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Discrimination of speech and monophonic singing in continuous audio streams applying multi-layer support vector machines.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Emotion recognition in the manual interaction with graphical user interfaces.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Multimodal music retrieval for large databases.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Applying Bayesian belief networks in approximate string matching for robust keyword-based retrieval.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
A real-time system for hand gesture controlled operation of in-car devices.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

A hybrid music retrieval system using belief networks to integrate multimodal queries and contextual knowledge.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

HMM-based music retrieval using stereophonic feature information and framelength adaptation.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Hidden Markov model-based speech emotion recognition.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Hidden Markov model-based speech emotion recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Aspekte effizienten Usability Engineerings (Aspects of Efficient Usability Engineering).
it+ti - Informationstechnik und Technische Informatik, 2002

Multimodal emotion recognition in audiovisual communication.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

A new technique for adjusting distraction moments in multitasking non-field usability tests.
Proceedings of the Extended abstracts of the 2002 Conference on Human Factors in Computing Systems, 2002

Experimental evaluation of user errors at the skill-based level in an automotive environment.
Proceedings of the Extended abstracts of the 2002 Conference on Human Factors in Computing Systems, 2002

