Martin Wöllmer

According to our database1, Martin Wöllmer authored at least 73 papers between 2008 and 2017.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2017
Towards intoxicated speech recognition.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

2015
Cross-corpus acoustic emotion recognition: Variances and strategies (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Building autonomous sensitive artificial listeners (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Context-sensitive learning for enhanced audiovisual emotion classification (Extended abstract).
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
Memory-Enhanced Neural Networks and NMF for Robust ASR.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2014

Probabilistic speech feature extraction with context-sensitive Bottleneck neural networks.
Neurocomputing, 2014

Feature enhancement by deep LSTM networks for ASR in reverberant multisource environments.
Computer Speech & Language, 2014

A Broadcast News Corpus for Evaluation and Tuning of German LVCSR Systems.
CoRR, 2014

2013
Keyword spotting exploiting Long Short-Term Memory.
Speech Communication, 2013

LSTM-Modeling of continuous emotions in an audiovisual affect recognition framework.
Image Vision Comput., 2013

YouTube Movie Reviews: Sentiment Analysis in an Audio-Visual Context.
IEEE Intelligent Systems, 2013

Noise robust ASR in reverberated multisource environments applying convolutive NMF and Long Short-Term Memory.
Computer Speech & Language, 2013

Feature enhancement by bidirectional LSTM networks for conversational speech recognition in highly non-stationary noise.
Proceedings of the IEEE International Conference on Acoustics, 2013

Probabilistic asr feature extraction applying context-sensitive connectionist temporal classification networks.
Proceedings of the IEEE International Conference on Acoustics, 2013

Speaker trait characterization in web videos: Uniting speech, language, and facial features.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Context-sensitive machine learning for intelligent human behavior analysis.
PhD thesis, 2012

A multitask approach to continuous five-dimensional affect sensing in natural speech.
TiiS, 2012

Building Autonomous Sensitive Artificial Listeners.
IEEE Trans. Affective Computing, 2012

Context-Sensitive Learning for Enhanced Audiovisual Emotion Classification.
IEEE Trans. Affective Computing, 2012

Real-Time Activity Detection in a Multi-Talker Reverberated Environment.
Cognitive Computation, 2012

Dominance Detection in a Reverberated Acoustic Scenario.
Proceedings of the Advances in Neural Networks - ISNN 2012, 2012

Towards distributed recognition of emotion from speech.
Proceedings of the 5th International Symposium on Communications, 2012

Temporal and Situational Context Modeling for Improved Dominance Recognition in Meetings.
Proceedings of the INTERSPEECH 2012, 2012

Combining Bottleneck-BLSTM and Semi-Supervised Sparse NMF for Recognition of Conversational Speech in Highly Instationary Noise.
Proceedings of the INTERSPEECH 2012, 2012

Analyzing the memory of BLSTM Neural Networks for enhanced emotion classification in dyadic spoken interactions.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize?
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Fully Automatic Audiovisual Emotion Recognition: Voice, Words, and the Face.
Proceedings of the 10. ITG Conference on Speech Communication, 2012

Sparse, Hierarchical and Semi-Supervised Base Learning for Monaural Enhancement of Conversational Speech.
Proceedings of the 10. ITG Conference on Speech Communication, 2012

2011
Tandem decoding of children's speech for keyword detection in a child-robot interaction scenario.
TSLP, 2011

Online Driver Distraction Detection Using Long Short-Term Memory.
IEEE Trans. Intelligent Transportation Systems, 2011

Computational Assessment of Interest in Speech - Facing the Real-Life Challenge.
KI, 2011

Semantic Speech Tagging: Towards Combined Analysis of Speaker Traits.
Proceedings of the AES International Conference Semantic Audio 2011, 2011

Enhancing Spontaneous Speech Recognition with BLSTM Features.
Proceedings of the Advances in Nonlinear Speech Processing, 2011

Robust Multi-stream Keyword and Non-linguistic Vocalization Detection for Computationally Intelligent Virtual Agents.
Proceedings of the Advances in Neural Networks - ISNN 2011, 2011

Automatic Assessment of Singer Traits in Popular Music: Gender, Age, Height and Race.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Interacting with Emotional Virtual Agents.
Proceedings of the Intelligent Technologies for Interactive Entertainment, 2011

Speech-Based Non-Prototypical Affect Recognition for Child-Robot Interaction in Reverberated Environments.
Proceedings of the INTERSPEECH 2011, 2011

Acoustic-Linguistic Recognition of Interest in Speech with Bottleneck-BLSTM Nets.
Proceedings of the INTERSPEECH 2011, 2011

Feature Frame Stacking in RNN-Based Tandem ASR Systems - Learned vs. Predefined Context.
Proceedings of the INTERSPEECH 2011, 2011

A multi-stream ASR framework for BLSTM modeling of conversational speech.
Proceedings of the IEEE International Conference on Acoustics, 2011

Localization of non-linguistic events in spontaneous speech by Non-Negative Matrix Factorization and Long Short-Term Memory.
Proceedings of the IEEE International Conference on Acoustics, 2011

Come and have an emotional workout with sensitive artificial listeners!
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

String-based audiovisual fusion of behavioural events for the assessment of dimensional affect.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

Conversational Speech Recognition in Non-stationary Reverberated Environments.
Proceedings of the Cognitive Behavioural Systems, 2011

Unsupervised learning in cross-corpus acoustic emotion recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

A novel bottleneck-BLSTM front-end for feature-level context modeling in conversational speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies.
IEEE Trans. Affective Computing, 2010

Combining Long Short-Term Memory and Dynamic Bayesian Networks for Incremental Emotion-Sensitive Artificial Listening.
J. Sel. Topics Signal Processing, 2010

On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues.
J. Multimodal User Interfaces, 2010

Bidirectional LSTM Networks for Context-Sensitive Keyword Detection in a Cognitive Virtual Agent Framework.
Cognitive Computation, 2010

Emotion on the Road - Necessity, Acceptance, and Feasibility of Affective Computing in the Car.
Adv. Human-Computer Interaction, 2010

Opensmile: the munich versatile and fast open-source audio feature extractor.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

3d gesture recognition applying long short-term memory and contextual knowledge in a CAVE.
Proceedings of the 1st ACM international workshop on Multimodal pervasive video analysis, 2010

Long short-term memory networks for noise robust speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling.
Proceedings of the INTERSPEECH 2010, 2010

Recognition of spontaneous conversational speech using long short-term memory phoneme predictions.
Proceedings of the INTERSPEECH 2010, 2010

Spoken term detection with Connectionist Temporal Classification: A novel hybrid CTC-DBN decoder.
Proceedings of the IEEE International Conference on Acoustics, 2010

Non-negative matrix factorization as noise-robust feature extractor for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Being bored? Recognising natural interest by extensive audiovisual integration for real-life application.
Image Vision Comput., 2009

A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams.
Neurocomputing, 2009

Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement.
EURASIP J. Audio, Speech and Music Processing, 2009

Improving Keyword Spotting with a Tandem BLSTM-DBN Architecture.
Proceedings of the Advances in Nonlinear Speech Processing, 2009

Robust in-car spelling recognition - a tandem BLSTM-HMM approach.
Proceedings of the INTERSPEECH 2009, 2009

Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks.
Proceedings of the INTERSPEECH 2009, 2009

Speech control in surgery: A field analysis and strategies.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks.
Proceedings of the IEEE International Conference on Acoustics, 2009

Robust vocabulary independent keyword spotting with graphical models.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

From speech to letters - using a novel neural network architecture for grapheme based ASR.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

A demonstration of audiovisual sensitive artificial listeners.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

OpenEAR - Introducing the munich open-source emotion and affect recognition toolkit.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

2008
Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies.
Proceedings of the INTERSPEECH 2008, 2008

Speech recognition in noisy environments using a switching linear dynamic model for feature enhancement.
Proceedings of the INTERSPEECH 2008, 2008

Switching Linear Dynamic Models for Noise Robust In-Car Speech Recognition.
Proceedings of the Pattern Recognition, 2008


  Loading...