Stefan Scherer

Orcid: 0000-0002-0280-5393

According to our database1, Stefan Scherer authored at least 140 papers between 1998 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Accelerating Look-ahead in Bayesian Optimization: Multilevel Monte Carlo is All you Need.
CoRR, 2024

2023
Machine learning for semi-automated scoping reviews.
Intell. Syst. Appl., September, 2023

Multimodal Analysis and Assessment of Therapist Empathy in Motivational Interviews.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Improving Selective Visual Question Answering by Learning from Your Peers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Therapist Empathy Assessment in Motivational Interviews.
Proceedings of the 11th International Conference on Affective Computing and Intelligent Interaction, 2023

2022
Development and Cross-Cultural Evaluation of a Scoring Algorithm for the Biometric Attachment Test: Overcoming the Challenges of Multimodal Fusion with "Small Data".
IEEE Trans. Affect. Comput., 2022

Training public speaking with virtual social interactions: effectiveness of real-time feedback and delayed feedback.
J. Multimodal User Interfaces, 2022

Speech Behavioral Markers Align on Symptom Factors in Psychological Distress.
Proceedings of the 10th International Conference on Affective Computing and Intelligent Interaction, 2022

2020
Social and Emotional Skills Training with Embodied Moxie.
CoRR, 2020

Multimodal Automatic Coding of Client Behavior in Motivational Interviewing.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

2019
A steady-state precipitation model for flowsheet simulation and its application.
Comput. Chem. Eng., 2019

2018
Unfolding the External Behavior and Inner Affective State of Teammates through Ensemble Learning: Experimental Evidence from a Dyadic Team Corpus.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

NADiA: Neural Network Driven Virtual Human Conversation Agents.
Proceedings of the 18th International Conference on Intelligent Virtual Agents, 2018

Influence of Individual Differences when Training Public Speaking with Virtual Audiences.
Proceedings of the 18th International Conference on Intelligent Virtual Agents, 2018

Multimodal Analysis of Client Behavioral Change Coding in Motivational Interviewing.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Towards Learning Nuisance-Free Representations of Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Modeling Temporality of Human Intentions by Domain Adaptation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

NADiA - Towards Neural Network Driven Virtual Human Conversation Agents.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

A Generic Platform for Training Social Skills with Adaptative Virtual Agents.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

A Linguistically-Informed Fusion Approach for Multimodal Depression Detection.
Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic, 2018

What type of happiness are you looking for? - A closer look at detecting mental health from language.
Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic, 2018

Multimodal assessment of depression from behavioral signals.
Proceedings of the Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations, 2018

2017
Adolescent Suicidal Risk Assessment in Clinician-Patient Interaction.
IEEE Trans. Affect. Comput., 2017

Guest Editorial: Towards Machines Able to Deal with Laughter.
IEEE Trans. Affect. Comput., 2017

Reporting Mental Health Symptoms: Breaking Down Barriers to Care with Virtual Human Interviewers.
Frontiers Robotics AI, 2017

Perception of Virtual Audiences.
IEEE Computer Graphics and Applications, 2017

AVEC 2017: Real-life Depression, and Affect Recognition Workshop and Challenge.
Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA, USA, October 23, 2017

Racing Heart and Sweaty Palms - What Influences Users' Self-Assessments and Physiological Signals When Interacting with Virtual Audiences?
Proceedings of the Intelligent Virtual Agents - 17th International Conference, 2017

OpenMM: An Open-Source Multimodal Feature Extraction Tool.
Proceedings of the Interspeech 2017, 2017

The relationship between task-induced stress, vocal changes, and physiological state during a dyadic team task.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

Learning representations of emotional speech with deep convolutional generative adversarial networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Assessing Public Speaking Ability from Thin Slices of Behavior.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

Affect-LM: A Neural Language Model for Customizable Affective Text Generation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

A Cross-modal Review of Indicators for Depression Detection Systems.
Proceedings of the Fourth Workshop on Computational Linguistics and Clinical Psychology, 2017

What really matters - An information gain analysis of questions and reactions in automated PTSD screenings.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

Comparing models for gesture recognition of children's bullying behaviors.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

Manual and automatic measures confirm - Intranasal oxytocin increases facial expressivity.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

2016
Self-Reported Symptoms of Depression and PTSD Are Associated with Reduced Vowel Space in Screening Interviews.
IEEE Trans. Affect. Comput., 2016

Multimodal Behavior Analytics for Interactive Technologies.
Künstliche Intell., 2016

Automatic Behavior Analysis During a Clinical Interview with a Virtual Human.
Proceedings of the Medicine Meets Virtual Reality 22 - NextMed, 2016

AVEC 2016: Depression, Mood, and Emotion Recognition Workshop and Challenge.
Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, 2016

A Multimodal Corpus for the Assessment of Public Speaking Ability and Anxiety.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Manipulating the Perception of Virtual Audiences Using Crowdsourced Behaviors.
Proceedings of the Intelligent Virtual Agents - 16th International Conference, 2016

An Architecture for Biologically Grounded Real-Time Reflexive Behavior.
Proceedings of the Intelligent Virtual Agents - 16th International Conference, 2016

Representation Learning for Speech Emotion Recognition.
Proceedings of the Interspeech 2016, 2016

Getting to know you: a multimodal investigation of team behavior and resilience to stress.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Native vs. non-native language fluency implications on multimodal interaction for interpersonal skills training.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

An unsupervised approach to glottal inverse filtering.
Proceedings of the 24th European Signal Processing Conference, 2016

2015
I Can Already Guess Your Answer: Predicting Respondent Reactions during Dyadic Negotiation.
IEEE Trans. Affect. Comput., 2015

A review of depression and suicide risk assessment using speech analysis.
Speech Commun., 2015

Emotion recognition from speech signals via a probabilistic echo-state network.
Pattern Recognit. Lett., 2015

Preface of pattern recognition in human computer interaction.
Pattern Recognit. Lett., 2015

Automatic nonverbal behavior indicators of depression and PTSD: the effect of gender.
J. Multimodal User Interfaces, 2015

Learning Representations of Affect from Speech.
CoRR, 2015

A Multimodal Predictive Model of Successful Debaters or How I Learned to Sway Votes.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Multimodal Public Speaking Performance Assessment.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Exploring Behavior Representation for Learning Analytics.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Public Speaking Training with a Multimodal Interactive Virtual Audience Framework.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Combining Two Perspectives on Classifying Multimodal Data for Recognizing Speaker Traits.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Acoustic and para-verbal indicators of persuasiveness in social multimedia.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Reduced vowel space is a robust indicator of psychological distress: A cross-corpus analysis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Exploring feedback strategies to improve public speaking: an interactive virtual audience framework.
Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2015

Automatic assessment and analysis of public speaking anxiety: A virtual audience case study.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

A demonstration of the perception system in SimSensei, a virtual human application for healthcare interviews.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Towards an affective interface for assessment of psychological distress.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

A multi-label convolutional neural network approach to cross-domain action unit detection.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

SimSensei Demonstration: A Perceptive Virtual Human Interviewer for Healthcare Applications.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Investigating automatic measurements of prosodic accommodation and its dynamics in social interaction.
Speech Commun., 2014

Automatic audiovisual behavior descriptors for psychological disorder analysis.
Image Vis. Comput., 2014

Adolescent suicidal risk assessment in clinician-patient interaction: A study of verbal and acoustic behaviors.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

The Distress Analysis Interview Corpus of human and computer interviews.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Dyadic Behavior Analysis in Depression Severity Assessment Interviews.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

COVAREP - A collaborative voice analysis repository for speech technologies.
Proceedings of the IEEE International Conference on Acoustics, 2014

Context-based signal descriptors of heart-rate variability for anxiety assessment.
Proceedings of the IEEE International Conference on Acoustics, 2014

SimSensei kiosk: a virtual human interviewer for healthcare decision support.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

An interactive virtual audience platform for public speaking training.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

2013
On the discovery of events in EEG data utilizing information fusion.
Comput. Stat., 2013

Investigating fuzzy-input fuzzy-output support vector machines for robust voice quality classification.
Comput. Speech Lang., 2013

Verbal indicators of psychological distress in interactive dialogue with a virtual human.
Proceedings of the SIGDIAL 2013 Conference, 2013

Virtual character performance from speech.
Proceedings of the ACM SIGGRAPH / Eurographics Symposium on Computer Animation, 2013

User-State Sensing for Virtual Health Agents and TeleHealth Applications.
Proceedings of the Medicine Meets Virtual Reality 20 - NextMed, 2013

Cicero - Towards a Multimodal Virtual Audience Platform for Public Speaking Training.
Proceedings of the Intelligent Virtual Agents - 13th International Conference, 2013

Investigating voice quality as a speaker-independent indicator of depression and PTSD.
Proceedings of the INTERSPEECH 2013, 2013

Prediction of strategy and outcome as negotiation unfolds by using basic verbal and behavioral features.
Proceedings of the INTERSPEECH 2013, 2013

A comparative study of glottal open quotient estimation techniques.
Proceedings of the INTERSPEECH 2013, 2013

Audiovisual behavior descriptors for depression assessment.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

ICMI 2013 grand challenge workshop on multimodal learning analytics.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Investigating the speech characteristics of suicidal adolescents.
Proceedings of the IEEE International Conference on Acoustics, 2013

Speaker and language independent voice quality classification applied to unlabelled corpora of expressive speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

Automatic behavior descriptors for psychological disorder analysis.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Towards higher quality character performance in previz.
Proceedings of the Symposium on Digital Production, 2013

Automatic Nonverbal Behavior Indicators of Depression and PTSD: Exploring Gender Differences.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

Mutual Behaviors during Dyadic Negotiation: Automatic Prediction of Respondent Reactions.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

2012
Spotting laughter in natural multiparty conversations: A comparison of automatic online and offline approaches using audiovisual data.
ACM Trans. Interact. Intell. Syst., 2012

A generic framework for the inference of user states in human computer interaction.
J. Multimodal User Interfaces, 2012

Investigating the influence of virtual peers as dialect models on students' prosodic inventory.
Proceedings of the Third Workshop on Child, Computer and Interaction, 2012

Vers un mesure automatique de l'adaptation prosodique en interaction conversationnelle (Automatic measurement of prosodic accommodation in conversational interaction) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

The Effect of Fuzzy Training Targets on Voice Quality Classification.
Proceedings of the Multimodal Pattern Recognition of Social Signals in Human-Computer-Interaction, 2012

An audiovisual political speech analysis incorporating eye-tracking and perception data.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Perception Markup Language: Towards a Standardized Representation of Perceived Nonverbal Behaviors.
Proceedings of the Intelligent Virtual Agents - 12th International Conference, 2012

Automatic emotion classification vs. human perception: Comparing machine performance to the human benchmark.
Proceedings of the 11th International Conference on Information Science, 2012

1st international workshop on multimodal learning analytics: extended abstract.
Proceedings of the International Conference on Multimodal Interaction, 2012

Step-wise emotion recognition using concatenated-HMM.
Proceedings of the International Conference on Multimodal Interaction, 2012

Detecting a targeted voice style in an audiobook using voice quality features.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Analyzing the user's state in HCI: from crisp emotions to conversational dispositions.
PhD thesis, 2011

Studying Self- and Active-Training Methods for Multi-feature Set Emotion Recognition.
Proceedings of the Partially Supervised Learning - First IAPR TC3 Workshop, 2011

On the Use of Multimodal Cues for the Prediction of Degrees of Involvement in Spontaneous Conversation.
Proceedings of the INTERSPEECH 2011, 2011

Conditioned Hidden Markov Model Fusion for Multimodal Classification.
Proceedings of the INTERSPEECH 2011, 2011

Multimodal Emotion Classification in Naturalistic User Behavior.
Proceedings of the Human-Computer Interaction. Towards Mobile and Intelligent Interaction Environments, 2011

Social Signal Processing in Companion Systems - Challenges Ahead.
Proceedings of the 41. Jahrestagung der Gesellschaft für Informatik, 2011

How Low Level Observations Can Help to Reveal the User's State in HCI.
Proceedings of the Affective Computing and Intelligent Interaction, 2011

Multiple Classifier Systems for the Classification of Audio-Visual Emotional States.
Proceedings of the Affective Computing and Intelligent Interaction, 2011

2010
Multiple Classifier Systems for the Recogonition of Human Emotions.
Proceedings of the Multiple Classifier Systems, 9th International Workshop, 2010

Evaluation of the PIT Corpus Or What a Difference a Face Makes?
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Developing an Expressive Speech Labeling Tool Incorporating the Temporal Characteristics of Emotion.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

An Open Source Process Engine Framework for Realtime Pattern Recognition and Information Fusion Tasks.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

It takes two to tango - assessing the impact of delay on conversational interactivity on perceived speech quality.
Proceedings of the INTERSPEECH 2010, 2010

Comparing measures of synchrony and alignment in dialogue speech timing with respect to turn-taking activity.
Proceedings of the INTERSPEECH 2010, 2010

Towards the Automatic Detection of Involvement in Conversation.
Proceedings of the Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues, 2010

Maximum Echo-State-Likelihood Networks for Emotion Recognition.
Proceedings of the Artificial Neural Networks in Pattern Recognition, 2010

2009
The GMM-SVM Supervector Approach for the Recognition of the Emotional Status from Speech.
Proceedings of the Artificial Neural Networks, 2009

Multimodal real-time conversation analysis using a novel process engine.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

Multimodal Laughter Detection in Natural Discourses.
Proceedings of the Human Centered Robot Systems, Cognition, Interaction, Technology, 2009

2008
Emotion Recognition from Speech Using Multi-Classifier Systems and RBF-Ensembles.
Proceedings of the Speech, 2008

Real-Time Emotion Recognition Using Echo State Networks.
Proceedings of the Perception in Multimodal Dialogue Systems, 2008

The PIT Corpus of German Multi-Party Dialogues.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

A Flexible Wizard of Oz Environment for Rapid Prototyping.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Emotion Recognition from Speech: Stress Experiment.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Real-Time Emotion Recognition from Speech Using Echo State Networks.
Proceedings of the Artificial Neural Networks in Pattern Recognition, Third IAPR Workshop, 2008

2007
Fuzzy-Input Fuzzy-Output One-Against-All Support Vector Machines.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2007

A Novel Feature for Emotion Recognition in Voice Based Applications.
Proceedings of the Affective Computing and Intelligent Interaction, 2007

2006
Wizard-of-Oz Data Collection for Perception and Interaction in Multi-User Environments.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

2000
Fitting 3D Models To 2D Imagery: A Physics Based Approach.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2000), 2000

Vision Guided Bin Picking and Mounting in a Flexible Assembly Cell.
Proceedings of the Intelligent Problem Solving, 2000

A Novel Bidirectional Framework for Control and Refinement of Area Based Correlation Techniques.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

3D Model Based Pose Determination in Real-Time: Strategies, Convergence, Accurac.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

1999
The Discriminatory Power of Ordinal Measures - Towards a New Coefficient.
Proceedings of the 1999 Conference on Computer Vision and Pattern Recognition (CVPR '99), 1999

Subpixel Stereo Matching by Robust Estimation of Local Distortion Using Gabor Filters.
Proceedings of the Computer Analysis of Images and Patterns, 8th International Conference, 1999

A Vision Driven Automatic Assembly Unit.
Proceedings of the Computer Analysis of Images and Patterns, 8th International Conference, 1999

1998
Robust adaptive window matching by homogeneity constraint and integration of descriptions.
Proceedings of the Fourteenth International Conference on Pattern Recognition, 1998


  Loading...