Maximilian Schmitt

Orcid: 0000-0001-7453-5612

According to our database1, Maximilian Schmitt authored at least 56 papers between 2016 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

2022
Bag-of-words representations for computer audition.
PhD thesis, 2022

Face mask recognition from audio: The MASC database and an overview on the mask challenge.
Pattern Recognit., 2022

Probing speech emotion recognition transformers for linguistic knowledge.
Proceedings of the Interspeech 2022, 2022

Quantifying Cognitive Load from Voice using Transformer-Based Models and a Cross-Dataset Evaluation.
Proceedings of the 21st IEEE International Conference on Machine Learning and Applications, 2022

2021
Can Machine Learning Assist Locating the Excitation of Snore Sound? A Review.
IEEE J. Biomed. Health Informatics, 2021

SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Computer Audition for Fighting the SARS-CoV-2 Corona Crisis - Introducing the Multitask Speech Corpus for COVID-19.
IEEE Internet Things J., 2021

Automatic Recognition of Texture in Renaissance Music.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

2020
DEMoS: an Italian emotional speech corpus.
Lang. Resour. Evaluation, 2020

I see it in your eyes: Training the shallowest-possible CNN to recognise emotions and pain from muted web-assisted in-the-wild video-chats in real-time.
Inf. Process. Manag., 2020

The INTERSPEECH 2020 Computational Paralinguistics Challenge: Elderly Emotion, Breathing & Masks.
Proceedings of the Interspeech 2020, 2020

2019
Humans Inside: Cooperative Big Multimedia Data Mining.
Proceedings of the Innovations in Big Data Mining and Embedded Knowledge, 2019

Synchronization in Interpersonal Speech.
Frontiers Robotics AI, 2019

SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild.
CoRR, 2019

Predicting Biological Signals from Speech: Introducing a Novel Multimodal Dataset and Results.
Proceedings of the 21st IEEE International Workshop on Multimedia Signal Processing, 2019

AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition.
Proceedings of the 9th International on Audio/Visual Emotion Challenge and Workshop, 2019

Automatic Detection of Major Depressive Disorder via a Bag-of-Behaviour-Words Approach.
Proceedings of the Third International Symposium on Image Computing and Digital Medicine, 2019

The INTERSPEECH 2019 Computational Paralinguistics Challenge: Styrian Dialects, Continuous Sleepiness, Baby Sounds & Orca Activity.
Proceedings of the Interspeech 2019, 2019

Continuous Emotion Recognition in Speech - Do We Need Recurrence?
Proceedings of the Interspeech 2019, 2019

Performance Analysis of Unimodal and Multimodal Models in Valence-Based Empathy Recognition.
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

End-to-end Audio Classification with Small Datasets - Making It Work.
Proceedings of the 27th European Signal Processing Conference, 2019

Automated Classification of Airborne Pollen using Neural Networks.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Snoring - An Acoustic Definition.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

I Know How you Feel Now, and Here's why!: Demystifying Time-Continuous High Resolution Text-Based Affect Predictions in the Wild.
Proceedings of the 32nd IEEE International Symposium on Computer-Based Medical Systems, 2019

2018
MixedEmotions: An Open-Source Toolbox for Multimodal Emotion Analysis.
IEEE Trans. Multim., 2018

Weakly Supervised One-Shot Detection with Attention Siamese Networks.
CoRR, 2018

Snoring classified: The Munich-Passau Snore Sound Corpus.
Comput. Biol. Medicine, 2018

How Good Is Your Model 'Really'? On 'Wildness' of the In-the-Wild Speech-Based Affect Recognisers.
Proceedings of the Speech and Computer - 20th International Conference, 2018

You Sound Like Your Counterpart: Interpersonal Speech Analysis.
Proceedings of the Speech and Computer - 20th International Conference, 2018

AVEC 2018 Workshop and Challenge: Bipolar Disorder and Cross-Cultural Affect Recognition.
Proceedings of the 2018 on Audio/Visual Emotion Challenge and Workshop, 2018

A German Corpus for Fine-Grained Named Entity Recognition and Relation Extraction of Traffic and Industry Events.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Musical-Linguistic Annotations of Il Lauro Secco.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

Identifying Emotions in Opera Singing: Implications of Adverse Acoustic Conditions.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018


Bags in Bag: Generating Context-Aware Bags for Tracking Emotions from Speech.
Proceedings of the Interspeech 2018, 2018

EAT -: The ICMI 2018 Eating Analysis and Tracking Challenge.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Introducing an Emotion-Driven Assistance System for Cognitively Impaired Individuals.
Proceedings of the Computers Helping People with Special Needs, 2018

Multimodal Bag-of-Words for Cross Domains Sentiment Analysis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Deep Unsupervised Representation Learning for Abnormal Heart Sound Classification.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

2017
openXBOW - Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit.
J. Mach. Learn. Res., 2017

AVEC 2017: Real-life Depression, and Affect Recognition Workshop and Challenge.
Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA, USA, October 23, 2017

From Hard to Soft: Towards more Human-like Emotion Recognition by Modelling the Perception Uncertainty.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

A Paralinguistic Approach To Speaker Diarisation: Using Age, Gender, Voice Likability and Personality Traits.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Industrial Evaluation of Search-Based Test Generation Techniques for Control Systems.
Proceedings of the 2017 IEEE International Symposium on Software Reliability Engineering Workshops, 2017


Spotting Social Signals in Conversational Speech over IP: A Deep Learning Perspective.
Proceedings of the Interspeech 2017, 2017

Seeking the SuperStar: Automatic assessment of perceived singing quality.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

A machine learning based system for the automatic evaluation of aphasia speech.
Proceedings of the 19th IEEE International Conference on e-Health Networking, 2017

Recognising Guitar Effects - Which Acoustic Features Really Matter?
Proceedings of the 47. Jahrestagung der Gesellschaft für Informatik, 2017

Automatic Guitar String Detection by String-Inverse Frequency Estimation.
Proceedings of the 47. Jahrestagung der Gesellschaft für Informatik, 2017

"You sound ill, take the day off": Automatic recognition of speech affected by upper respiratory tract infection.
Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2017

Speech-based Diagnosis of Autism Spectrum Condition by Generative Adversarial Network Representations.
Proceedings of the 2017 International Conference on Digital Health, 2017

2016
At the Border of Acoustics and Linguistics: Bag-of-Audio-Words for the Recognition of Emotions in Speech.
Proceedings of the Interspeech 2016, 2016

Towards Cross-lingual Automatic Diagnosis of Autism Spectrum Condition in Children's Voices.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

A Bag-of-Audio-Words Approach for Snore Sounds' Excitation Localisation.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016


  Loading...