Erik Marchi

Orcid: 0000-0002-5335-6356

According to our database1, Erik Marchi authored at least 59 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models.
CoRR, 2024

2023
Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models.
CoRR, 2023

Less Is More: A Unified Architecture for Device-Directed Speech Detection with Multiple Invocation Types.
Proceedings of the IEEE International Conference on Acoustics, 2023

Audio-to-Intent Using Acoustic-Textual Subword Representations from End-to-End ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations.
CoRR, 2022

Improving Voice Trigger Detection with Metric Learning.
Proceedings of the Interspeech 2022, 2022

Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models.
Proceedings of the Interspeech 2022, 2022

2021
Whispered and Lombard Neural Speech Synthesis.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Progressive Voice Trigger Detection: Accuracy vs Latency.
Proceedings of the IEEE International Conference on Acoustics, 2021

Knowledge Transfer for Efficient on-Device False Trigger Mitigation.
Proceedings of the IEEE International Conference on Acoustics, 2021

On The Role of Visual Cues in Audiovisual Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement.
CoRR, 2020

Multi-Task Learning for Speaker Verification and Voice Trigger Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Generating Multilingual Voices Using Speaker Space Translation Based on Bilingual Speaker Data.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Detecting Emotion Primitives from Speech and Their Use in Discerning Categorical Emotions.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Automatic Emotion Recognition in the Voice of Children with Autism Spectrum Conditions.
PhD thesis, 2019

The ASC-Inclusion Perceptual Serious Gaming Platform for Autistic Children.
IEEE Trans. Games, 2019

Affective and behavioural computing: Lessons learnt from the First Computational Paralinguistics Challenge.
Comput. Speech Lang., 2019

Leveraging Acoustic Cues and Paralinguistic Embeddings to Detect Expression from Voice.
Proceedings of the Interspeech 2019, 2019

2018
Efficient Voice Trigger Detection for Low Resource Hardware.
Proceedings of the Interspeech 2018, 2018

Generalised Discriminative Transform via Curriculum Learning for Speaker Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection.
Comput. Intell. Neurosci., 2017

End-to-end learning for dimensional emotion recognition from physiological signals.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

2016
The Effect of Narrow-Band Transmission on Recognition of Paralinguistic Information From Human Vocalizations.
IEEE Access, 2016

Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Facing Realism in Spontaneous Emotion Recognition from Speech: Feature Enhancement by Autoencoder with LSTM Neural Networks.
Proceedings of the Interspeech 2016, 2016

Enhancing Multilingual Recognition of Emotion in Speech by Language Identification.
Proceedings of the Interspeech 2016, 2016

Automatic Analysis of Typical and Atypical Encoding of Spontaneous Emotion in the Voice of Children.
Proceedings of the Interspeech 2016, 2016

Real-Time Tracking of Speakers' Emotions, States, and Traits on Mobile Platforms.
Proceedings of the Interspeech 2016, 2016

Is Deception Emotional? An Emotion-Driven Predictive Approach.
Proceedings of the Interspeech 2016, 2016

Discriminatively Trained Recurrent Neural Networks for Continuous Dimensional Emotion Recognition from Audio.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Driver Frustration Detection from Audio and Video in the Wild.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Detecting road surface wetness from audio: A deep learning approach.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Enhanced semi-supervised learning for multimodal emotion recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Adieu features? End-to-end speech emotion recognition using a deep convolutional recurrent network.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Pairwise Decomposition with Deep Neural Networks and Multiscale Kernel Subspace Learning for Acoustic Scene Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

Towards Cross-lingual Automatic Diagnosis of Autism Spectrum Condition in Children's Voices.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

2015
The ICSTM+TUM+UP Approach to the 3rd CHIME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Shaping Dereverberation and LSTM Language Models.
CoRR, 2015

Detecting Road Surface Wetness from Audio: A Deep Learning Approach.
CoRR, 2015

AV+EC 2015: The First Affect Recognition Challenge Bridging Across Audio, Video, and Physiological Data.
Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge, 2015

The ICL-TUM-PASSAU Approach for the MediaEval 2015 "Affective Impact of Movies" Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Face reading from speech - predicting facial action units from audio cues.
Proceedings of the INTERSPEECH 2015, 2015

Typicality and emotion in the voice of children with autism spectrum condition: evidence across three languages.
Proceedings of the INTERSPEECH 2015, 2015

Non-linear prediction with LSTM recurrent neural networks for acoustic novelty detection.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

A novel approach for automatic acoustic novelty detection using a denoising autoencoder with bidirectional LSTM neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Real-time robust recognition of speakers' emotions and characteristics on mobile platforms.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Intelligent user interfaces in digital games for empowerment and inclusion.
Proceedings of the 12th International Conference on Advances in Computer Entertainment Technology, 2015

2014
The state of play of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions.
CoRR, 2014

The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load.
Proceedings of the INTERSPEECH 2014, 2014

Audio onset detection: A wavelet packet based approach with recurrent neural networks.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Multi-resolution linear prediction based features for audio onset detection with bidirectional LSTM neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Likability of human voices: A feature analysis and a neural network regression approach to automatic likability estimation.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

Active learning by label uncertainty for acoustic emotion recognition.
Proceedings of the INTERSPEECH 2013, 2013

The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism.
Proceedings of the INTERSPEECH 2013, 2013

Sparse Autoencoder-Based Feature Transfer Learning for Speech Emotion Recognition.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

2012
Emotion in the speech of children with autism spectrum conditions: prosody and everything else.
Proceedings of the Third Workshop on Child, Computer and Interaction, 2012

Speech, Emotion, Age, Language, Task, and Typicality: Trying to Disentangle Performance and Feature Relevance.
Proceedings of the 2012 International Conference on Privacy, 2012

Improving Recognition of Speaker States and Traits by Cumulative Evidence: Intoxication, Sleepiness, Age and Gender.
Proceedings of the INTERSPEECH 2012, 2012

2011
Robust Multi-stream Keyword and Non-linguistic Vocalization Detection for Computationally Intelligent Virtual Agents.
Proceedings of the Advances in Neural Networks - ISNN 2011, 2011


  Loading...