Zixing Zhang

Orcid: 0000-0001-8487-0561

Affiliations:
  • Imperial College London, Department of Computing, UK
  • University of Passau, Faculty of Computer Science and Mathematics, Germany
  • TU Munich, Institute for Human-Machine Communication, Germany


According to our database1, Zixing Zhang authored at least 103 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition.
CoRR, 2024

2023
Diversifying Emotional Dialogue Generation via Selective Adversarial Training.
Sensors, July, 2023

ACG-EmoCluster: A Novel Framework to Capture Spatial and Temporal Information from Emotional Speech Enhanced by DeepCluster.
Sensors, 2023

Customising General Large Language Models for Specialised Emotion Recognition Tasks.
CoRR, 2023

Refashioning Emotion Recognition Modelling: The Advent of Generalised Large Models.
CoRR, 2023

Frequency Domain Feature Learning with Wavelet Transform for Image Translation.
Proceedings of the PRICAI 2023: Trends in Artificial Intelligence, 2023

GCFormer: A Graph Convolutional Transformer for Speech Emotion Recognition.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

Privacy-Enhanced Federated Learning Against Attribute Inference Attack for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Zero-Shot Speech Emotion Recognition Using Generative Learning with Reconstructed Prototypes.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Exploring Zero-Shot Emotion Recognition in Speech Using Semantic-Embedding Prototypes.
IEEE Trans. Multim., 2022

Rethinking Auditory Affective Descriptors Through Zero-Shot Emotion Recognition in Speech.
IEEE Trans. Comput. Soc. Syst., 2022

Dynamic Restrained Uncertainty Weighting Loss for Multitask Learning of Vocal Expression.
CoRR, 2022

Deliberation Selector for Knowledge-Grounded Conversation Generation.
Proceedings of the PRICAI 2022: Trends in Artificial Intelligence, 2022

Automatic Respiratory Sound Classification Via Multi-Branch Temporal Convolutional Network.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Self-attention transfer networks for speech emotion recognition.
Virtual Real. Intell. Hardw., 2021

Can Machine Learning Assist Locating the Excitation of Snore Sound? A Review.
IEEE J. Biomed. Health Informatics, 2021

EmoBed: Strengthening Monomodal Emotion Recognition via Training with Crossmodal Emotion Embeddings.
IEEE Trans. Affect. Comput., 2021

Artificial Intelligence Internet of Things for the Elderly: From Assisted Living to Health-Care Monitoring.
IEEE Signal Process. Mag., 2021

Deep Learning for Mobile Mental Health: Challenges and recent advances.
IEEE Signal Process. Mag., 2021

Combining a parallel 2D CNN with a self-attention Dilated Residual Network for CTC-based discrete speech emotion recognition.
Neural Networks, 2021

Computer Audition for Fighting the SARS-CoV-2 Corona Crisis - Introducing the Multitask Speech Corpus for COVID-19.
IEEE Internet Things J., 2021

Internet of emotional people: Towards continual affective computing cross cultures via audiovisual signals.
Future Gener. Comput. Syst., 2021

Learning audio sequence representations for acoustic event classification.
Expert Syst. Appl., 2021

Exploring Perception Uncertainty for Emotion Recognition in Dyadic Conversation and Music Listening.
Cogn. Comput., 2021

Identifying surgical-mask speech using deep neural networks on low-level aggregation.
Proceedings of the SAC '21: The 36th ACM/SIGAPP Symposium on Applied Computing, 2021

2020
Snore-GANs: Improving Automatic Snore Sound Classification With Synthesized Data.
IEEE J. Biomed. Health Informatics, 2020

Guest Editorial Special Issue on Adversarial Learning in Computational Intelligence.
IEEE Trans. Emerg. Top. Comput. Intell., 2020

Exploiting time-frequency patterns with LSTM-RNNs for low-bitrate audio restoration.
Neural Comput. Appl., 2020

Automatic Assessment of Depression From Speech via a Hierarchical Attention Transfer Network and Attention Autoencoders.
IEEE J. Sel. Top. Signal Process., 2020

Robust Semisupervised Generative Adversarial Networks for Speech Emotion Recognition via Distribution Smoothness.
IEEE Access, 2020

An Early Study on Intelligent Analysis of Speech Under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety.
Proceedings of the Interspeech 2020, 2020

Hierarchical Attention Transfer Networks for Depression Assessment from Speech.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Generating and Protecting Against Adversarial Attacks for Deep Speech-Based Emotion Recognition Models.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Dynamic Difficulty Awareness Training for Continuous Emotion Prediction.
IEEE Trans. Multim., 2019

Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives [Review Article].
IEEE Comput. Intell. Mag., 2019

Exploring Deep Spectrum Representations via Attention-Based Recurrent and Convolutional Neural Networks for Speech Emotion Recognition.
IEEE Access, 2019

Automatic Detection of Major Depressive Disorder via a Bag-of-Behaviour-Words Approach.
Proceedings of the Third International Symposium on Image Computing and Digital Medicine, 2019

Attention-Enhanced Connectionist Temporal Classification for Discrete Speech Emotion Recognition.
Proceedings of the Interspeech 2019, 2019

Autonomous Emotion Learning in Speech: A View of Zero-Shot Speech Emotion Recognition.
Proceedings of the Interspeech 2019, 2019

VCMNet: Weakly Supervised Learning for Automatic Infant Vocalisation Maturity Analysis.
Proceedings of the International Conference on Multimodal Interaction, 2019

Compact Convolutional Recurrent Neural Networks via Binarization for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Implicit Fusion by Joint Audiovisual Training for Emotion Recognition in Mono Modality.
Proceedings of the IEEE International Conference on Acoustics, 2019

Attention-augmented End-to-end Multi-task Learning for Emotion Prediction from Speech.
Proceedings of the IEEE International Conference on Acoustics, 2019

Teaching Machines to Know Your Depressive State: On Physical Activity in Health and Major Depressive Disorder.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Audiovisual Analysis for Recognising Frustration during Game-Play: Introducing the Multimodal Game Frustration Database.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction, 2019

2018
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments.
ACM Trans. Intell. Syst. Technol., 2018

Semisupervised Autoencoders for Speech Emotion Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Deep Scalogram Representations for Acoustic Scene Classification.
IEEE CAA J. Autom. Sinica, 2018

Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives.
CoRR, 2018

Snoring classified: The Munich-Passau Snore Sound Corpus.
Comput. Biol. Medicine, 2018

Leveraging Unlabeled Data for Emotion Recognition With Enhanced Collaborative Semi-Supervised Learning.
IEEE Access, 2018

Exploring Spatio-Temporal Representations by Integrating Attention-based Bidirectional-LSTM-RNNs and FCNs for Speech Emotion Recognition.
Proceedings of the Interspeech 2018, 2018

Automated Classification of Children's Linguistic versus Non-Linguistic Vocalisations.
Proceedings of the Interspeech 2018, 2018

Bags in Bag: Generating Context-Aware Bags for Tracking Emotions from Speech.
Proceedings of the Interspeech 2018, 2018

Evolving Learning for Analysing Mood-Related Infant Vocalisation.
Proceedings of the Interspeech 2018, 2018

Exploring A New Method for Food Likability Rating Based on DT-CWT Theory.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Towards Conditional Adversarial Training for Predicting Emotions from Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Classification of the Excitation Location of Snore Sounds in the Upper Airway by Acoustic Multifeature Analysis.
IEEE Trans. Biomed. Eng., 2017

A Two-Dimensional Framework of Multiple Kernel Subspace Learning for Recognizing Emotion in Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Advanced Data Exploitation in Speech Analysis: An overview.
IEEE Signal Process. Mag., 2017

Universum Autoencoder-Based Domain Adaptation for Speech Emotion Recognition.
IEEE Signal Process. Lett., 2017

Strength modelling for real-worldautomatic continuous affect recognition from audiovisual signals.
Image Vis. Comput., 2017

Learning Audio Sequence Representations for Acoustic Event Classification.
CoRR, 2017

Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments.
CoRR, 2017

Recognizing Emotions From Whispered Speech Based on Acoustic Feature Transfer Learning.
IEEE Access, 2017

From Hard to Soft: Towards more Human-like Emotion Recognition by Modelling the Perception Uncertainty.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Towards Intelligent Crowdsourcing for Audio Data Annotation: Integrating Active Learning in the Real World.
Proceedings of the Interspeech 2017, 2017

Towards intoxicated speech recognition.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Prediction-based learning for continuous emotion recognition in speech.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Reconstruction-error-based learning for continuous emotion recognition in speech.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Deep Sequential Image Features on Acoustic Scene Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

Wavelets Revisited for the Classification of Acoustic Scenes.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

2016
Exploitation of Phase-Based Features for Whispered Speech Emotion Recognition.
IEEE Access, 2016

Spectral and Cepstral Audio Noise Reduction Techniques in Speech Emotion Recognition.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Fisher Kernels on Phase-Based Features for Speech Emotion Recognition.
Proceedings of the Dialogues with Social Robots, 2016

Facing Realism in Spontaneous Emotion Recognition from Speech: Feature Enhancement by Autoencoder with LSTM Neural Networks.
Proceedings of the Interspeech 2016, 2016

Multiscale kernel locally penalised discriminant analysis exemplified by emotion recognition in speech.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Enhanced semi-supervised learning for multimodal emotion recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Wavelet features for classification of vote snore sounds.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

The University of Passau Open Emotion Recognition System for the Multimodal Emotion Challenge.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016

2015
Semi-Autonomous Data Enrichment and Optimisation for Intelligent Speech Analysis.
PhD thesis, 2015

Cooperative Learning and its Application to Emotion Recognition from Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Dynamic Active Learning Based on Agreement and Applied to Emotion Recognition in Spoken Interactions.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Bird sounds classification by large scale acoustic features and extreme learning machine.
Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015

On rater reliability and agreement based dynamic active learning.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
Channel mapping using bidirectional long short-term memory for dereverberation in hands-free voice controlled devices.
IEEE Trans. Consumer Electron., 2014

Distributing Recognition in Computational Paralinguistics.
IEEE Trans. Affect. Comput., 2014

Autoencoder-based Unsupervised Domain Adaptation for Speech Emotion Recognition.
IEEE Signal Process. Lett., 2014

Robust speech recognition using long short-term memory recurrent neural networks for hybrid acoustic modelling.
Proceedings of the INTERSPEECH 2014, 2014

Linked Source and Target Domain Subspace Feature Transfer Learning - Exemplified by Speech Emotion Recognition.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Introducing shared-hidden-layer autoencoders for transfer learning and their application in acoustic emotion recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Active learning by label uncertainty for acoustic emotion recognition.
Proceedings of the INTERSPEECH 2013, 2013

Co-training succeeds in Computational Paralinguistics.
Proceedings of the IEEE International Conference on Acoustics, 2013

Feature enhancement by bidirectional LSTM networks for conversational speech recognition in highly non-stationary noise.
Proceedings of the IEEE International Conference on Acoustics, 2013

Sparse Autoencoder-Based Feature Transfer Learning for Speech Emotion Recognition.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

2012
Synthesized speech for model training in cross-corpus recognition of human emotion.
Int. J. Speech Technol., 2012

Towards distributed recognition of emotion from speech.
Proceedings of the 5th International Symposium on Communications, 2012

Active Learning by Sparse Instance Tracking and Classifier Confidence in Acoustic Emotion Recognition.
Proceedings of the INTERSPEECH 2012, 2012

Semi-supervised learning helps in sound event classification.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Automatic recognition of emotion evoked by general sound events.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Towards Automatic Intoxication Detection from Speech in Real-Life Acoustic Environments.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

2011
Using Multiple Databases for Training in Emotion Recognition: To Unite or to Vote?
Proceedings of the INTERSPEECH 2011, 2011

Unsupervised learning in cross-corpus acoustic emotion recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011


  Loading...