Xin Jing

Affiliations:
  • Technical University of Munich (TUM), Chair of Health Informatics, Munich, Germany
  • University of Augsburg, Embedded Intelligence for Health Care and Wellbeing, Augsburg, Germany


According to our database1, Xin Jing authored at least 20 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Decoding nature's melody: significance and challenges of machine learning in assessing bird diversity via soundscape analysis.
Artif. Intell. Rev., January, 2026

2025
MELT: Towards Automated Multimodal Emotion Data Annotation by Leveraging LLM Embedded Knowledge.
CoRR, May, 2025

STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition.
IEEE Trans. Affect. Comput., 2025

Audio-Based Kinship Verification Using Age Domain Conversion.
IEEE Signal Process. Lett., 2025

Vishing: Detecting social engineering in spoken communication - A first survey & urgent roadmap to address an emerging societal challenge.
Comput. Speech Lang., 2025

MADUV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound Vocalization Challenge.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

ParaCLAP - Towards a general language-audio model for computational paralinguistic tasks.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Temporal Oriented ResNet for Gaming Dimensional Emotion Prediction.
Proceedings of the 32nd European Signal Processing Conference, 2024

2023
HEAR4Health: a blueprint for making computer audition a staple of modern healthcare.
Frontiers Digit. Health, May, 2023

U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech.
CoRR, 2023

Daily Mental Health Monitoring from Speech: A Real-World Japanese Dataset and Multitask Learning Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Audio self-supervised learning: A survey.
Patterns, 2022

Dynamic Restrained Uncertainty Weighting Loss for Multitask Learning of Vocal Expression.
CoRR, 2022

Redundancy Reduction Twins Network: A Training framework for Multi-output Emotion Regression.
CoRR, 2022

Exploring speaker enrolment for few-shot personalisation in emotional vocalisation prediction.
CoRR, 2022

An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Temporal-oriented Broadcast ResNet for COVID-19 Detection.
Proceedings of the IEEE-EMBS International Conference on Biomedical and Health Informatics, 2022

2021
CovNet: A Transfer Learning Framework for Automatic COVID-19 Detection From Crowd-Sourced Cough Sounds.
Frontiers Digit. Health, 2021


  Loading...