Yang Xiao

Orcid: 0009-0005-9329-7425

Affiliations:
  • University of Melbourne, Australia
  • Nanyang Technological University, School of Computer Science and Engineering, Singapore


According to our database1, Yang Xiao authored at least 35 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Why Can't They Remember? Uncovering Representation and Retrieval Bottlenecks in Multi-Turn Acoustic Memory.
CoRR, May, 2026

Rethinking Continual Learning for Speech and Audio: A Representation-Centric Taxonomy and Open Problems.
CoRR, May, 2026

Continual Adaptation for Pacific Indigenous Speech Recognition.
CoRR, March, 2026

PolyBench: A Benchmark for Compositional Reasoning in Polyphonic Audio.
CoRR, March, 2026

The First Environmental Sound Deepfake Detection Challenge: Benchmarking Robustness, Evaluation, and Insights.
CoRR, March, 2026

Focus Then Listen: Exploring Plug-and-Play Audio Enhancer for Noise-Robust Large Audio Language Models.
CoRR, March, 2026

Adapting Where It Matters: Depth-Aware Adaptation for Efficient Multilingual Speech Recognition in Low-Resource Languages.
CoRR, February, 2026

ESDD2: Environment-Aware Speech and Sound Deepfake Detection Challenge Evaluation Plan.
CoRR, January, 2026

2025
Environmental Sound Deepfake Detection Challenge: An Overview.
CoRR, December, 2025

ESDD 2026: Environmental Sound Deepfake Detection Challenge Evaluation Plan.
CoRR, August, 2025

Multilingual Source Tracing of Speech Deepfakes: A First Benchmark.
CoRR, August, 2025

XLSR-Mamba: A Dual-Column Bidirectional State Space Model for Spoofing Attack Detection.
IEEE Signal Process. Lett., 2025

Noise-Robust Sound Event Detection and Counting via Language-Queried Sound Separation.
IEEE Signal Process. Lett., 2025

EnvSDD: Benchmarking Environmental Sound Deepfake Detection.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

AdaKWS: Towards Robust Keyword Spotting with Test-Time Adaptation.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Listen, Analyze, and Adapt to Learn New Attacks: An Exemplar-Free Class Incremental Learning Method for Audio Deepfake Source Tracing.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

TF-Mamba: A Time-Frequency Network for Sound Source Localization.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Where's That Voice Coming? Continual Learning for Sound Source Localization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection.
Proceedings of the IEEE International Conference on Acoustics, 2025

Exploring Text-Queried Sound Event Detection with Audio Source Separation.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Dark Experience for Incremental Keyword Spotting.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

DG-SED: Domain Generalization for Sound Event Detection with Heterogeneous Training Data.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

RawTFNet: A Lightweight CNN Architecture for Speech Anti-Spoofing.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

AnalyticKWS: Towards Exemplar-Free Analytic Class Incremental Learning for Small-footprint Keyword Spotting.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Configurable DOA Estimation using Incremental Learning.
CoRR, 2024

WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System.
CoRR, 2024

FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels.
CoRR, 2024

Dual Knowledge Distillation for Efficient Sound Event Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024

Advancing Airport Tower Command Recognition: Integrating Squeeze-and-Excitation and Broadcasted Residual Learning.
Proceedings of the International Conference on Asian Language Processing, 2024

2023
Small Footprint Multi-channel Network for Keyword Spotting with Centroid Based Awareness.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Continual Learning For On-Device Environmental Sound Classification.
CoRR, 2022

Small Footprint Multi-channel ConvMixer for Keyword Spotting with Centroid Based Awareness.
CoRR, 2022

Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Continual Learning for On-Ddevice Environmental Sound Classification.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022


  Loading...