Jiawen Kang

Affiliations:

Chinese University of Hong Kong, Human-Computer Communications Laboratory, Hong Kong
Tsinghua University, Center for Speech and Language Technologies (CSLT), BNRist, Beijing, China (former)

According to our database, Jiawen Kang authored at least 26 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

Detecting Neurocognitive Disorders through Analyses of Topic Evolution and Cross-modal Consistency in Visual-Stimulated Narratives.

CoRR, January, 2025

Exploring SSL Discrete Speech Features for Zipformer-based Contextual ASR.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

On the Within-class Variation Issue in Alzheimer's Disease Detection.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech.

CoRR, 2024

Purple-teaming LLMs with Adversarial Defender Training.

CoRR, 2024

Improving Grapheme-to-Phoneme Conversion through In-Context Knowledge Retrieval with Large Language Models.

Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Not All Errors Are Equal: Investigation of Speech Recognition Errors in Alzheimer's Disease Detection.

Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Cross-Speaker Encoding Network for Multi-Talker Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning.

CoRR, 2023

Integrated and Enhanced Pipeline System to Support Spoken Language Analytics for Screening Neurocognitive Disorders.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Sidecar Separator Can Convert A Single-Talker Speech Recognition System to A Multi-Talker One.

Proceedings of the IEEE International Conference on Acoustics, 2023

The Defender's Perspective on Automatic Speaker Verification: An Overview.

Proceedings of the Workshop on Deepfake Audio Detection and Analysis co-located with the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

2022

A Principle Solution for Enroll-Test Mismatch in Speaker Recognition.

IEEE/ACM Trans. Audio Speech Lang. Process., 2022