Jiawen Kang

Affiliations:
  • Chinese University of Hong Kong, Human-Computer Communications Laboratory, Hong Kong
  • Tsinghua University, Center for Speech and Language Technologies (CSLT), BNRist, Beijing, China (former)


According to our database1, Jiawen Kang authored at least 25 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Detecting Neurocognitive Disorders through Analyses of Topic Evolution and Cross-modal Consistency in Visual-Stimulated Narratives.
CoRR, January, 2025

Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech.
CoRR, 2024

Exploring SSL Discrete Speech Features for Zipformer-based Contextual ASR.
CoRR, 2024

Purple-teaming LLMs with Adversarial Defender Training.
CoRR, 2024

Improving Grapheme-to-Phoneme Conversion through In-Context Knowledge Retrieval with Large Language Models.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Not All Errors Are Equal: Investigation of Speech Recognition Errors in Alzheimer's Disease Detection.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Cross-Speaker Encoding Network for Multi-Talker Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning.
CoRR, 2023

Integrated and Enhanced Pipeline System to Support Spoken Language Analytics for Screening Neurocognitive Disorders.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Sidecar Separator Can Convert A Single-Talker Speech Recognition System to A Multi-Talker One.
Proceedings of the IEEE International Conference on Acoustics, 2023

The Defender's Perspective on Automatic Speaker Verification: An Overview.
Proceedings of the Workshop on Deepfake Audio Detection and Analysis co-located with 32th International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

2022
A Principle Solution for Enroll-Test Mismatch in Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

CN-Celeb: Multi-genre speaker recognition.
Speech Commun., 2022

Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Spoofing-Aware Speaker Verification by Multi-Level Fusion.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

The CUHK-Tencent Speaker Diarization System for the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2022

TalkTive: A Conversational Agent Using Backchannels to Engage Older Adults in Neurocognitive Disorders Screening.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

2021
Squeezing Value of Cross-Domain Labels: A Decoupled Scoring Approach for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

CN-Celeb: A Challenging Chinese Speaker Recognition Dataset.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020


  Loading...