Zexin Cai

According to our database, Zexin Cai authored at least 21 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Integrating frame-level boundary detection and deepfake detection for locating manipulated regions in partially spoofed audio forgery attacks.
Comput. Speech Lang., April, 2024

Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning.
CoRR, 2024

2023
Cross-lingual multi-speaker speech synthesis with limited bilingual training data.
Comput. Speech Lang., 2023

The DKU-DUKEECE System for the Manipulation Region Location Task of ADD 2023.
CoRR, 2023

Electrolaryngeal speech enhancement based on a two stage framework with bottleneck feature refinement and voice conversion.
Biomed. Signal Process. Control., 2023

Waveform Boundary Detection for Partially Spoofed Audio.
Proceedings of the IEEE International Conference on Acoustics, 2023

Identifying Source Speakers for Voice Conversion Based Spoofing Attacks on Speaker Verification Systems.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Invertible Voice Conversion.
CoRR, 2022

SIG-VC: A Speaker Information Guided Zero-Shot Voice Conversion System for Both Human Beings and Machines.
Proceedings of the IEEE International Conference on Acoustics, 2022

2020
Training Wake Word Detection with Synthesized Speech Data on Confusion Words.
CoRR, 2020

Cross-lingual Multispeaker Text-to-Speech under Limited-Data Scenario.
CoRR, 2020

From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint.
Proceedings of the Interspeech 2020, 2020

2019
Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-Level Embedding Features.
Proceedings of the Interspeech 2019, 2019

F0 Contour Estimation Using Phonetic Feature in Electrolaryngeal Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Insights into End-to-End Learning Scheme for Language Identification.
CoRR, 2018

Unsupervised query by example spoken term detection using features concatenated with Self-Organizing Map distances.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

End-to-end Language Identification using NetFV and NetVLAD.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

The DKU-JNU-EMA Electromagnetic Articulography Database on Mandarin and Chinese Dialects with Tandem Feature based Acoustic-to-Articulatory Inversion.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Insights into End-to-End Learning Scheme for Language Identification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Deep Speaker Embeddings with Convolutional Neural Network on Supervector for Text-Independent Speaker Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

