Wangjin Zhou

Orcid: 0009-0007-0693-5316

According to our database1, Wangjin Zhou authored at least 15 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
SONAR: Self-Distilled Continual Pre-training for Domain Adaptive Audio Representation.
CoRR, September, 2025

Simple and Effective Content Encoder for Singing Voice Conversion via SSL-Embedding Dimension Reduction.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

InvoxSVC: Any-to-any Zero-shot Singing Voice Conversion with In-Context Learning in Latent Flow Matching.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

2024
Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification.
CoRR, 2024

Zero-Shot Sing Voice Conversion: built upon clustering-based phoneme representations.
CoRR, 2024

LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation.
CoRR, 2024

Disentangling Age and Identity with a Mutual Information Minimization for Cross-Age Speaker Verification.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2024

Enhancing Realism in 3D Facial Animation Using Conformer-Based Generation and Automated Post-Processing.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
KyotoMOS: An Automatic MOS Scoring System for Speech Synthesis.
Proceedings of the ACM Multimedia Asia Workshops, 2023

The Kyoto Speech-to-Speech Translation System for IWSLT 2023.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

LE-SSL-MOS: Self-Supervised Learning MOS Prediction with Listener Enhancement.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Fusion of Self-supervised Learned Models for MOS Prediction.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Investigating Effective Domain Adaptation Method for Speaker Verification Task.
Proceedings of the Neural Information Processing - 29th International Conference, 2022


  Loading...