Wangjin Zhou

Orcid: 0009-0007-0693-5316

According to our database¹, Wangjin Zhou authored at least 15 papers between 2022 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

SONAR: Self-Distilled Continual Pre-training for Domain Adaptive Audio Representation.

[BibT_eX]

[DOI]

CoRR, September, 2025

Simple and Effective Content Encoder for Singing Voice Conversion via SSL-Embedding Dimension Reduction.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

InvoxSVC: Any-to-any Zero-shot Singing Voice Conversion with In-Context Learning in Latent Flow Matching.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

2024

Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification.

[BibT_eX]

[DOI]

CoRR, 2024

Zero-Shot Sing Voice Conversion: built upon clustering-based phoneme representations.

[BibT_eX]

[DOI]

CoRR, 2024

LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation.

[BibT_eX]

[DOI]

CoRR, 2024

Disentangling Age and Identity with a Mutual Information Minimization for Cross-Age Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Enhancing Realism in 3D Facial Animation Using Conformer-Based Generation and Automated Post-Processing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

KyotoMOS: An Automatic MOS Scoring System for Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Asia Workshops, 2023

The Kyoto Speech-to-Speech Translation System for IWSLT 2023.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Spoken Language Translation, 2023

LE-SSL-MOS: Self-Supervised Learning MOS Prediction with Listener Enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Fusion of Self-supervised Learned Models for MOS Prediction.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Investigating Effective Domain Adaptation Method for Speaker Verification Task.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 29th International Conference, 2022

Wangjin Zhou

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...