Wenhan Yao

ORCID: 0000-0003-1014-9565

According to our database, Wenhan Yao authored at least 12 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2025
Imperceptible rhythm backdoor attacks: Exploring rhythm transformation for embedding undetectable vulnerabilities on speech recognition.
Neurocomputing, 2025

LRBA: Stealthy Backdoor Attacks on Speech Classification via Latent Rearrangement in VITS.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

SPBA: Utilizing Speech Large Language Model for Backdoor Attacks on Speech Classification Models.
Proceedings of the International Joint Conference on Neural Networks, 2025

Pureformer-VC: Non-parallel Voice Conversion with Pure Stylized Transformer Blocks and Triplet Discriminative Training.
Proceedings of the International Joint Conference on Neural Networks, 2025

Emotional Text-to-Speech via Style Decoder with Emotion Shared Styleformer Block and RoPE Prior Encoder.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2025, 2025

Art Style Backdoor Attacks on Semantic Segmentation Models.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2025, 2025

TimbreAdv: Timbre Adversarial Attacks on Speaker Verification Systems.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2025, 2025

2024
EmoAttack: Utilizing Emotional Voice Conversion for Speech Backdoor Attacks on Deep Speech Classification Models.
CoRR, 2024

2022
StyleFormerGAN-VC: Improving Effect of few shot Cross-Lingual Voice Conversion Using VAE-StarGAN and Attention-AdaIN.
Proceedings of the 24th IEEE/ACIS International Conference on Software Engineering, 2022

A New Spoken Language Teaching Tech: Combining Multi-attention and AdaIN for One-shot Cross Language Voice Conversion.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

AdaptiveFormer: A Few-shot Speaker Adaptive Speech Synthesis Model based on FastSpeech2.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Voicifier-LN: An Novel Approach to Elevate the Speaker Similarity for General Zero-shot Multi-Speaker TTS.
Proceedings of the 5th International Conference on Artificial Intelligence and Pattern Recognition, 2022
