Wenhao Guan

According to our database1, Wenhao Guan authored at least 23 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Topo-Field: Topometric Mapping With Brain-Inspired Hierarchical Layout-Object-Position Fields.
IEEE Robotics Autom. Lett., June, 2025

ReFlow-VC: Zero-shot Voice Conversion Based on Rectified Flow and Speaker Feature Optimization.
CoRR, June, 2025

DS-Codec: Dual-Stage Training with Mirror-to-NonMirror Architecture Switching for Speech Codec.
CoRR, May, 2025

Discl-VC: Disentangled Discrete Tokens and In-Context Learning for Controllable Zero-Shot Voice Conversion.
CoRR, May, 2025

SlimSpeech: Lightweight and Efficient Text-to-Speech with Slim Rectified Flow.
CoRR, April, 2025

SlimSpeech: Lightweight and Efficient Text-to-Speech with Slim Rectified Flow.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Dynamic Language Group-based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Zero-Shot Sing Voice Conversion: built upon clustering-based phoneme representations.
CoRR, 2024

Dynamic Language Group-Based MoE: Enhancing Efficiency and Flexibility for Code-Switching Speech Recognition.
CoRR, 2024

LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation.
CoRR, 2024

LOP-Field: Brain-inspired Layout-Object-Position Fields for Robotic Scene Understanding.
CoRR, 2024

Enhancing Code-Switching Speech Recognition With LID-Based Collaborative Mixture of Experts Model.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

MinSpeech: A Corpus of Southern Min Dialect for Automatic Speech Recognition.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Efficient Integrated Features Based on Pre-trained Models for Speaker Verification.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird's-Eye View and Perspective View.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Improving Multi-Speaker ASR With Overlap-Aware Encoding And Monotonic Attention.
Proceedings of the IEEE International Conference on Acoustics, 2024

SR-HuBERT : An Efficient Pre-Trained Model for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024

Multivariate Fourier Distribution Perturbation: Domain Shifts with Uncertainty in Frequency Domain.
Proceedings of the IEEE International Conference on Acoustics, 2024

Reflow-TTS: A Rectified Flow Model for High-Fidelity Text-to-Speech.
Proceedings of the IEEE International Conference on Acoustics, 2024

MM-TTS: Multi-Modal Prompt Based Style Transfer for Expressive Text-to-Speech Synthesis.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Interpretable Style Transfer for Text-to-Speech with ControlVAE and Diffusion Bridge.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2018
Verifiable memory leakage-resilient dynamic searchable encryption.
J. High Speed Networks, 2018


  Loading...