Wei Rao

Orcid: 0000-0002-7237-0874

Affiliations:
  • Tencent Corporation, Tencent Ethereal Audio Lab, Shenzhen, China
  • National University of Singapore, HLT Lab, Department of Electrical and Computer Engineering, Singapore (2018-2020)
  • Nanyang Technological University (NTU), Singapore (2015-2018)
  • Hong Kong Polytechnic University, Hong Kong (PhD 2015)


According to our database1, Wei Rao authored at least 58 papers between 2010 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
A Robust Coverless Audio Steganography Based on Differential Privacy Clustering.
IEEE Trans. Multim., 2025

2024
Diffusion-Based Method with TTS Guidance for Foreign Accent Conversion.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Hierarchical Speaker Representation for Target Speaker Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker Diarization.
CoRR, 2023

The FlySpeech Audio-Visual Speaker Diarization System for MISP Challenge 2022.
CoRR, 2023

Distance-based Weight Transfer from Near-field to Far-field Speaker Verification.
CoRR, 2023

Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Distance-Based Weight Transfer for Fine-Tuning From Near-Field to Far-Field Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2023

TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 Dns-Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Speech Enhancement with Intelligent Neural Homomorphic Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2023

Gesper: A Unified Framework for General Speech Restoration.
Proceedings of the IEEE International Conference on Acoustics, 2023

Inter-Subnet: Speech Enhancement with Subband Interaction.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Local-global speaker representation for target speaker extraction.
CoRR, 2022

Spatial-DCCRN: DCCRN Equipped with Frame-Level Angle Feature and Hybrid Filtering for Multi-Channel Speech Enhancement.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

TEA-PSE 2.0: Sub-Band Network for Real-Time Personalized Speech Enhancement.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Boosting the Performance of SpEx+ by Attention and Contextual Mechanism.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Speech Enhancement with Fullband-Subband Cross-Attention Network.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

TEA-PSE: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System for ICASSP 2022 DNS Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Target Speaker Verification With Selective Auditory Attention for Single and Multi-Talker Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing.
CoRR, 2021

Adversarial Training for Multi-domain Speaker Recognition.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Improving Channel Decorrelation for Multi-Channel Target Speech Extraction.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Conferencingspeech Challenge: Towards Far-Field Multi-Channel Speech Enhancement for Video Conferencing.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
SpEx: Multi-Scale Time Domain Speaker Extraction Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation.
CoRR, 2020

The FFSVC 2020 Evaluation Plan.
CoRR, 2020

The INTERSPEECH 2020 Far-Field Speaker Verification Challenge.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

HLT-NUS Submission for 2019 NIST Multimedia Speaker Recognition Evaluation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
CoRR, 2019

Target Speaker Extraction for Overlapped Multi-Talker Speaker Verification.
CoRR, 2019

Target Speaker Extraction for Multi-Talker Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss.
Proceedings of the IEEE International Conference on Acoustics, 2019

Time-Domain Speaker Extraction Network.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
A Shifted Delta Coefficient Objective for Monaural Speech Separation Using Multi-task Learning.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Single Channel Speech Separation with Constrained Utterance Level Permutation Invariant Training Using Grid LSTM.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised Domain Adaptation via Domain Adversarial Training for Speaker Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017


An investigation of spectral feature partitioning for replay attacks detection.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Sparse kernel machines with empirical kernel maps for PLDA speaker verification.
Comput. Speech Lang., 2016

Fantastic 4 system for NIST 2015 Language Recognition Evaluation.
CoRR, 2016

Neural networks based channel compensation for i-vector speaker verification.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

I-vector based deep neural network acoustic model adaptation using multilingual language resource.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Normalization of total variability matrix for i-vector/PLDA speaker verification.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Relevance vector machines with empirical likelihood-ratio kernels for PLDA speaker verification.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

PLDA modeling in the fishervoice subspace for speaker verification.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Construction of discriminative Kernels from known and unknown non-targets for PLDA-SVM scoring.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Boosting the Performance of I-Vector Based Speaker Verification via Utterance Partitioning.
IEEE Trans. Speech Audio Process., 2013

Likelihood-ratio empirical kernels for i-vector based PLDA-SVM scoring.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Utterance partitioning with acoustic vector resampling for i-vector based speaker verification.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Alleviating the small sample-size problem in i-vector based speaker verification.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

2011
Utterance partitioning with acoustic vector resampling for GMM-SVM speaker verification.
Speech Commun., 2011

Addressing the Data-Imbalance Problem in Kernel-Based Speaker Verification via Utterance Partitioning and Speaker Comparison.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

The HKCUPU system for the NIST 2010 speaker recognition evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Acoustic vector resampling for GMMSVM-based speaker verification.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010


  Loading...