Wei Rao

Orcid: 0000-0002-7237-0874

Affiliations:

Tencent Corporation, Tencent Ethereal Audio Lab, Shenzhen, China
National University of Singapore, HLT Lab, Department of Electrical and Computer Engineering, Singapore (2018-2020)
Nanyang Technological University (NTU), Singapore (2015-2018)
Hong Kong Polytechnic University, Hong Kong (PhD 2015)

According to our database¹, Wei Rao authored at least 58 papers between 2010 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

A Robust Coverless Audio Steganography Based on Differential Privacy Clustering.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2025

2024

Diffusion-Based Method with TTS Guidance for Foreign Accent Conversion.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Hierarchical Speaker Representation for Target Speaker Extraction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Joint Training or Not: An Exploration of Pre-trained Speech Models in Audio-Visual Speaker Diarization.

[BibT_eX]

[DOI]

CoRR, 2023

The FlySpeech Audio-Visual Speaker Diarization System for MISP Challenge 2022.

[BibT_eX]

[DOI]

CoRR, 2023

Distance-based Weight Transfer from Near-field to Far-field Speaker Verification.

[BibT_eX]

[DOI]

CoRR, 2023

Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Distance-Based Weight Transfer for Fine-Tuning From Near-Field to Far-Field Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 Dns-Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Speech Enhancement with Intelligent Neural Homomorphic Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Gesper: A Unified Framework for General Speech Restoration.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Inter-Subnet: Speech Enhancement with Subband Interaction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Local-global speaker representation for target speaker extraction.

[BibT_eX]

[DOI]

CoRR, 2022

Spatial-DCCRN: DCCRN Equipped with Frame-Level Angle Feature and Hybrid Filtering for Multi-Channel Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

TEA-PSE 2.0: Sub-Band Network for Real-Time Personalized Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Boosting the Performance of SpEx+ by Attention and Contextual Mechanism.

[BibT_eX]

[DOI]

Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Speech Enhancement with Fullband-Subband Cross-Attention Network.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

TEA-PSE: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System for ICASSP 2022 DNS Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Target Speaker Verification With Selective Auditory Attention for Single and Multi-Talker Speech.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing.

[BibT_eX]

[DOI]

CoRR, 2021

Adversarial Training for Multi-domain Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Improving Channel Decorrelation for Multi-Channel Target Speech Extraction.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Conferencingspeech Challenge: Towards Far-Field Multi-Channel Speech Enhancement for Video Conferencing.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

SpEx: Multi-Scale Time Domain Speaker Extraction Network.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation.

[BibT_eX]

[DOI]

CoRR, 2020

The FFSVC 2020 Evaluation Plan.

[BibT_eX]

[DOI]

CoRR, 2020

The INTERSPEECH 2020 Far-Field Speaker Verification Challenge.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

HLT-NUS Submission for 2019 NIST Multimedia Speaker Recognition Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.

[BibT_eX]

[DOI]

CoRR, 2019

Target Speaker Extraction for Overlapped Multi-Talker Speaker Verification.

[BibT_eX]

[DOI]

CoRR, 2019

Target Speaker Extraction for Multi-Talker Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Time-Domain Speaker Extraction Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018

A Shifted Delta Coefficient Objective for Monaural Speech Separation Using Multi-task Learning.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Single Channel Speech Separation with Constrained Utterance Level Permutation Invariant Training Using Grid LSTM.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised Domain Adaptation via Domain Adversarial Training for Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016.

[BibT_eX]

[DOI]

Achintya Kumar Sarkar

Fahimeh Bahmaninezhad

Sergey Isadskiy

Christian Rathgeb

Christoph Busch

Georgios Tzimiropoulos

Pierre-Michel Bousquet

Dennis Alexander Lehmann Thomsen

Jean-François Bonastre

Eliathamby Ambikairajah

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

An investigation of spectral feature partitioning for replay attacks detection.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Sparse kernel machines with empirical kernel maps for PLDA speaker verification.

[BibT_eX]

[DOI]

Wei Rao

Man-Wai Mak

Comput. Speech Lang., 2016

Fantastic 4 system for NIST 2015 Language Recognition Evaluation.

[BibT_eX]

[DOI]

CoRR, 2016

Neural networks based channel compensation for i-vector speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

I-vector based deep neural network acoustic model adaptation using multilingual language resource.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015

Normalization of total variability matrix for i-vector/PLDA speaker verification.

[BibT_eX]

[DOI]

Wei Rao

Man-Wai Mak

Kong-Aik Lee

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Relevance vector machines with empirical likelihood-ratio kernels for PLDA speaker verification.

[BibT_eX]

[DOI]

Wei Rao

Man-Wai Mak

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

PLDA modeling in the fishervoice subspace for speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Construction of discriminative Kernels from known and unknown non-targets for PLDA-SVM scoring.

[BibT_eX]

[DOI]

Wei Rao

Man-Wai Mak

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Boosting the Performance of I-Vector Based Speaker Verification via Utterance Partitioning.

[BibT_eX]

[DOI]

Wei Rao

Man-Wai Mak

IEEE Trans. Speech Audio Process., 2013

Likelihood-ratio empirical kernels for i-vector based PLDA-SVM scoring.

[BibT_eX]

[DOI]

Man-Wai Mak

Wei Rao

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Utterance partitioning with acoustic vector resampling for i-vector based speaker verification.

[BibT_eX]

[DOI]

Wei Rao

Man-Wai Mak

Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Alleviating the small sample-size problem in i-vector based speaker verification.

[BibT_eX]

[DOI]

Wei Rao

Man-Wai Mak

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

2011

Utterance partitioning with acoustic vector resampling for GMM-SVM speaker verification.

[BibT_eX]

[DOI]

Man-Wai Mak

Wei Rao

Speech Commun., 2011

Addressing the Data-Imbalance Problem in Kernel-Based Speaker Verification via Utterance Partitioning and Speaker Comparison.

[BibT_eX]

[DOI]

Wei Rao

Man-Wai Mak

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

The HKCUPU system for the NIST 2010 speaker recognition evaluation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Acoustic vector resampling for GMMSVM-based speaker verification.

[BibT_eX]

[DOI]

Man-Wai Mak

Wei Rao

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Wei Rao

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...