Xueliang Zhang

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Enhancing Multi-Channel Speech with Limited Microphones via Spherical Harmonic Transform.

[BibT_eX]

[DOI]

Jiahui Pan

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Vector Quantized Diffusion Model Based Speech Bandwidth Extension.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Attention-Based Beamformer For Multi-Channel Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

A Two-Stage Band-Split Mamba-2 Network For Music Separation.

[BibT_eX]

[DOI]

CoRR, 2024

Cross-Attention-Guided WaveNet for EEG-to-MEL Spectrogram Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Innovative Directional Encoding in Speech Processing: Leveraging Spherical Harmonics Injection for Multi-Channel Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Efficient Multi-Channel Speech Enhancement with Spherical Harmonics Injection for Directional Encoding.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Hierarchical Speaker Representation for Target Speaker Extraction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

3S-TSE: Efficient Three-Stage Target Speaker Extraction for Real-Time and Low-Resource Applications.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Cross-Attention-Guided Wavenet for Mel Spectrogram Reconstruction in The ICASSP 2024 Auditory EEG Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Neural Multi-Channel and Multi-Microphone Acoustic Echo Cancellation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement.

[BibT_eX]

[DOI]

CoRR, 2023

PDPCRN: Parallel Dual-Path CRN with Bi-directional Inter-Branch Interactions for Multi-Channel Speech Enhancement.

[BibT_eX]

[DOI]

CoRR, 2023

Speech Enhancement with Intelligent Neural Homomorphic Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

ScaleFormer: Transformer-based speech enhancement in the multi-scale time domain.

[BibT_eX]

[DOI]

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022

Fusing Bone-Conduction and Air-Conduction Sensors for Complex-Domain Speech Enhancement.

[BibT_eX]

[DOI]

Heming Wang

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Local-global speaker representation for target speaker extraction.

[BibT_eX]

[DOI]

CoRR, 2022

RAT: RNN-Attention Transformer for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Speakerfilter-Pro: an improved target speaker extractor combines the time domain and frequency domain.

[BibT_eX]

[DOI]

Shulin He

Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

LCSM: A Lightweight Complex Spectral Mapping Framework for Stereophonic Acoustic Echo Cancellation.

[BibT_eX]

[DOI]

Chenggang Zhang

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Speaker recognition-assisted robust audio deepfake detection.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Complex Spectral Mapping with Inplace Convolution Recurrent Neural Networks For Acoustic Echo Cancellation.

[BibT_eX]

[DOI]

Chenggang Zhang

Proceedings of the IEEE International Conference on Acoustics, 2022

A Robust Deep Audio Splicing Detection Method via Singularity Detection Feature.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Alleviating the Loss-Metric Mismatch in Supervised Single-Channel Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Attention-Based Fusion for Bone-Conducted and Air-Conducted Speech Enhancement in the Complex Domain.

[BibT_eX]

[DOI]

Heming Wang

Proceedings of the IEEE International Conference on Acoustics, 2022

DRC-NET: Densely Connected Recurrent Convolutional Neural Network for Speech Dereverberation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Deep Learning Based Real-Time Speech Enhancement for Dual-Microphone Mobile Phones.

[BibT_eX]

[DOI]

Ke Tan

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Recurrent Neural Networks and Acoustic Features for Frame-Level Signal-to-Noise Ratio Estimation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Guided Training: A Simple Method for Single-channel Speaker Separation.

[BibT_eX]

[DOI]

CoRR, 2021

DBNet: A Dual-Branch Network Architecture Processing on Spectrum and Waveform for Single-Channel Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Inplace Gated Convolutional Recurrent Neural Network for Dual-Channel Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Real-Time Speech Enhancement for Mobile Communication Based on Dual-Channel Complex Spectral Mapping.

[BibT_eX]

[DOI]

Ke Tan

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

A Joint Framework of Denoising Autoencoder and Generative Vocoder for Monaural Speech Enhancement.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

A Robust and Cascaded Acoustic Echo Cancellation Based on Deep Learning.

[BibT_eX]

[DOI]

Chenggang Zhang

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Polishing the Classical Likelihood Ratio Test by Supervised Learning for Voice Activity Detection.

[BibT_eX]

[DOI]

Tianjiao Xu

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Frame-Level Signal-to-Noise Ratio Estimation Using Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Double Adversarial Network Based Monaural Speech Enhancement for Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

An Efficient Joint Training Framework for Robust Small-Footprint Keyword Spotting.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 27th International Conference, 2020

Beamformed Feature for Learning-based Dual-channel Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Speakerfilter: Deep Learning-Based Target Speaker Extraction Using Anchor Speech.

[BibT_eX]

[DOI]

Shulin He

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Robust Speech Dereverberation Based on WPE and Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

Robust Speaker Localization Guided by Deep Learning-Based Time-Frequency Masking.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

A Monaural Speech Enhancement Method for Robust Small-Footprint Keyword Spotting.

[BibT_eX]

[DOI]

CoRR, 2019

Investigation of Cost Function for Supervised Monaural Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Robust Text-independent Speaker Verification Method Based on Speech Separation and Deep Speaker.

[BibT_eX]

[DOI]

Fei Zhao

Proceedings of the IEEE International Conference on Acoustics, 2019

Real-time Speech Enhancement Using an Efficient Convolutional Recurrent Network for Dual-microphone Mobile Phones in Close-talk Scenarios.

[BibT_eX]

[DOI]

Ke Tan

Proceedings of the IEEE International Conference on Acoustics, 2019

Supervised Speech Enhancement with Real Spectrum Approximation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Joint Training ResCNN-based Voice Activity Detection with Speech Enhancement.

[BibT_eX]

[DOI]

Tianjiao Xu

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Improve Data Utilization with Two-stage Learning in CNN-LSTM-based Voice Activity Detection.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Dynamic-attention based Encoder-decoder model for Speaker Extraction with Anchor speech.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Investigation of Monaural Front-End Processing for Robust Speech Recognition Without Retraining or Joint-Training.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Deep Learning Based Speech Separation via NMF-Style Reconstructions.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Investigation of Monaural Front-End Processing for Robust ASR without Retraining or Joint-Training.

[BibT_eX]

[DOI]

CoRR, 2018

End-to-End Mongolian Text-to-Speech System.

[BibT_eX]

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Robust TDOA Estimation Based on Time-Frequency Masking and Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Using Shifted Real Spectrum Mask as Training Target for Supervised Speech Separation.

[BibT_eX]

[DOI]

Yun Liu

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Training Supervised Speech Separation System to Improve STOI and PESQ Directly.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Online Direction of Arrival Estimation Based on Deep Learning.

[BibT_eX]

[DOI]

Qinglong Li

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Deep Learning Based Binaural Speech Separation in Reverberant Environments.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising.

[BibT_eX]

[DOI]

CoRR, 2017

Multi-Target Ensemble Learning for Monaural Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Binaural Reverberant Speech Separation Based on Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A speech enhancement algorithm by iterating single- and multi-microphone processing and its application to robust ASR.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Using optimal ratio mask as training target for supervised speech separation.

[BibT_eX]

[DOI]

Shasha Xia

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

A Pairwise Algorithm Using the Deep Stacking Network for Speech Separation and Pitch Estimation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Jointly Optimizing Activation Coefficients of Convolutive NMF Using DNN for Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Convolutional neural network for robust pitch determination.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exploiting spectro-temporal structures using NMF for DNN-based supervised speech separation.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Joint optimization of recurrent networks exploiting source auto-regression for source separation.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Two-stage multi-target joint learning for monaural speech separation.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A pairwise algorithm for pitch estimation and speech separation using deep stacking network.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Document summarization based on semantic representations.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Asian Language Processing, 2015

2014

Deep stacking networks with time series for speech separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Missing feature reconstruction methods for robust speaker identification.

[BibT_eX]

[DOI]

Proceedings of the 22nd European Signal Processing Conference, 2014

2012

Hidden Markov Model for Term Weighting in Verbose Queries.

[BibT_eX]

[DOI]

Proceedings of the Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics, 2012

2011

Monaural voiced speech segregation based on elaborate harmonic grouping strategies.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2011

Monaural Voiced Speech Segregation Based on Pitch and Comb Filter.

[BibT_eX]

[DOI]

Wenju Liu

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010

Monaural Voiced Speech Segregation Based on Dynamic Harmonic Function.

[BibT_eX]

[DOI]