Xueliang Zhang

Orcid: 0000-0002-0406-1105

Affiliations:
  • Inner Mongolia University, College of Computer Science, Inner Mongolia Key Laboratory of Mongolian Information Processing Technology, China
  • Chinese Academy and Sciences, National Laboratory of Pattern Recognition, NLPR, Institute of Automation, China


According to our database1, Xueliang Zhang authored at least 72 papers between 2008 and 2023.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Neural Multi-Channel and Multi-Microphone Acoustic Echo Cancellation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Efficient Multi-Channel Speech Enhancement with Spherical Harmonics Injection for Directional Encoding.
CoRR, 2023

Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement.
CoRR, 2023

PDPCRN: Parallel Dual-Path CRN with Bi-directional Inter-Branch Interactions for Multi-Channel Speech Enhancement.
CoRR, 2023

Speech Enhancement with Intelligent Neural Homomorphic Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2023

ScaleFormer: Transformer-based speech enhancement in the multi-scale time domain.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Fusing Bone-Conduction and Air-Conduction Sensors for Complex-Domain Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Local-global speaker representation for target speaker extraction.
CoRR, 2022

RAT: RNN-Attention Transformer for Speech Enhancement.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Speakerfilter-Pro: an improved target speaker extractor combines the time domain and frequency domain.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

LCSM: A Lightweight Complex Spectral Mapping Framework for Stereophonic Acoustic Echo Cancellation.
Proceedings of the Interspeech 2022, 2022

Speaker recognition-assisted robust audio deepfake detection.
Proceedings of the Interspeech 2022, 2022

A Complex Spectral Mapping with Inplace Convolution Recurrent Neural Networks For Acoustic Echo Cancellation.
Proceedings of the IEEE International Conference on Acoustics, 2022

A Robust Deep Audio Splicing Detection Method via Singularity Detection Feature.
Proceedings of the IEEE International Conference on Acoustics, 2022

Alleviating the Loss-Metric Mismatch in Supervised Single-Channel Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2022

Attention-Based Fusion for Bone-Conducted and Air-Conducted Speech Enhancement in the Complex Domain.
Proceedings of the IEEE International Conference on Acoustics, 2022

DRC-NET: Densely Connected Recurrent Convolutional Neural Network for Speech Dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Deep Learning Based Real-Time Speech Enhancement for Dual-Microphone Mobile Phones.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Recurrent Neural Networks and Acoustic Features for Frame-Level Signal-to-Noise Ratio Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Guided Training: A Simple Method for Single-channel Speaker Separation.
CoRR, 2021

DBNet: A Dual-Branch Network Architecture Processing on Spectrum and Waveform for Single-Channel Speech Enhancement.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Inplace Gated Convolutional Recurrent Neural Network for Dual-Channel Speech Enhancement.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Real-Time Speech Enhancement for Mobile Communication Based on Dual-Channel Complex Spectral Mapping.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
A Joint Framework of Denoising Autoencoder and Generative Vocoder for Monaural Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

A Robust and Cascaded Acoustic Echo Cancellation Based on Deep Learning.
Proceedings of the Interspeech 2020, 2020

Polishing the Classical Likelihood Ratio Test by Supervised Learning for Voice Activity Detection.
Proceedings of the Interspeech 2020, 2020

Frame-Level Signal-to-Noise Ratio Estimation Using Deep Learning.
Proceedings of the Interspeech 2020, 2020

Double Adversarial Network Based Monaural Speech Enhancement for Robust Speech Recognition.
Proceedings of the Interspeech 2020, 2020

An Efficient Joint Training Framework for Robust Small-Footprint Keyword Spotting.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

Beamformed Feature for Learning-based Dual-channel Speech Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Speakerfilter: Deep Learning-Based Target Speaker Extraction Using Anchor Speech.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Robust Speech Dereverberation Based on WPE and Deep Learning.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Robust Speaker Localization Guided by Deep Learning-Based Time-Frequency Masking.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

A Monaural Speech Enhancement Method for Robust Small-Footprint Keyword Spotting.
CoRR, 2019

Investigation of Cost Function for Supervised Monaural Speech Separation.
Proceedings of the Interspeech 2019, 2019

A Robust Text-independent Speaker Verification Method Based on Speech Separation and Deep Speaker.
Proceedings of the IEEE International Conference on Acoustics, 2019

Real-time Speech Enhancement Using an Efficient Convolutional Recurrent Network for Dual-microphone Mobile Phones in Close-talk Scenarios.
Proceedings of the IEEE International Conference on Acoustics, 2019

Supervised Speech Enhancement with Real Spectrum Approximation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Joint Training ResCNN-based Voice Activity Detection with Speech Enhancement.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Improve Data Utilization with Two-stage Learning in CNN-LSTM-based Voice Activity Detection.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Dynamic-attention based Encoder-decoder model for Speaker Extraction with Anchor speech.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Investigation of Monaural Front-End Processing for Robust Speech Recognition Without Retraining or Joint-Training.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Deep Learning Based Speech Separation via NMF-Style Reconstructions.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Investigation of Monaural Front-End Processing for Robust ASR without Retraining or Joint-Training.
CoRR, 2018

End-to-End Mongolian Text-to-Speech System.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Robust TDOA Estimation Based on Time-Frequency Masking and Deep Neural Networks.
Proceedings of the Interspeech 2018, 2018

Using Shifted Real Spectrum Mask as Training Target for Supervised Speech Separation.
Proceedings of the Interspeech 2018, 2018

Training Supervised Speech Separation System to Improve STOI and PESQ Directly.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Online Direction of Arrival Estimation Based on Deep Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Deep Learning Based Binaural Speech Separation in Reverberant Environments.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising.
CoRR, 2017

Multi-Target Ensemble Learning for Monaural Speech Separation.
Proceedings of the Interspeech 2017, 2017

Binaural Reverberant Speech Separation Based on Deep Neural Networks.
Proceedings of the Interspeech 2017, 2017

A speech enhancement algorithm by iterating single- and multi-microphone processing and its application to robust ASR.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Using optimal ratio mask as training target for supervised speech separation.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
A Pairwise Algorithm Using the Deep Stacking Network for Speech Separation and Pitch Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Jointly Optimizing Activation Coefficients of Convolutive NMF Using DNN for Speech Separation.
Proceedings of the Interspeech 2016, 2016

Convolutional neural network for robust pitch determination.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exploiting spectro-temporal structures using NMF for DNN-based supervised speech separation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Joint optimization of recurrent networks exploiting source auto-regression for source separation.
Proceedings of the INTERSPEECH 2015, 2015

Two-stage multi-target joint learning for monaural speech separation.
Proceedings of the INTERSPEECH 2015, 2015

A pairwise algorithm for pitch estimation and speech separation using deep stacking network.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Document summarization based on semantic representations.
Proceedings of the 2015 International Conference on Asian Language Processing, 2015

2014
Deep stacking networks with time series for speech separation.
Proceedings of the IEEE International Conference on Acoustics, 2014

Missing feature reconstruction methods for robust speaker identification.
Proceedings of the 22nd European Signal Processing Conference, 2014

2012
Hidden Markov Model for Term Weighting in Verbose Queries.
Proceedings of the Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics, 2012

2011
Monaural voiced speech segregation based on elaborate harmonic grouping strategies.
Sci. China Inf. Sci., 2011

Monaural Voiced Speech Segregation Based on Pitch and Comb Filter.
Proceedings of the INTERSPEECH 2011, 2011

2010
Monaural Voiced Speech Segregation Based on Dynamic Harmonic Function.
EURASIP J. Audio Speech Music. Process., 2010

2009
Monaural voiced speech segregation based on elaborate harmonic grouping strategy.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Multipitch Detection Based on Weighted Summary Correlogram.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008


  Loading...