Hui Zhang

Affiliations:
  • Inner Mongolia University, College of Computer Science, Inner Mongolia Key Laboratory of Mongolian Information Processing Technology, Hohhot, China


According to our database, Hui Zhang authored at least 44 papers between 2014 and 2023.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2023
ScaleFormer: Transformer-based speech enhancement in the multi-scale time domain.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
MNASR: A Free Speech Corpus For Mongolian Speech Recognition And Accompanied Baselines.
Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022

Speaker recognition-assisted robust audio deepfake detection.
Proceedings of the Interspeech 2022, 2022

Alleviating the Loss-Metric Mismatch in Supervised Single-Channel Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2022

Alignment-Learning Based Single-Step Decoding for Accurate and Fast Non-Autoregressive Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Soft-BAC: Soft Bidirectional Alignment Cost for End-to-End Automatic Speech Recognition.
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021

Joint Alignment Learning-Attention Based Model for Grapheme-to-Phoneme Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Polishing the Classical Likelihood Ratio Test by Supervised Learning for Voice Activity Detection.
Proceedings of the Interspeech 2020, 2020

Deep Features Representation of Word Image for Keyword Spotting in Historical Mongolian Document Images.
Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020

Multi-Task Learning Based Traditional Mongolian Words Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

An Efficient Joint Training Framework for Robust Small-Footprint Keyword Spotting.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

2019
A Monaural Speech Enhancement Method for Robust Small-Footprint Keyword Spotting.
CoRR, 2019

End-to-End Model for Offline Handwritten Mongolian Word Recognition.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

An Automatic Spelling Correction Method for Classical Mongolian.
Proceedings of the Knowledge Science, Engineering and Management, 2019

Investigation of Cost Function for Supervised Monaural Speech Separation.
Proceedings of the Interspeech 2019, 2019

UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-Noise Ratio Condition.
Proceedings of the Interspeech 2019, 2019

Woodblock-Printing Mongolian Words Recognition by Bi-LSTM with Attention Mechanism.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Supervised Speech Enhancement with Real Spectrum Approximation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Word Image Representation Based on Sequence to Sequence Model with Attention Mechanism for Out-of-Vocabulary Keyword Spotting.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019

Joint Training ResCNN-based Voice Activity Detection with Speech Enhancement.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Improve Data Utilization with Two-stage Learning in CNN-LSTM-based Voice Activity Detection.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Phonologically Aware BiLSTM Model for Mongolian Phrase Break Prediction with Attention Mechanism.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

End-to-End Mongolian Text-to-Speech System.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Using Shifted Real Spectrum Mask as Training Target for Supervised Speech Separation.
Proceedings of the Interspeech 2018, 2018

Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.
Proceedings of the Interspeech 2018, 2018

Word Image Representation Based on Visual Embeddings and Spatial Constraints for Keyword Spotting on Historical Documents.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Training Supervised Speech Separation System to Improve STOI and PESQ Directly.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A LSTM Approach with Sub-Word Embeddings for Mongolian Phrase Break Prediction.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017
Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising.
CoRR, 2017

Integrating Visual Word Embeddings into Translation Language Model for Keyword Spotting on Historical Mongolian Document Images.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Multi-Target Ensemble Learning for Monaural Speech Separation.
Proceedings of the Interspeech 2017, 2017

Using Word Mover's Distance with Spatial Constraints for Measuring Similarity Between Mongolian Word Images.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

Representing word image using visual word embeddings and RNN for keyword spotting on historical document images.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Segmentation-Free Printed Traditional Mongolian OCR Using Sequence to Sequence with Attention Model.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

2016
A Pairwise Algorithm Using the Deep Stacking Network for Speech Separation and Pitch Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Jointly Optimizing Activation Coefficients of Convolutive NMF Using DNN for Speech Separation.
Proceedings of the Interspeech 2016, 2016

Convolutional neural network for robust pitch determination.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Comparison on Neural Network based acoustic model in Mongolian speech recognition.
Proceedings of the 2016 International Conference on Asian Language Processing, 2016

2015
A pairwise algorithm for pitch estimation and speech separation using deep stacking network.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Document summarization based on semantic representations.
Proceedings of the 2015 International Conference on Asian Language Processing, 2015

Mongolian Speech Recognition Based on Deep Neural Networks.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2015

2014
Deep stacking networks with time series for speech separation.
Proceedings of the IEEE International Conference on Acoustics, 2014

Missing feature reconstruction methods for robust speaker identification.
Proceedings of the 22nd European Signal Processing Conference, 2014
