Shilei Zhang

Orcid: 0009-0008-6755-7926

According to our database1, Shilei Zhang authored at least 53 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Fault Diagnosis of Highway Machinery Hydraulic System Based on LS-TF.
IEEE Trans. Instrum. Meas., 2024

Plugin Speech Enhancement: A Universal Speech Enhancement Framework Inspired by Dynamic Neural Network.
CoRR, 2024

2023
Harmonic Attention for Monaural Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Cascaded Multi-task Adaptive Learning Based on Neural Architecture Search.
CoRR, 2023

MFAS: Emotion Recognition through Multiple Perspectives Fusion Architecture Search Emulating Human Cognition.
CoRR, 2023

VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2023

Semi-Supervised Speech Enhancement Based On Speech Purity.
Proceedings of the IEEE International Conference on Acoustics, 2023

Noise-robust Pitch Detection Based on Super-Resolution Harmonics.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Two-stage streaming keyword detection and localization with multi-scale depthwise temporal convolution.
Neural Networks, 2022

Meta Auxiliary Learning for Low-resource Spoken Language Understanding.
Proceedings of the Interspeech 2022, 2022

HGCN: Harmonic Gated Compensation Network for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2022

Harmonic Gated Compensation Network Plus for ICASSP 2022 DNS Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Demosaicking enhancement method for color image based on low-rank constraint.
J. Electronic Imaging, 2021

Detecting Escalation Level from Speech with Transfer Learning and Acoustic-Lexical Information Fusion.
CoRR, 2021

Feasibility Analysis of Machine Learning Optimization on GPU-based Low-cost Edges.
Proceedings of the 2021 IEEE SmartWorld, 2021

DynGraphTrans: Dynamic Graph Embedding via Modified Universal Transformer Networks for Financial Transaction Data.
Proceedings of the IEEE International Conference on Smart Data Services, 2021

Our Learned Lessons from Cross-Lingual Speaker Verification: The CRMI-DKU System Description for the Short-Duration Speaker Verification Challenge 2021.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Boundary and Context Aware Training for CIF-Based Non-Autoregressive End-to-End ASR.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
A virtual-physical collision detection interface for AR-based interactive teaching of robot.
Robotics Comput. Integr. Manuf., 2020

Gearbox fault diagnosis using data fusion based on self-organizing map neural network.
Int. J. Distributed Sens. Networks, 2020

2019
A Method to Accelerate and Visualize Iterative Clinical Paper Searching.
Proceedings of the MEDINFO 2019: Health and Wellbeing e-Networks for All, 2019

Facilitating Clinical Trial Recruitment by Recommending Cost-Efficient Medical Exams.
Proceedings of the MEDINFO 2019: Health and Wellbeing e-Networks for All, 2019

Few-Shot Audio Classification with Attentional Graph Neural Networks.
Proceedings of the Interspeech 2019, 2019

2018
Identity-Enhanced Network for Facial Expression Recognition.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Targeting Self-Binding Peptides as a Novel Strategy To Regulate Protein Activity and Function: A Case Study on the Proto-oncogene Tyrosine Protein Kinase <i>c</i>-Src.
J. Chem. Inf. Model., 2017

Autoencoder Regularized Network For Driving Style Representation Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Emotion recognition with multimodal features and temporal models.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

Load balancing for multiple controllers in SDN based on switches group.
Proceedings of the 19th Asia-Pacific Network Operations and Management Symposium, 2017

Service failure diagnosis in service function chain.
Proceedings of the 19th Asia-Pacific Network Operations and Management Symposium, 2017

2016
Placement and motion Optimization of redundant Maxillofacial Surgical robot with New dexterity Measure.
Int. J. Robotics Autom., 2016

Speaker diarization system for autism children's real-life audio data.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Text-independent voice conversion using deep neural network based phonetic level features.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Rapid feature space MLLR speaker adaptation for deep neural network acoustic modeling.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Wake-up-word spotting using end-to-end deep neural network system.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Video emotion recognition in the wild based on fusion of multimodal features.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

2015
Self-Binding Peptides: Folding or Binding?
J. Chem. Inf. Model., 2015

2014
Automatic Variance Analysis of Multistage Care Pathways.
Proceedings of the e-Health - For Continuity of Care - Proceedings of MIE2014, the 25th European Medical Informatics Conference, Istanbul, Turkey, August 31, 2014

2013
Semi-supervised accent detection and modeling.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Model dimensionality selection in bilinear transformation for feature space MLLR rapid speaker adaptation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Rapid feature space MLLR speaker adaptation with bilinear models.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Conflict Detection Based on Improved Unscented Particle Filter.
Proceedings of the Information and Automation - International Symposium, 2010

Spoken English assessment system for non-native speakers using acoustic and prosodic features.
Proceedings of the INTERSPEECH 2010, 2010

Improved Mandarin Keyword Spotting Using Confusion Garbage Model.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Automatic Pronunciation Transliteration for Chinese-English Mixed Language Keyword Spotting.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Modeling Syllable-Based Pronunciation Variation for Accented Mandarin Speech Recognition.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

The 2009 IBM GALE Mandarin broadcast transcription system.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Main vowel domain tone modeling with lexical and prosodic analysis for Mandarin ASR.
Proceedings of the IEEE International Conference on Acoustics, 2009

Utterance verification using improved confidence measures based on alignment confusion rate in Chinese digits recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Recent advances in the IBM GALE Mandarin transcription system.
Proceedings of the IEEE International Conference on Acoustics, 2008

2006
Robust Target Speaker Tracking in Broadcast TV Streams.
Int. J. Comput. Linguistics Chin. Lang. Process., 2006

Fast SVM training based on the choice of effective samples for audio classification.
Proceedings of the INTERSPEECH 2006, 2006

A Two-level Method for Unsupervised Speaker-based Audio Segmentation.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

2005
Optimal model order selection based on regression tree in speaker identification.
Proceedings of the INTERSPEECH 2005, 2005


  Loading...