Yanhua Long

Orcid: 0000-0003-0924-408X

According to our database¹, Yanhua Long authored at least 71 papers between 2008 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Exploring Using Contrastive Learning for Improving BSRNN-Based Speech Enhancement.

[BibT_eX]

[DOI]

Circuits Syst. Signal Process., May, 2026

Zipper-LoRA: Dynamic Parameter Decoupling for Speech-LLM based Multilingual Speech Recognition.

[BibT_eX]

[DOI]

CoRR, March, 2026

Enroll-on-Wakeup: A First Comparative Study of Target Speech Extraction for Seamless Interaction in Real Noisy Human-Machine Dialogue Scenarios.

[BibT_eX]

[DOI]

CoRR, February, 2026

Bridging the gap: A comparative exploration of Speech-LLM and end-to-end architecture for multilingual conversational ASR.

[BibT_eX]

[DOI]

CoRR, January, 2026

A Language-Agnostic Hierarchical LoRA-MoE Architecture for CTC-based Multilingual ASR.

[BibT_eX]

[DOI]

CoRR, January, 2026

2025

Noisy Disentanglement with Tri-stage Training for Noise-Robust Speech Recognition.

[BibT_eX]

[DOI]

CoRR, September, 2025

SHNU Multilingual Conversational Speech Recognition System for INTERSPEECH 2025 MLC-SLM Challenge.

[BibT_eX]

[DOI]

CoRR, July, 2025

Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement.

[BibT_eX]

[DOI]

Ziling Huang

Haixin Guan

Yanhua Long

CoRR, May, 2025

Exploring the Potential of SSL Models for Sound Event Detection.

[BibT_eX]

[DOI]

CoRR, May, 2025

Enhanced cross-modal parallel training for improving end-to-end accented speech recognition.

[BibT_eX]

[DOI]

Speech Commun., 2025

Revisiting SSL for sound event detection: complementary fusion and adaptive post-processing.

[BibT_eX]

[DOI]

J. King Saud Univ. Comput. Inf. Sci., 2025

Personalized Speech Enhancement without User Enrollment for Real-World Audio Replay Scenarios.

[BibT_eX]

[DOI]

Haoran Wei

Shilin Wang

Yanhua Long

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Leveraging Out-of-Domain Noise for Unsupervised Domain Adaptation in Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

SEF-PNet: Speaker Encoder-Free Personalized Speech Enhancement with Local and Global Contexts Aggregation.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

Improving low-complexity and real-time DeepFilterNet2 for personalized speech enhancement.

[BibT_eX]

[DOI]

Int. J. Speech Technol., June, 2024

ICSD: An Open-source Dataset for Infant Cry and Snoring Detection.

[BibT_eX]

[DOI]

CoRR, 2024

QMixCAT: Unsupervised Speech Enhancement Using Quality-guided Signal Mixing and Competitive Alternating Model Training.

[BibT_eX]

[DOI]

Shilin Wang

Haixin Guan

Yanhua Long

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Score Calibration Based on Consistency Measure Factor for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Accent-Specific Vector Quantization for Joint Unsupervised and Supervised Training in Accent Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Cross-Modal Parallel Training for Improving end-to-end Accented Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Boosting Character-based Mandarin ASR via Chinese Pinyin Representation.

[BibT_eX]

[DOI]

Int. J. Speech Technol., December, 2023

CI-Mix: cut instance mix for robust speaker verification.

[BibT_eX]

[DOI]

Yibo Duan

Yanhua Long

Yijie Li

Int. J. Speech Technol., December, 2023

Heterogeneous separation consistency training for adaptation of unsupervised speech separation.

[BibT_eX]

[DOI]

Jiangyu Han

Yanhua Long

EURASIP J. Audio Speech Music. Process., December, 2023

Dual-model self-regularization and fusion for domain adaptation of robust speaker verification.

[BibT_eX]

[DOI]

Yibo Duan

Yanhua Long

Jiaen Liang

Speech Commun., November, 2023

Acoustic-Sensing-Based Attribute-Driven Imbalanced Compensation for Anomalous Sound Detection without Machine Identity.

[BibT_eX]

[DOI]

Yifan Zhou

Yanhua Long

Haoran Wei

Sensors, November, 2023

Autoencoder with Group-based Decoder and Multi-task Optimization for Anomalous Sound Detection.

[BibT_eX]

[DOI]

CoRR, 2023

UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023.

[BibT_eX]

[DOI]

CoRR, 2023

Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Phonetic-assisted Multi-Target Units Modeling for Improving Conformer-Transducer ASR system.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Advanced RawNet2 with Attention-based Channel Masking for Synthetic Speech Detection.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

FEW-Shot Continual Learning with Weight Alignment and Positive Enhancement for Bioacoustic Event Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

New Acoustic Features for Synthetic and Replay Spoofing Attack Detection.

[BibT_eX]

[DOI]

Symmetry, 2022

Universal and accent-discriminative encoders for conformer-based accent-invariant speech recognition.

[BibT_eX]

[DOI]

Xuefei Wang

Yanhua Long

Dongxing Xu

Int. J. Speech Technol., 2022

Acoustic domain mismatch compensation in bird audio detection.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2022

Exploring single channel speech separation for short-time text-dependent speaker verification.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2022

Joint framework with deep feature distillation and adaptive focal loss for weakly supervised audio tagging and acoustic event detection.

[BibT_eX]

[DOI]

Digit. Signal Process., 2022

Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation and Extraction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Pronunciation augmentation for Mandarin-English code-switching speech recognition.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2021

Joint Weakly Supervised AT and AED Using Deep Feature Distillation and Adaptive Focal Loss.

[BibT_eX]

[DOI]

CoRR, 2021

Improving Channel Decorrelation for Multi-Channel Target Speech Extraction.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multi-Channel Target Speech Extraction with Channel Decorrelation and Target Speaker Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Attention-Based Scaling Adaptation for Target Speech Extraction.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

Mask-based blind source separation and MVDR beamforming in ASR.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2020

Attention-based scaling adaptation for target speech extraction.

[BibT_eX]

[DOI]

Jiangyu Han

Yanhua Long

Jiaen Liang

CoRR, 2020

Multi-Encoder-Decoder Transformer for Code-Switching Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-Based LVCSR.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

The SHNU System for Blizzard Challenge 2020.

[BibT_eX]

[DOI]

Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

2019

Large-Scale Semi-Supervised Training in Deep Learning Acoustic Model for ASR.

[BibT_eX]

[DOI]

IEEE Access, 2019

SHNU Anti-spoofing Systems for ASVspoof 2019 Challenge.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Offline to online speaker adaptation for real-time deep neural network based LVCSR systems.

[BibT_eX]

[DOI]

Yanhua Long

Yijie Li

Bo Zhang

Multim. Tools Appl., 2018

Keyword Spotting Based On CTC and RNN For Mandarin Chinese Speech.

[BibT_eX]

[DOI]

Yiyan Wang

Yanhua Long

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Active Learning for LF-MMI Trained Neural Networks in ASR.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

Articulatory movement features for short-duration text-dependent speaker verification.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2017

Domain adaptation of lattice-free MMI based TDNN models for speech recognition.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2017

Domain compensation based on phonetically discriminative features for speaker verification.

[BibT_eX]

[DOI]

Yanhua Long

Hong Ye

Jifeng Ni

Comput. Speech Lang., 2017

2016

Improvements on self-adaptive voice activity detector for telephone data.

[BibT_eX]

[DOI]

Haoran Wei

Yanhua Long

Hongwei Mao

Int. J. Speech Technol., 2016

2013

Improving lightly supervised training for broadcast transcription.

[BibT_eX]

[DOI]

Matthew Stephen Seigel

Philip C. Woodland

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Automatic Transcription of Multi-genre Media Archives.

[BibT_eX]

[DOI]

Matthew Stephen Seigel

Pawel Swietojanski

Philip C. Woodland

Proceedings of the First Workshop on Speech, 2013

2012

Transcription of multi-genre media archives using out-of-domain data.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

2011

Improvements in Speaker Characterization Using Spectral Subband Energy Based on Harmonic plus Noise Model.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Speaker characterization using spectral subband energy ratio based on Harmonic plus Noise Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

The description of iFlyTek Speech Lab system for NIST2009 Language Recognition Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Non-negative matrix factorization based discriminative features for speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Effects of the phonological relevance in speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

N-gram nearest neighbor algorithm for voice password system.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Exploiting prosodic information for Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

iFLY system for the NIST 2008 speaker recognition evaluation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Interfusing the Confused Region Score of Speaker Verification Systems.

[BibT_eX]

[DOI]

Yanhua Long

Wu Guo

Li-Rong Dai

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Yanhua Long

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...