Yanhua Long

Orcid: 0000-0003-0924-408X

According to our database1, Yanhua Long authored at least 49 papers between 2008 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Boosting Character-based Mandarin ASR via Chinese Pinyin Representation.
Int. J. Speech Technol., December, 2023

CI-Mix: cut instance mix for robust speaker verification.
Int. J. Speech Technol., December, 2023

Heterogeneous separation consistency training for adaptation of unsupervised speech separation.
EURASIP J. Audio Speech Music. Process., December, 2023

Dual-model self-regularization and fusion for domain adaptation of robust speaker verification.
Speech Commun., November, 2023

Acoustic-Sensing-Based Attribute-Driven Imbalanced Compensation for Anomalous Sound Detection without Machine Identity.
Sensors, November, 2023

Autoencoder with Group-based Decoder and Multi-task Optimization for Anomalous Sound Detection.
CoRR, 2023

UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023.
CoRR, 2023

Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition.
CoRR, 2023

FEW-Shot Continual Learning with Weight Alignment and Positive Enhancement for Bioacoustic Event Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
New Acoustic Features for Synthetic and Replay Spoofing Attack Detection.
Symmetry, 2022

Universal and accent-discriminative encoders for conformer-based accent-invariant speech recognition.
Int. J. Speech Technol., 2022

Acoustic domain mismatch compensation in bird audio detection.
Int. J. Speech Technol., 2022

Exploring single channel speech separation for short-time text-dependent speaker verification.
Int. J. Speech Technol., 2022

Joint framework with deep feature distillation and adaptive focal loss for weakly supervised audio tagging and acoustic event detection.
Digit. Signal Process., 2022

Phonetic-assisted Multi-Target Units Modeling for Improving Conformer-Transducer ASR system.
CoRR, 2022

Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection.
Proceedings of the Interspeech 2022, 2022

PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement.
Proceedings of the Interspeech 2022, 2022

DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation and Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Pronunciation augmentation for Mandarin-English code-switching speech recognition.
EURASIP J. Audio Speech Music. Process., 2021

Joint Weakly Supervised AT and AED Using Deep Feature Distillation and Adaptive Focal Loss.
CoRR, 2021

Improving Channel Decorrelation for Multi-Channel Target Speech Extraction.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Multi-Channel Target Speech Extraction with Channel Decorrelation and Target Speaker Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Attention-Based Scaling Adaptation for Target Speech Extraction.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Mask-based blind source separation and MVDR beamforming in ASR.
Int. J. Speech Technol., 2020

Attention-based scaling adaptation for target speech extraction.
CoRR, 2020

Multi-Encoder-Decoder Transformer for Code-Switching Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-Based LVCSR.
Proceedings of the Interspeech 2020, 2020

2019
Large-Scale Semi-Supervised Training in Deep Learning Acoustic Model for ASR.
IEEE Access, 2019

SHNU Anti-spoofing Systems for ASVspoof 2019 Challenge.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Offline to online speaker adaptation for real-time deep neural network based LVCSR systems.
Multim. Tools Appl., 2018

Keyword Spotting Based On CTC and RNN For Mandarin Chinese Speech.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Active Learning for LF-MMI Trained Neural Networks in ASR.
Proceedings of the Interspeech 2018, 2018

2017
Articulatory movement features for short-duration text-dependent speaker verification.
Int. J. Speech Technol., 2017

Domain adaptation of lattice-free MMI based TDNN models for speech recognition.
Int. J. Speech Technol., 2017

Domain compensation based on phonetically discriminative features for speaker verification.
Comput. Speech Lang., 2017

2016
Improvements on self-adaptive voice activity detector for telephone data.
Int. J. Speech Technol., 2016

2013
Improving lightly supervised training for broadcast transcription.
Proceedings of the INTERSPEECH 2013, 2013

Automatic Transcription of Multi-genre Media Archives.
Proceedings of the First Workshop on Speech, 2013

2012
Transcription of multi-genre media archives using out-of-domain data.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

2011
Improvements in Speaker Characterization Using Spectral Subband Energy Based on Harmonic plus Noise Model.
Proceedings of the INTERSPEECH 2011, 2011

Speaker characterization using spectral subband energy ratio based on Harmonic plus Noise Model.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
The description of iFlyTek Speech Lab system for NIST2009 Language Recognition Evaluation.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Non-negative matrix factorization based discriminative features for speaker verification.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Effects of the phonological relevance in speaker verification.
Proceedings of the INTERSPEECH 2010, 2010

N-gram nearest neighbor algorithm for voice password system.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Exploiting prosodic information for Speaker Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

iFLY system for the NIST 2008 speaker recognition evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Interfusing the Confused Region Score of Speaker Verification Systems.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008


  Loading...