Liang He

Orcid: 0000-0003-4076-7479

Affiliations:
  • Tsinghua University, Department of Electronic Engineering, TNLIST, Beijing, China


According to our database1, Liang He authored at least 73 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
IIFC-Net: A Monaural Speech Enhancement Network With High-Order Information Interaction and Feature Calibration.
IEEE Signal Process. Lett., 2024

2023
Audio-Visual Fusion Based on Interactive Attention for Person Verification.
Sensors, December, 2023

W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision.
EURASIP J. Audio Speech Music. Process., December, 2023

Eval-GCSC: A New Metric for Evaluating ChatGPT's Performance in Chinese Spelling Correction.
CoRR, 2023

MAKBQA: Multi-hop Knowledge Base Question Answering System Based on Sensors and Internet Agricultural Data.
Proceedings of the 20th Annual IEEE International Conference on Sensing, 2023

GhostVec: A New Threat to Speaker Privacy of End-to-End Speech Recognition System.
Proceedings of the ACM Multimedia Asia 2023, 2023

Reprogramming Self-supervised Learning-based Speech Representations for Speaker Anonymization.
Proceedings of the ACM Multimedia Asia 2023, 2023

CRA-DIFFUSE: Improved Cross-Domain Speech Enhancement Based on Diffusion Model with T-F Domain Pre-Denoising.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Speech Topic Classification Based on Pre-trained and Graph Networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

A Joint Network Based on Interactive Attention for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

2022
Hierarchic Temporal Convolutional Network With Cross-Domain Encoder for Music Source Separation.
IEEE Signal Process. Lett., 2022

A bimodal network based on Audio-Text-Interactional-Attention with ArcFace loss for speech emotion recognition.
Speech Commun., 2022

Multi-stage music separation network with dual-branch attention and hybrid convolution.
J. Intell. Inf. Syst., 2022

OR-Gate: A Noisy Label Filtering Method for Speaker Verification.
CoRR, 2022

I4U System Description for NIST SRE'20 CTS Challenge.
CoRR, 2022

THUEE system description for NIST 2020 SRE CTS challenge.
CoRR, 2022

How to Boost Anti-Spoofing with X-Vectors.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

End-to-end speech topic classification based on pre-trained model Wavlm.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

A Multi-grained based Attention Network for Semi-supervised Sound Event Detection.
Proceedings of the Interspeech 2022, 2022

A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition.
Proceedings of the Interspeech 2022, 2022

Virtual Fully-Connected Layer for a Large-Scale Speaker Verification Dataset.
Proceedings of the Biometric Recognition - 16th Chinese Conference, 2022

2021
End-to-End Cross-Lingual Spoken Language Understanding Model with Multilingual Pretraining.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Improved Lightcnn with Attention Modules for Asv Spoofing Detection.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

2020
Adaptive Multi-Scale Detection of Acoustic Events.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

MTF-CRNN: Multiscale Time-Frequency Convolutional Recurrent Neural Network for Sound Event Detection.
IEEE Access, 2020

Combined Vector Based on Factorized Time-delay Neural Network for Text-Independent Speaker Recognition.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

A Joint Detection-Classification Model for Weakly Supervised Sound Event Detection Using Multi-Scale Attention Method.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2020

THUEE System for NIST SRE19 CTS Challenge.
Proceedings of the Interspeech 2020, 2020

2019
Distance-Dependent Metric Learning.
IEEE Signal Process. Lett., 2019

Latent class model with application to speaker diarization.
EURASIP J. Audio Speech Music. Process., 2019

THUEE system description for NIST 2019 SRE CTS Challenge.
CoRR, 2019

Large Margin Softmax Loss for Speaker Verification.
Proceedings of the Interspeech 2019, 2019

Multi-objective Optimization Training of PLDA for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2019

Geometric Discriminant Analysis for I-vector Based Speaker Verification.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Local Pairwise Linear Discriminant Analysis for Speaker Verification.
IEEE Signal Process. Lett., 2018

Semi-supervised minimum redundancy maximum relevance feature selection for audio classification.
Multim. Tools Appl., 2018

Multiobjective Optimization Training of PLDA for Speaker Verification.
CoRR, 2018

Defect characterization of amorphous silicon thin film solar cell based on low frequency noise.
Sci. China Inf. Sci., 2018

VB-HMM Speaker Diarization with Enhanced and Refined Segment Representation.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Latent Class Model for Single Channel Speaker Diarization.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Exploring a Unified Attention-Based Pooling Framework for Speaker Verification.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Parallel Double Audio Fingerprinting.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Speaker Embedding Extraction with Phonetic Information.
Proceedings of the Interspeech 2018, 2018

Investigation of Frame Alignments for GMM-based Digit-prompted Speaker Verification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Fading channel modelling using single-hidden layer feedforward neural networks.
Multidimens. Syst. Signal Process., 2017

Investigation of Frame Alignments for GMM-based Text-prompted Speaker Verification.
CoRR, 2017

Ivec-PLDA-AHC priors for VB-HMM speaker diarization system.
Proceedings of the 2017 IEEE International Workshop on Signal Processing Systems, 2017

Deep neural networks based speaker modeling at different levels of phonetic granularity.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Comparison of multiple features and modeling methods for text-dependent speaker verification.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Semi-supervised feature selection for audio classification based on constraint compensated Laplacian score.
EURASIP J. Audio Speech Music. Process., 2016

Voice activity detection algorithm based on long-term pitch information.
EURASIP J. Audio Speech Music. Process., 2016

Investigation of Senone-based Long-Short Term Memory RNNs for Spoken Language Recognition.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

A study of variational method for text-independent speaker recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Improving Deep Neural Networks Based Speaker Verification Using Unlabeled Data.
Proceedings of the Interspeech 2016, 2016

Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge.
Proceedings of the Interspeech 2016, 2016

THU-EE System Description for NIST LRE 2015.
Proceedings of the Interspeech 2016, 2016

2015
Convolutional maxout neural networks for speech separation.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2015

Investigation of bottleneck features and multilingual deep neural networks for speaker verification.
Proceedings of the INTERSPEECH 2015, 2015

Simultaneous utilization of spectral magnitude and phase information to extract supervectors for speaker verification anti-spoofing.
Proceedings of the INTERSPEECH 2015, 2015

Stacked bottleneck features for speaker verification.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

PRISM: A statistical modeling framework for text-independent speaker verification.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

2014
Speaker verification using Fisher vector.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Improved multitaper PNCC feature for robust speaker verification.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

2013
I-matrix for text-independent speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

THUEE system for the Albayzin 2012 language recognition evaluation.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

2012
Complementary combination in i-vector level for language recognition.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Orthogonal Subspace Combination Based on the Joint Factor Analysis for Text-Independent Speaker Recognition.
Proceedings of the Biometric Recognition - 7th Chinese Conference, 2012

2011
Time-Frequency Cepstral Features and Heteroscedastic Linear Discriminant Analysis for Language Recognition.
IEEE Trans. Speech Audio Process., 2011

2010
Multi-feature combination for speaker recognition.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Variant time-frequency cepstral features for speaker recognition.
Proceedings of the INTERSPEECH 2010, 2010

2009
Research on Intersession Variability Compensation for MLLR-SVM Speaker Recognition.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2009

2008
Fractional Fourier transform based auditory feature for language identification.
Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2008

Channel compensation technology in differential GSV-SVM speaker verification system.
Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2008


  Loading...