Qingyang Hong

Orcid: 0000-0001-7380-8690

According to our database, Qingyang Hong authored at least 97 papers between 2001 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

AlphaFlowTSE: One-Step Generative Target Speaker Extraction via Conditional AlphaFlow.

CoRR, March, 2026

2025

SyncVoice: Towards Video Dubbing with Vision-Augmented Pretrained TTS Model.

CoRR, December, 2025

UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models.

CoRR, October, 2025

Phoenix-VAD: Streaming Semantic Endpoint Detection for Full-Duplex Speech Interaction.

CoRR, September, 2025

XMUspeech Systems for the ASVspoof 5 Challenge.

CoRR, September, 2025

MeanFlowSE: one-step generative speech enhancement via conditional mean flow.

CoRR, September, 2025

DiTReducio: A Training-Free Acceleration for DiT-Based TTS via Progressive Calibration.

CoRR, September, 2025

Cross-attention and Self-attention for Audio-visual Speaker Diarization in MISP-Meeting Challenge.

CoRR, June, 2025

ReFlow-VC: Zero-shot Voice Conversion Based on Rectified Flow and Speaker Feature Optimization.

CoRR, June, 2025

A Two-Stage Hierarchical Deep Filtering Framework for Real-Time Speech Enhancement.

CoRR, June, 2025

DS-Codec: Dual-Stage Training with Mirror-to-NonMirror Architecture Switching for Speech Codec.

CoRR, May, 2025

SlimSpeech: Lightweight and Efficient Text-to-Speech with Slim Rectified Flow.

CoRR, April, 2025

Discl-VC: Disentangled Discrete Tokens and In-Context Learning for Controllable Zero-Shot Voice Conversion.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

ReFlow-VC: Zero-shot Voice Conversion Based on Rectified Flow and Speaker Feature Optimization.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Pseudo Labels-based Neural Speech Enhancement for the AVSR Task in the MISP-Meeting Challenge.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

SuPseudo: A Pseudo-supervised Learning Method for Neural Speech Enhancement in Far-field Speech Recognition.

Longjie Luo

Lin Li

Qingyang Hong

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

A Two-Stage Hierarchical Deep Filtering Framework for Real-Time Speech Enhancement.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Cross-attention and Self-attention for Audio-visual Speaker Diarization in MISP-Meeting Challenge.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Speaker Diarization with Overlapping Community Detection Using Graph Attention Networks and Label Propagation Algorithm.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

DS-Codec: Dual-Stage Training with Mirror-to-NonMirror Architecture Switching for Speech Codec.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

SlimSpeech: Lightweight and Efficient Text-to-Speech with Slim Rectified Flow.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Dynamic Language Group-based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Continual Audio Deepfake Detection via Universal Adversarial Perturbation.

Wangjie Li

Lin Li

Qingyang Hong

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

2024

Dynamic Language Group-Based MoE: Enhancing Efficiency and Flexibility for Code-Switching Speech Recognition.

CoRR, 2024

LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation.

CoRR, 2024

Enhancing Code-Switching Speech Recognition With LID-Based Collaborative Mixture of Experts Model.

Proceedings of the IEEE Spoken Language Technology Workshop, 2024

MinSpeech: A Corpus of Southern Min Dialect for Automatic Speech Recognition.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Efficient Integrated Features Based on Pre-trained Models for Speaker Verification.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

The Xmuspeech System for Audio-Visual Target Speaker Extraction in Misp 2023 Challenge.

Proceedings of the IEEE International Conference on Acoustics, 2024

Improving Multi-Speaker ASR With Overlap-Aware Encoding And Monotonic Attention.

Proceedings of the IEEE International Conference on Acoustics, 2024

SR-HuBERT: An Efficient Pre-Trained Model for Speaker Verification.

Proceedings of the IEEE International Conference on Acoustics, 2024

Reflow-TTS: A Rectified Flow Model for High-Fidelity Text-to-Speech.

Proceedings of the IEEE International Conference on Acoustics, 2024

MM-TTS: Multi-Modal Prompt Based Style Transfer for Expressive Text-to-Speech Synthesis.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Conformer-based Language Embedding with Self-Knowledge Distillation for Spoken Language Identification.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Cross-Modal Semantic Alignment before Fusion for Two-Pass End-to-End Spoken Language Understanding.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Interpretable Style Transfer for Text-to-Speech with ControlVAE and Diffusion Bridge.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Meta Learning with Adaptive Loss Weight for Low-Resource Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2023

Community Detection Graph Convolutional Network for Overlap-Aware Speaker Diarization.

Proceedings of the IEEE International Conference on Acoustics, 2023

Towards A Unified Conformer Structure: from ASR to ASV Task.

Proceedings of the IEEE International Conference on Acoustics, 2023

The XMU System for Audio-Visual Diarization and Recognition in MISP Challenge 2022.

Proceedings of the IEEE International Conference on Acoustics, 2023

Unsupervised Speaker Verification Using Pre-Trained Model and Label Correction.

Proceedings of the IEEE International Conference on Acoustics, 2023

CASA-Net: Cross-attention and Self-attention for End-to-End Audio-visual Speaker Diarization.

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022

When Speaker Recognition Meets Noisy Labels: Optimizations for Front-Ends and Back-Ends.

Lin Li

Fuchuan Tong

Qingyang Hong

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Spatial-aware Speaker Diarization for Multi-channel Multi-party Meeting.

CoRR, 2022

The xmuspeech system for multi-channel multi-party meeting transcription challenge.

CoRR, 2022

Respiratory Sound Classification: From Fluid-Solid Coupling Analysis to Feature-Band Attention.

IEEE Access, 2022

Deep Representation Decomposition for Rate-Invariant Speaker Verification.

Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Towards Language-universal Mandarin-English Speech Recognition with Unsupervised Label Synchronous Adaptation.

Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Oriental Language Recognition (OLR) 2021: Summary and Analysis.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Spatial-aware Speaker Diarization for Multi-channel Multi-party Meeting.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data.

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Deep joint learning for language recognition.

Neural Networks, 2021

XMUSPEECH System for VoxCeleb Speaker Recognition Challenge 2021.

CoRR, 2021

Phoneme-aware and Channel-wise Attentive Learning for Text Dependent Speaker Verification.

CoRR, 2021

Multi-Feature Learning with Canonical Correlation Analysis Constraint for Text-Independent Speaker Verification.

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Lightspeech: Lightweight Non-Autoregressive Multi-Speaker Text-To-Speech.

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Automatic Error Correction for Speaker Embedding Learning with Noisy Labels.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Phoneme-Aware and Channel-Wise Attentive Learning for Text Dependent Speaker Verification.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

An Integrated Framework for Two-Pass Personalized Voice Trigger.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Oriental Language Recognition (OLR) 2020: Summary and Analysis.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Real-Time End-to-End Monaural Multi-Speaker Speech Recognition.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Additive Phoneme-Aware Margin Softmax Loss for Language Recognition.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

ASV-SUBTOOLS: Open Source Toolkit for Automatic Speaker Verification.

Proceedings of the IEEE International Conference on Acoustics, 2021

End-To-End Multi-Accent Speech Recognition with Unsupervised Accent Modelling.

Proceedings of the IEEE International Conference on Acoustics, 2021

Light-TTS: Lightweight Multi-Speaker Multi-Lingual Text-to-Speech.

Proceedings of the IEEE International Conference on Acoustics, 2021

OLR 2021 Challenge: Datasets, Rules and Baselines.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

The XMUSPEECH System for the AP19-OLR Challenge.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

On the Usage of Multi-Feature Integration for Speaker Verification and Language Identification.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Improving Transformer-Based Speech Recognition with Unsupervised Pre-Training and Multi-Task Semantic Knowledge Learning.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

The XMUSPEECH System for Short-Duration Speaker Verification Challenge 2020.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

XMU-TS Systems for NIST SRE19 CTS Challenge.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

AP20-OLR Challenge: Three Tasks and Their Baselines.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

Systematic analysis and prediction of type IV secreted effector proteins by machine learning approaches.

Jiawei Wang

Bingjiao Yang

Yi An

Tatiana T. Marquez-Lago

Briefings Bioinform., 2019

Deep Speaker Embedding Extraction with Channel-Wise Feature Responses and Additive Supervision Softmax Loss Function.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Anti-Spoofing Speaker Verification System with Multi-Feature Integration and Multi-Task Learning.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Training Multi-task Adversarial Network for Extracting Noise-robust Speaker Embedding.

Proceedings of the IEEE International Conference on Acoustics, 2019

Extraction of Noise-Robust Speaker Embedding Based on Generative Adversarial Networks.

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Phone-Aware Multi-task Learning and Length Expanding for Short-Duration Language Recognition.

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Speaker Embedding Extraction with Multi-feature Integration Structure.

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Electroencephalogram-based brain-computer interface for the Chinese spelling system: a survey.

Frontiers Inf. Technol. Electron. Eng., 2018

2017

Transfer learning for PLDA-based speaker verification.

Speech Commun., 2017

2016

Classification between normal and adventitious lung sounds using deep neural network.

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Speech enhancement based on nonparametric factor analysis.

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Transfer Learning for Speaker Verification on Short Utterances.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Advancement in the EEG-Based Chinese Spelling Systems.

Proceedings of the Intelligent Robotics and Applications - 9th International Conference, 2016

A transfer learning method for PLDA-based speaker verification.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Modified-prior PLDA and score calibration for duration mismatch compensation in speaker recognition system.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Duration dependent covariance regularization in PLDA modeling for speaker verification.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014

A Robust Speaker-Adaptive and Text-Prompted Speaker Verification System.

Qingyang Hong

Sheng Wang

Zhijian Liu

Proceedings of the Biometric Recognition - 9th Chinese Conference, 2014

2012

Fuzzy neural network based dynamic path planning.

Proceedings of the International Conference on Machine Learning and Cybernetics, 2012

2009

A word alignment model based on multiobjective evolutionary algorithms.

Comput. Math. Appl., 2009

2008

A chunk-based reordering model for phrase-based SMT systems.

Proceedings of the 4th International Conference on Natural Language Processing and Knowledge Engineering, 2008

Incorporating syntax-based language models in phrase-based SMT models.

Proceedings of the 3rd International Conference on Intelligent System and Knowledge Engineering, 2008

2007

Translation Memory Sharing Models in XMCAT.

Proceedings of the 11th International Conference on Computer Supported Cooperative Work in Design, 2007

2004

Using Mel-Frequency Cepstral Coefficients in Missing Data Technique.

EURASIP J. Adv. Signal Process., 2004

2001

A hybrid method for syntactic and semantic structure disambiguation for Chinese.

Proceedings of the IEEE International Conference on Systems, 2001
