Qingyang Hong

Orcid: 0000-0001-7380-8690

According to our database1, Qingyang Hong authored at least 63 papers between 2001 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
MM-TTS: Multi-Modal Prompt Based Style Transfer for Expressive Text-to-Speech Synthesis.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
ReFlow-TTS: A Rectified Flow Model for High-fidelity Text-to-Speech.
CoRR, 2023

Interpretable Style Transfer for Text-to-Speech with ControlVAE and Diffusion Bridge.
CoRR, 2023

Meta Learning with Adaptive Loss Weight for Low-Resource Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Community Detection Graph Convolutional Network for Overlap-Aware Speaker Diarization.
Proceedings of the IEEE International Conference on Acoustics, 2023

Towards A Unified Conformer Structure: from ASR to ASV Task.
Proceedings of the IEEE International Conference on Acoustics, 2023

The XMU System for Audio-Visual Diarization and Recognition in MISP Challenge 2022.
Proceedings of the IEEE International Conference on Acoustics, 2023

Unsupervised Speaker Verification Using Pre-Trained Model and Label Correction.
Proceedings of the IEEE International Conference on Acoustics, 2023

CASA-Net: Cross-attention and Self-attention for End-to-End Audio-visual Speaker Diarization.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
When Speaker Recognition Meets Noisy Labels: Optimizations for Front-Ends and Back-Ends.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Spatial-aware Speaker Diarization for Multi-channel Multi-party Meeting.
CoRR, 2022

The xmuspeech system for multi-channel multi-party meeting transcription challenge.
CoRR, 2022

Respiratory Sound Classification: From Fluid-Solid Coupling Analysis to Feature-Band Attention.
IEEE Access, 2022

Deep Representation Decomposition for Rate-Invariant Speaker Verification.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Towards Language-universal Mandarin-English Speech Recognition with Unsupervised Label Synchronous Adaptation.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Oriental Language Recognition (OLR) 2021: Summary and Analysis.
Proceedings of the Interspeech 2022, 2022

Spatial-aware Speaker Diarizaiton for Multi-channel Multi-party Meeting.
Proceedings of the Interspeech 2022, 2022

Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Deep joint learning for language recognition.
Neural Networks, 2021

XMUSPEECH System for VoxCeleb Speaker Recognition Challenge 2021.
CoRR, 2021

Phoneme-aware and Channel-wise Attentive Learning for Text DependentSpeaker Verification.
CoRR, 2021

Multi-Feature Learning with Canonical Correlation Analysis Constraint for Text-Independent Speaker Verification.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Lightspeech: Lightweight Non-Autoregressive Multi-Speaker Text-To-Speech.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Automatic Error Correction for Speaker Embedding Learning with Noisy Labels.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Phoneme-Aware and Channel-Wise Attentive Learning for Text Dependent Speaker Verification.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

An Integrated Framework for Two-Pass Personalized Voice Trigger.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Oriental Language Recognition (OLR) 2020: Summary and Analysis.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Real-Time End-to-End Monaural Multi-Speaker Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Additive Phoneme-Aware Margin Softmax Loss for Language Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

ASV-SUBTOOLS: Open Source Toolkit for Automatic Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

End-To-End Multi-Accent Speech Recognition with Unsupervised Accent Modelling.
Proceedings of the IEEE International Conference on Acoustics, 2021

Light-TTS: Lightweight Multi-Speaker Multi-Lingual Text-to-Speech.
Proceedings of the IEEE International Conference on Acoustics, 2021

OLR 2021 Challenge: Datasets, Rules and Baselines.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
The XMUSPEECH System for the AP19-OLR Challenge.
Proceedings of the Interspeech 2020, 2020

On the Usage of Multi-Feature Integration for Speaker Verification and Language Identification.
Proceedings of the Interspeech 2020, 2020

Improving Transformer-Based Speech Recognition with Unsupervised Pre-Training and Multi-Task Semantic Knowledge Learning.
Proceedings of the Interspeech 2020, 2020

The XMUSPEECH System for Short-Duration Speaker Verification Challenge 2020.
Proceedings of the Interspeech 2020, 2020

XMU-TS Systems for NIST SRE19 CTS Challenge.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

AP20-OLR Challenge: Three Tasks and Their Baselines.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Systematic analysis and prediction of type IV secreted effector proteins by machine learning approaches.
Briefings Bioinform., 2019

Deep Speaker Embedding Extraction with Channel-Wise Feature Responses and Additive Supervision Softmax Loss Function.
Proceedings of the Interspeech 2019, 2019

Anti-Spoofing Speaker Verification System with Multi-Feature Integration and Multi-Task Learning.
Proceedings of the Interspeech 2019, 2019

Training Multi-task Adversarial Network for Extracting Noise-robust Speaker Embedding.
Proceedings of the IEEE International Conference on Acoustics, 2019

Extraction of Noise-Robust Speaker Embedding Based on Generative Adversarial Networks.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Phone-Aware Multi-task Learning and Length Expanding for Short-Duration Language Recognition.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Speaker Embedding Extraction with Multi-feature Integration Structure.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Electroencephalogram-based brain-computer interface for the Chinese spelling system: a survey.
Frontiers Inf. Technol. Electron. Eng., 2018

2017
Transfer learning for PLDA-based speaker verification.
Speech Commun., 2017

2016
Classification between normal and adventitious lung sounds using deep neural network.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Speech enhancement based on nonparametric factor analysis.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Transfer Learning for Speaker Verification on Short Utterances.
Proceedings of the Interspeech 2016, 2016

Advancement in the EEG-Based Chinese Spelling Systems.
Proceedings of the Intelligent Robotics and Applications - 9th International Conference, 2016

A transfer learning method for PLDA-based speaker verification.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Modified-prior PLDA and score calibration for duration mismatch compensation in speaker recognition system.
Proceedings of the INTERSPEECH 2015, 2015

Duration dependent covariance regularization in PLDA modeling for speaker verification.
Proceedings of the INTERSPEECH 2015, 2015

2014
A Robust Speaker-Adaptive and Text-Prompted Speaker Verification System.
Proceedings of the Biometric Recognition - 9th Chinese Conference, 2014

2012
Fuzzy neural network based dynamic path planning.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2012

2009
A word alignment model based on multiobjective evolutionary algorithms.
Comput. Math. Appl., 2009

2008
A chunk-based reordering model for phrase-based SMT systems.
Proceedings of the 4th International Conference on Natural Language Processing and Knowledge Engineering, 2008

Incorporating syntax-based language models in phrase-based SMT models.
Proceedings of the 3rd International Conference on Intelligent System and Knowledge Engineering, 2008

2007
Translation Memory Sharing Models in XMCAT.
Proceedings of the 11th International Conference on Computer Supported Cooperative Work in Design, 2007

2004
Using Mel-Frequency Cepstral Coefficients in Missing Data Technique.
EURASIP J. Adv. Signal Process., 2004

2001
A hybrid method for syntactic and semantic structure disambiguation for Chinese.
Proceedings of the IEEE International Conference on Systems, 2001


  Loading...