Gaofeng Cheng

Orcid: 0000-0002-2102-6061

According to our database, Gaofeng Cheng has authored at least 50 papers between 2017 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

UniECG: Understanding and Generating ECG in One Unified Model.

CoRR, September, 2025

Pinyin-Guided Chinese Speech Recognition with Large Language Model.

Jie Zhengjie

Gaofeng Cheng

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Hybrid Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

Automatic Text Pronunciation Correlation Generation and Application for Contextual Biasing.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

HIPA-MoE: A Parameter-Efficient Fine-Tuning Architecture with Hierarchical Adapter-Based Mixture-Of-Experts for Multilingual ASR.

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2025

2024

Boosting Cross-Domain Speech Recognition With Self-Supervision.

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Interrelate Training and Clustering for Online Speaker Diarization.

IEEE ACM Trans. Audio Speech Lang. Process., 2024

ASQ: An Ultra-Low Bit Rate ASR-Oriented Speech Quantization Method.

IEEE Signal Process. Lett., 2024

Conversational Short-Phrase Speaker Diarization via Self-Adjusting Speech Segmentation and Embedding Extraction.

Haitian Lu

Gaofeng Cheng

Yonghong Yan

IEEE Signal Process. Lett., 2024

Factorized and progressive knowledge distillation for CTC-based ASR models.

Speech Commun., 2024

Transliterated Zero-Shot Domain Adaptation for Automatic Speech Recognition.

CoRR, 2024

Query-by-Example Speech Search using Mamba and Random Offset Mixed Padding.

Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Contextual Biasing with Confidence-based Homophone Detector for Mandarin End-to-End Speech Recognition.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023

Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition.

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Speech Corpora Divergence Based Unsupervised Data Selection for ASR.

CoRR, 2023

2022

Self-Supervised Pre-Training for Attention-Based Encoder-Decoder ASR Model.

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Alleviating ASR Long-Tailed Problem by Decoupling the Learning of Representation and Classification.

IEEE ACM Trans. Audio Speech Lang. Process., 2022

ETEH: Unified Attention-Based End-to-End ASR and KWS Architecture.

IEEE ACM Trans. Audio Speech Lang. Process., 2022

An E2E-ASR-Based Iteratively-Trained Timestamp Estimator.

IEEE Signal Process. Lett., 2022

A layered grouping random access scheme based on dynamic preamble selection for massive machine type communications.

Sci. China Inf. Sci., 2022

Sequence Distribution Matching for Unsupervised Domain Adaptation in ASR.

Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Summary On The ISCSLP 2022 Chinese-English Code-Switching ASR Challenge.

Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines.

Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Decoupled Federated Learning for ASR with Non-IID Data.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Improving Recognition of Out-of-vocabulary Words in E2E Code-switching ASR by Fusing Speech Generation Methods.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Knowledge Distillation For CTC-based Speech Recognition Via Consistent Acoustic Representation Learning.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Improving Non-Autoregressive End-to-End Speech Recognition with Pre-Trained Acoustic and Language Models.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Improving CTC-Based Speech Recognition Via Knowledge Transferring from Pre-Trained Language Models.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

2021

Keyword Search Using Attention-Based End-to-End ASR and Frame-Synchronous Phoneme Alignments.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Data Augmentation based Consistency Contrastive Pre-training for Automatic Speech Recognition.

CoRR, 2021

Wav2vec-S: Semi-Supervised Pre-Training for Speech Recognition.

CoRR, 2021

Non-autoregressive Deliberation-Attention based End-to-End ASR.

Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Text Data.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

History Utterance Embedding Transformer LM for Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

Far-Field Speech Recognition Based on Complex-Valued Neural Networks and Inter-Frame Similarity Difference Method.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture.

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Transformer-Based Online CTC/Attention End-To-End Speech Recognition Architecture.

Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

2019

Automatic Speech Recognition System with Output-Gate Projected Gated Recurrent Unit.

Gaofeng Cheng

Pengyuan Zhang

Ji Xu

IEICE Trans. Inf. Syst., 2019

Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation.

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Bidirectional LSTM with Extended Input Context.

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Investigation on the Combination of Batch Normalization and Dropout in BLSTM-based Acoustic Modeling for ASR.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Output-Gate Projected Gated Recurrent Unit for Speech Recognition.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

An Exploration of Dropout with LSTMs.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
