Gaofeng Cheng

Orcid: 0000-0002-2102-6061

According to our database1, Gaofeng Cheng authored at least 39 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Boosting Cross-Domain Speech Recognition With Self-Supervision.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Interrelate Training and Clustering for Online Speaker Diarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

ASQ: An Ultra-Low Bit Rate ASR-Oriented Speech Quantization Method.
IEEE Signal Process. Lett., 2024

2023
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Speech Corpora Divergence Based Unsupervised Data Selection for ASR.
CoRR, 2023

2022
Self-Supervised Pre-Training for Attention-Based Encoder-Decoder ASR Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Alleviating ASR Long-Tailed Problem by Decoupling the Learning of Representation and Classification.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

ETEH: Unified Attention-Based End-to-End ASR and KWS Architecture.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

An E2E-ASR-Based Iteratively-Trained Timestamp Estimator.
IEEE Signal Process. Lett., 2022

A layered grouping random access scheme based on dynamic preamble selection for massive machine type communications.
Sci. China Inf. Sci., 2022

Sequence Distribution Matching for Unsupervised Domain Adaptation in ASR.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Summary On The ISCSLP 2022 Chinese-English Code-Switching ASR Challenge.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Decoupled Federated Learning for ASR with Non-IID Data.
Proceedings of the Interspeech 2022, 2022

Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR.
Proceedings of the Interspeech 2022, 2022

Improving Recognition of Out-of-vocabulary Words in E2E Code-switching ASR by Fusing Speech Generation Methods.
Proceedings of the Interspeech 2022, 2022

Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset.
Proceedings of the Interspeech 2022, 2022

Knowledge Distillation For CTC-based Speech Recognition Via Consistent Acoustic Representation Learning.
Proceedings of the Interspeech 2022, 2022

Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies.
Proceedings of the Interspeech 2022, 2022

Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization.
Proceedings of the Interspeech 2022, 2022

Improving Non-Autoregressive End-to-End Speech Recognition with Pre-Trained Acoustic and Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2022

Improving CTC-Based Speech Recognition Via Knowledge Transferring from Pre-Trained Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Keyword Search Using Attention-Based End-to-End ASR and Frame-Synchronous Phoneme Alignments.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Data Augmentation based Consistency Contrastive Pre-training for Automatic Speech Recognition.
CoRR, 2021

Wav2vec-S: Semi-Supervised Pre-Training for Speech Recognition.
CoRR, 2021

Non-autoregressive Deliberation-Attention based End-to-End ASR.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Text Data.
Proceedings of the IEEE International Conference on Acoustics, 2021

History Utterance Embedding Transformer LM for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Far-Field Speech Recognition Based on Complex-Valued Neural Networks and Inter-Frame Similarity Difference Method.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Transformer-Based Online CTC/Attention End-To-End Speech Recognition Architecture.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Automatic Speech Recognition System with Output-Gate Projected Gated Recurrent Unit.
IEICE Trans. Inf. Syst., 2019

Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition.
Proceedings of the Interspeech 2019, 2019

Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Bidirectional LSTM with Extended Input Context.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks.
Proceedings of the Interspeech 2018, 2018

Investigation on the Combination of Batch Normalization and Dropout in BLSTM-based Acoustic Modeling for ASR.
Proceedings of the Interspeech 2018, 2018

Output-Gate Projected Gated Recurrent Unit for Speech Recognition.
Proceedings of the Interspeech 2018, 2018

2017
An Exploration of Dropout with LSTMs.
Proceedings of the Interspeech 2017, 2017


  Loading...