Shaojin Ding

Orcid: 0000-0002-2108-3111

According to our database1, Shaojin Ding authored at least 25 papers between 2018 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models.
CoRR, 2023

RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models.
CoRR, 2023

Multi-Output RNN-T Joint Networks for Multi-Task Learning of ASR and Auxiliary Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2023

Conditional Conformer: Improving Speaker Modulation For Single And Multi-User Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

The Role of Feature Correlation on Quantized Neural Networks.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Efficient Cascaded Streaming ASR System Via Frame Rate Reduction.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning.
Comput. Speech Lang., 2022

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes.
Proceedings of the Interspeech 2022, 2022

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition.
Proceedings of the Interspeech 2022, 2022

4-bit Conformer with Native Quantization Aware Training for Speech Recognition.
Proceedings of the Interspeech 2022, 2022

Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Towards Lifelong Learning of Multilingual Text-to-Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Converting Foreign Accent Speech Without a Reference.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Textual Echo Cancellation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Learning Structured Sparse Representations for Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Personal VAD: Speaker-Conditioned Voice Activity Detection.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition.
Proceedings of the Interspeech 2020, 2020

AutoSpeech: Neural Architecture Search for Speaker Recognition.
Proceedings of the Interspeech 2020, 2020

2019
Golden speaker builder - An interactive tool for pronunciation training.
Speech Commun., 2019

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams.
Proceedings of the Interspeech 2019, 2019

Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion.
Proceedings of the Interspeech 2019, 2019

ABD-Net: Attentive but Diverse Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function.
Proceedings of the Interspeech 2018, 2018

Learning Structured Dictionaries for Exemplar-based Voice Conversion.
Proceedings of the Interspeech 2018, 2018


  Loading...