Kaizhi Qian

According to our database1, Kaizhi Qian authored at least 29 papers between 2017 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling.
CoRR, 2023

Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning.
CoRR, 2023

Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning.
Proceedings of the International Conference on Machine Learning, 2023

Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Domain Generalization for Language-Independent Automatic Speech Recognition.
Frontiers Artif. Intell., 2022

Improving Self-Supervised Speech Representations by Disentangling Speakers.
CoRR, 2022

SpeechSplit 2.0: Unsupervised speech disentanglement for voice conversion Without tuning autoencoder Bottlenecks.
CoRR, 2022

Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition.
Proceedings of the Interspeech 2022, 2022

WavPrompt: Towards Few-Shot Spoken Language Understanding with Frozen Language Models.
Proceedings of the Interspeech 2022, 2022

ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers.
Proceedings of the International Conference on Machine Learning, 2022

On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022

SpeechSplit2.0: Unsupervised Speech Disentanglement for Voice Conversion without Tuning Autoencoder Bottlenecks.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Global Rhythm Style Transfer Without Text Transcriptions.
CoRR, 2021

PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Speech Denoising with Auditory Models.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Zero-Shot Cross-Lingual Phonetic Recognition with External Language Embedding.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Global Prosody Style Transfer Without Text Transcriptions.
Proceedings of the 38th International Conference on Machine Learning, 2021

Continuous Cnn For Nonuniform Time Series.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Deep generative models for speech editing
PhD thesis, 2020

Deep Network Perceptual Losses for Speech Denoising.
CoRR, 2020

Unsupervised Speech Decomposition via Triple Information Bottleneck.
Proceedings of the 37th International Conference on Machine Learning, 2020

F0-Consistent Many-To-Many Non-Parallel Voice Conversion Via Conditional Autoencoder.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
An Efficient and Margin-Approaching Zero-Confidence Adversarial Attack.
CoRR, 2019

Zero-Shot Voice Style Transfer with Only Autoencoder Loss.
CoRR, 2019

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss.
Proceedings of the 36th International Conference on Machine Learning, 2019

Monaural Singing Voice Separation Using Fusion-Net with Time-Frequency Masking.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Deep Learning Based Speech Beamforming.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Speech Enhancement Using Bayesian Wavenet.
Proceedings of the Interspeech 2017, 2017


  Loading...