Qiao Tian

Orcid: 0000-0002-4078-1273

Affiliations:
  • ByteDance, AI Research Lab, Shanghai, China


According to our database, Qiao Tian authored at least 32 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Multi-Level Temporal-Channel Speaker Retrieval for Zero-Shot Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

AudioLDM 2: Learning Holistic Audio Generation With Self-Supervised Pretraining.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

U-Style: Cascading U-Nets With Multi-Level Speaker and Style Modeling for Zero-Shot Voice Cloning.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Joint Multiscale Cross-Lingual Speaking Style Transfer With Bidirectional Attention Mechanism for Automatic Dubbing.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

A Unified Front-End Framework for English Text-to-Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2024

AudioSR: Versatile Audio Super-Resolution at Scale.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
MSM-VC: High-Fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-Scale Style Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

DiCLET-TTS: Diffusion Model Based Cross-Lingual Emotion Transfer for Text-to-Speech - A Study Between English and Mandarin.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

LM-VC: Zero-Shot Voice Conversion via Speech Generation Based on Language Models.
IEEE Signal Process. Lett., 2023

PolyVoice: Language Models for Speech to Speech Translation.
CoRR, 2023

Multi-level Temporal-channel Speaker Retrieval for Robust Zero-shot Voice Conversion.
CoRR, 2023

Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing.
CoRR, 2023

Efficient Neural Music Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Zero-Shot Accent Conversion using Pseudo Siamese Disentanglement Network.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Delivering Speaking Style in Low-Resource Voice Conversion with Multi-Factor Constraints.
Proceedings of the IEEE International Conference on Acoustics, 2023

Streaming Voice Conversion via Intermediate Bottleneck Features and Non-Streaming Teacher Guidance.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Controllable and Lossless Non-Autoregressive End-to-End Text-to-Speech.
CoRR, 2022

Inferring Speaking Styles from Multi-modal Conversational Context by Multi-scale Relational Graph Convolutional Networks.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Neural Vocoder is All You Need for Speech Super-resolution.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

NeuFA: Neural Network Based End-to-End Forced Alignment with Bidirectional Attention Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2022

Cloning One's Voice Using Very Limited Data in the Wild.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Neural Dubber: Dubbing for Silent Videos According to Scripts.
CoRR, 2021

VoiceFixer: Toward General Speech Restoration With Neural Vocoder.
CoRR, 2021

FeatherTTS: Robust and Efficient attention based Neural TTS.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

Neural Dubber: Dubbing for Videos According to Scripts.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
AdaDurIAN: Few-shot Adaptation for Neural Text-to-Speech with DurIAN.
CoRR, 2020

FeatherWave: An Efficient High-Fidelity Neural Vocoder with Multi-Band Linear Prediction.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

The Tencent speech synthesis system for Blizzard Challenge 2020.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

2019
Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

The Tencent speech synthesis system for Blizzard Challenge 2019.
Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019

2018
Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder.
CoRR, 2018

