Qiao Tian

ORCID: 0000-0002-4078-1273

Affiliations:

ByteDance, AI Research Lab, Shanghai, China

According to our database, Qiao Tian authored at least 32 papers between 2018 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2024

Multi-Level Temporal-Channel Speaker Retrieval for Zero-Shot Voice Conversion.

IEEE ACM Trans. Audio Speech Lang. Process., 2024

AudioLDM 2: Learning Holistic Audio Generation With Self-Supervised Pretraining.

IEEE ACM Trans. Audio Speech Lang. Process., 2024

U-Style: Cascading U-Nets With Multi-Level Speaker and Style Modeling for Zero-Shot Voice Cloning.

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Joint Multiscale Cross-Lingual Speaking Style Transfer With Bidirectional Attention Mechanism for Automatic Dubbing.

IEEE ACM Trans. Audio Speech Lang. Process., 2024

A Unified Front-End Framework for English Text-to-Speech Synthesis.

Proceedings of the IEEE International Conference on Acoustics, 2024

AudioSR: Versatile Audio Super-Resolution at Scale.

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

MSM-VC: High-Fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-Scale Style Modeling.

IEEE ACM Trans. Audio Speech Lang. Process., 2023

DiCLET-TTS: Diffusion Model Based Cross-Lingual Emotion Transfer for Text-to-Speech - A Study Between English and Mandarin.

IEEE ACM Trans. Audio Speech Lang. Process., 2023

LM-VC: Zero-Shot Voice Conversion via Speech Generation Based on Language Models.

IEEE Signal Process. Lett., 2023

PolyVoice: Language Models for Speech to Speech Translation.

CoRR, 2023

Multi-level Temporal-channel Speaker Retrieval for Robust Zero-shot Voice Conversion.

CoRR, 2023

Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing.

CoRR, 2023

Efficient Neural Music Generation.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Zero-Shot Accent Conversion using Pseudo Siamese Disentanglement Network.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Delivering Speaking Style in Low-Resource Voice Conversion with Multi-Factor Constraints.

Proceedings of the IEEE International Conference on Acoustics, 2023

Streaming Voice Conversion via Intermediate Bottleneck Features and Non-Streaming Teacher Guidance.

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Controllable and Lossless Non-Autoregressive End-to-End Text-to-Speech.

CoRR, 2022

Inferring Speaking Styles from Multi-modal Conversational Context by Multi-scale Relational Graph Convolutional Networks.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Neural Vocoder is All You Need for Speech Super-resolution.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

NeuFA: Neural Network Based End-to-End Forced Alignment with Bidirectional Attention Mechanism.

Proceedings of the IEEE International Conference on Acoustics, 2022

Cloning One's Voice Using Very Limited Data in the Wild.

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Neural Dubber: Dubbing for Silent Videos According to Scripts.

CoRR, 2021

VoiceFixer: Toward General Speech Restoration With Neural Vocoder.

CoRR, 2021

FeatherTTS: Robust and Efficient attention based Neural TTS.

Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

Neural Dubber: Dubbing for Videos According to Scripts.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020

AdaDurIAN: Few-shot Adaptation for Neural Text-to-Speech with DurIAN.

CoRR, 2020

FeatherWave: An Efficient High-Fidelity Neural Vocoder with Multi-Band Linear Prediction.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

The Tencent speech synthesis system for Blizzard Challenge 2020.

Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

2019

Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder.

Qiao Tian, Xucheng Wan, Shan Liu

Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

The Tencent speech synthesis system for Blizzard Challenge 2019.

Qiao Tian, Jing Chen, Shan Liu

Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019

2018

Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder.

CoRR, 2018
