Zhiyuan Zhao

Affiliations:
  • Microsoft Research Asia, Beijing, China


According to our database, Zhiyuan Zhao authored at least 10 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.


Bibliography

2024
ART•V: Auto-Regressive Text-to-Video Generation with Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
ART·V: Auto-Regressive Text-to-Video Generation with Diffusion Models.
CoRR, 2023

TridentSE: Guiding Speech Enhancement with 32 Global Tokens.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

2022
An Anchor-Free Detector for Continuous Speech Keyword Spotting.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework.
CoRR, 2021

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Joint Time-Frequency and Time Domain Learning for Speech Enhancement.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

