Yicheng Gu

Orcid: 0009-0001-7819-5667

According to our database1, Yicheng Gu authored at least 18 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Neurodyne: Neural Pitch Manipulation with Representation Learning and Cycle-Consistency GAN.
CoRR, May, 2025

SingNet: Towards a Large-Scale, Diverse, and In-the-Wild Singing Voice Dataset.
CoRR, May, 2025

Diff-SSL-G-Comp: Towards a Large-Scale and Diverse Dataset for Virtual Analog Modeling.
CoRR, April, 2025

gCom: Fine-grained Compressors in Graphics Memory of Mobile GPU.
ACM Trans. Archit. Code Optim., March, 2025

Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation.
CoRR, January, 2025

gFlow: Distributed Real-Time Reverse Remote Rendering System Model.
Proceedings of the MultiMedia Modeling, 2025

2024
CARE: Cloudified Android With Optimized Rendering Platform.
IEEE Trans. Multim., 2024

An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoders.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds.
CoRR, 2024

An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder.
CoRR, 2024

gVulkan: Scalable GPU Pooling for Pixel-Grained Rendering in Ray Tracing.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

Amphion: an Open-Source Audio, Music, and Speech Generation Toolkit.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Leveraging Diverse Semantic-Based Audio Pretrained Models for Singing Voice Conversion.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Emilia: An Extensive, Multilingual, and Diverse Speech Dataset For Large-Scale Speech Generation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

gHermes: Application-Unaware Acceleration for Cloud Rendering and Computing with Efficient GPU Utilization.
Proceedings of the 36th International Conference on Software Engineering and Knowledge Engineering, 2024

Multi-Scale Sub-Band Constant-Q Transform Discriminator for High-Fidelity Vocoder.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit.
CoRR, 2023

Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion.
CoRR, 2023


  Loading...