Heeseung Kim

According to our database¹, Heeseung Kim authored at least 20 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

VoiceGuider: Enhancing Out-of-Domain Performance in Parameter-Efficient Speaker-Adaptive Text-to-Speech via Autoguidance.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

NanoVoice: Efficient Speaker-Adaptive Text-to-Speech for Multiple Speakers.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

EdiText: Controllable Coarse-to-Fine Text Editing with Diffusion Language Models.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Does Your Voice Assistant Remember? Analyzing Conversational Context Recall and Utilization in Voice Interaction Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Style-Friendly SNR Sampler for Style-Driven Generation.

[BibT_eX]

[DOI]

CoRR, 2024

Unified Speech-Text Pretraining for Spoken Dialog Modeling.

[BibT_eX]

[DOI]

CoRR, 2024

Paralinguistics-Aware Speech-Empowered Large Language Models for Natural Conversation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023

UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Edit-A-Video: Single Video Editing with Object-Aware Consistency.

[BibT_eX]

[DOI]

Proceedings of the Asian Conference on Machine Learning, 2023

2022

Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data.

[BibT_eX]

[DOI]

Sungwon Kim

Heeseung Kim

Sungroh Yoon

CoRR, 2022

Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance.

[BibT_eX]

[DOI]

Heeseung Kim

Sungwon Kim

Sungroh Yoon

Proceedings of the International Conference on Machine Learning, 2022

PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Stein Latent Optimization for Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Guided-TTS: Text-to-Speech with Untranscribed Speech.

[BibT_eX]

[DOI]

Heeseung Kim

Sungwon Kim

Sungroh Yoon

CoRR, 2021

Rare Words Degenerate All Words.

[BibT_eX]

[DOI]

CoRR, 2021

PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Driven Adaptive Prior.

[BibT_eX]

[DOI]

CoRR, 2021

Stein Latent Optimization for GANs.

[BibT_eX]

[DOI]

CoRR, 2021

Heeseung Kim

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...