Heeseung Kim

According to our database, Heeseung Kim authored at least 20 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
VoiceGuider: Enhancing Out-of-Domain Performance in Parameter-Efficient Speaker-Adaptive Text-to-Speech via Autoguidance.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

NanoVoice: Efficient Speaker-Adaptive Text-to-Speech for Multiple Speakers.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

EdiText: Controllable Coarse-to-Fine Text Editing with Diffusion Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Does Your Voice Assistant Remember? Analyzing Conversational Context Recall and Utilization in Voice Interaction Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Style-Friendly SNR Sampler for Style-Driven Generation.
CoRR, 2024

Unified Speech-Text Pretraining for Spoken Dialog Modeling.
CoRR, 2024

Paralinguistics-Aware Speech-Empowered Large Language Models for Natural Conversation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023
UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Edit-A-Video: Single Video Editing with Object-Aware Consistency.
Proceedings of the Asian Conference on Machine Learning, 2023

2022
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data.
CoRR, 2022

Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance.
Proceedings of the International Conference on Machine Learning, 2022

PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Stein Latent Optimization for Generative Adversarial Networks.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Guided-TTS: Text-to-Speech with Untranscribed Speech.
CoRR, 2021

Rare Words Degenerate All Words.
CoRR, 2021

PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Driven Adaptive Prior.
CoRR, 2021

Stein Latent Optimization for GANs.
CoRR, 2021

