Junhyeok Lee

Orcid: 0000-0002-4950-5371

Affiliations:
  • Johns Hopkins University, Center for Language and Speech Processing, Baltimore, MD, USA


According to our database, Junhyeok Lee authored at least 15 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Super Monotonic Alignment Search.
CoRR, 2024

DualSpeech: Enhancing Speaker-Fidelity and Text-Intelligibility Through Dual Classifier-Free Guidance.
CoRR, 2024

LatentSwap: An Efficient Latent Code Mapping Framework for Face Swapping.
CoRR, 2024

DualSpeech: Enhancing Speaker-Fidelity and Text-Intelligibility Through Dual Classifier-Free Guidance.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Diversifying and Expanding Frequency-Adaptive Convolution Kernels for Sound Event Detection.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

JenGAN: Stacked Shifted Filters in GAN-Based Speech Synthesis.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023
VIFS: An End-to-End Variational Inference for Foley Sound Synthesis.
CoRR, 2023

PITS: Variational Pitch Inference without Fundamental Frequency for End-to-End Pitch-controllable TTS.
CoRR, 2023

PhaseAug: A Differentiable Augmentation for Speech Synthesis to Simulate One-to-Many Mapping.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

2022
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

ASSEM-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Talking Face Generation with Multilingual TTS.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Controllable and Interpretable Singing Voice Decomposition via Assem-VC.
CoRR, 2021

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

