Jaehyeon Kim

Orcid: 0000-0001-9347-3680

According to our database1, Jaehyeon Kim authored at least 30 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music.
CoRR, April, 2026

PersonaPlex: Voice and Role Control for Full Duplex Conversational Speech Models.
CoRR, February, 2026

2025
Miniature Testbed for Validating Multi-Agent Cooperative Autonomous Driving.
CoRR, November, 2025

Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models.
CoRR, July, 2025

Water-Based Proof of Jensen's Inequality.
Am. Math. Mon., March, 2025

BuilDroid: A Self-Correcting LLM Agent for Automated Android Builds.
Proceedings of the 40th IEEE/ACM International Conference on Automated Software Engineering, 2025

How to Move Your Dragon: Text-to-Motion Synthesis for Large-Vocabulary Objects.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Efficient Generative Modeling with Residual Vector Quantization-Based Tokens.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Development of an Artificial Intelligence-Based Prediction System for Short-Term Solar Power Generation.
Proceedings of the IEEE International Conference on Consumer Electronics, 2025

2024
Practical and Reproducible Symbolic Music Generation by Large Language Models with Structural Embeddings.
CoRR, 2024

DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer.
CoRR, 2024

CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
1/<i>f</i> Noise in Synaptic Ferroelectric Tunnel Junction: Impact on Convolutional Neural Network.
Adv. Intell. Syst., June, 2023

Graph Learning-Based Blockchain Phishing Account Detection with a Heterogeneous Transaction Graph.
Sensors, 2023

Demonstration of crystalline IGZO transistor with high thermal stability for memory applications.
Proceedings of the 2023 IEEE Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits), 2023

QASA: Advanced Question Answering on Scientific Articles.
Proceedings of the International Conference on Machine Learning, 2023

2022
Ethereum Cybercriminal Account.
Dataset, July, 2022

Progressive Deblurring of Diffusion Models for Coarse-to-Fine Image Synthesis.
CoRR, 2022

Neuron Circuits for Low-Power Spiking Neural Networks Using Time-To-First-Spike Encoding.
IEEE Access, 2022

On-Chip Trainable Spiking Neural Networks Using Time-To-First-Spike Encoding.
IEEE Access, 2022

Optimal Bias Conditions for FET-type Gas Sensors to Minimize Current Fluctuations.
Proceedings of the IEEE International Symposium on Olfaction and Electronic Nose, 2022

A Graph Embedding-based Identity Inference Attack on Blockchain Systems.
Proceedings of the International Conference on Electronics, Information, and Communication, 2022

Automatic Hepatocellular Carcinoma Diagnosis using Graph Convolutional Network.
Proceedings of the International Conference on Electronics, Information, and Communication, 2022

2021
Time Encoding in Languages and Investment Efficiency.
Manag. Sci., 2021

Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
FloWaveNet : A Generative Flow for Raw Audio.
Proceedings of the 36th International Conference on Machine Learning, 2019


  Loading...