Sung-Bin Kim

Orcid: 0000-0003-3542-9934

According to our database, Sung-Bin Kim authored at least 21 papers between 2009 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
Efficient Hyper-Parameter Search for LoRA via Language-aided Bayesian Optimization.
CoRR, February, 2026

2025
FacEDiT: Unified Talking Face Editing and Generation via Facial Motion Infilling.
CoRR, December, 2025

AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SoundBrush: Sound as a Brush for Visual Scene Editing.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
The devil in the details: simple and effective optical flow synthetic data generation.
Vis. Comput., December, 2024

A Large-Scale 3D Face Mesh Video Dataset via Neural Re-parameterized Optimization.
Trans. Mach. Learn. Res., 2024

Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment.
CoRR, 2024

Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert.
CoRR, 2024

Revisiting Learning-based Video Motion Magnification for Real-time Processing.
CoRR, 2024

LaughTalk: Expressive 3D Talking Head Generation with Laughter.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023
Prefix Tuning for Automated Audio Captioning.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Lightweight Speaker Recognition in Poincaré Spaces.
IEEE Signal Process. Lett., 2022

2009
Analytic Models for Peak Current Computation in General Interconnect Circuits.
Proceedings of the 2009 International Conference on Computer Design, 2009

Reducing the Far-end Crosstalk Using Advanced Guard Trace in PCB Transmission Lines.
Proceedings of the 2009 International Conference on Computer Design, 2009

