Jaehong Yoon

ORCID: 0000-0002-9653-9590

According to our database, Jaehong Yoon authored at least 76 papers between 2017 and 2026.

Bibliography

2026
Are Video Reasoning Models Ready to Go Outside?
CoRR, March, 2026

AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories.
CoRR, February, 2026

When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning.
CoRR, February, 2026

Reliable and Responsible Foundation Models: A Comprehensive Survey.
CoRR, February, 2026

Self-Refining Video Sampling.
CoRR, January, 2026

Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation.
CoRR, January, 2026

Enhanced Thermal-Only Object Detection via LoRA-Guided Thermal-to-Visible Translation and Cross-Modal Distillation.
IEEE Access, 2026

GGA: Gradient-Guided Augmentation for Robust Object Detection under Domain Shift.
Proceedings of the IEEE International Conference on Consumer Electronics, 2026

DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

Towards Continually-Evolving AI: Selective and Expandable Multimodal Memory System.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

DreamRunner: Fine-Grained Compositional Story-to-Video Generation with Retrieval-Augmented Motion Adaptation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
MedForget: Hierarchy-Aware Multimodal Unlearning Testbed for Medical AI.
CoRR, December, 2025

WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning.
CoRR, December, 2025

Planning with Sketch-Guided Verification for Physics-Aware Video Generation.
CoRR, November, 2025

Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?
CoRR, October, 2025

Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models.
CoRR, June, 2025

Movie Facts and Fibs (MF²): A Benchmark for Long Movie Understanding.
CoRR, June, 2025

EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance.
CoRR, May, 2025

Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization.
CoRR, April, 2025

Continual Learning: Forget-Free Winning Subnetworks for Video Representations.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2025

RSQ: Learning from Important Tokens Leads to Better Quantized LLMs.
CoRR, March, 2025

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective.
CoRR, February, 2025

Reliable and Responsible Foundation Models.
Trans. Mach. Learn. Res., 2025

CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Adapt-∞: Scalable Continual Multimodal Instruction Tuning via Dynamic Data Selection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Enhancing Thermal Infrared Object Detection Using SimAM-Integrated YOLOX for Improved Feature Representation.
Proceedings of the IEEE International Conference on Consumer Electronics, 2025

MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

RACCooN: Versatile Instructional Video Editing with Auto-Generated Narratives.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Glider: Global and Local Instruction-Driven Expert Router.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Video-Skill-CoT: Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation.
CoRR, 2024

VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement.
CoRR, 2024

Adapt-∞: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection.
CoRR, 2024

RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives.
CoRR, 2024

EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents.
CoRR, 2024

CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion.
CoRR, 2024

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences.
CoRR, 2024

SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Carpe diem: On the Evaluation of World Knowledge in Lifelong Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

STELLA: Continual Audio-Video Pre-training with SpatioTemporal Localized Alignment.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Analyzing and Mitigating Object Hallucination in Large Vision-Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Progressive Fourier Neural Representation for Sequential Video Compilation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Multimodal Representation Learning by Alternating Unimodal Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models.
CoRR, 2023

STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment.
CoRR, 2023

Progressive Neural Representation for Sequential Video Compilation.
CoRR, 2023

Forget-free Continual Learning with Soft-Winning SubNetworks.
CoRR, 2023

Continual Learners are Incremental Model Generalizers.
Proceedings of the International Conference on Machine Learning, 2023

Personalized Subgraph Federated Learning.
Proceedings of the International Conference on Machine Learning, 2023

On the Soft-Subnetwork for Few-Shot Class Incremental Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Efficient Video Representation Learning via Masked Video Modeling with Motion-centric Token Selection.
CoRR, 2022

Bitwidth Heterogeneous Federated Learning with Progressive Weight Dequantization.
Proceedings of the International Conference on Machine Learning, 2022

Forget-free Continual Learning with Winning Subnetworks.
Proceedings of the International Conference on Machine Learning, 2022

Online Coreset Selection for Rehearsal-based Continual Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Representational Continuity for Unsupervised Continual Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

BiTAT: Neural Network Binarization with Task-Dependent Aggregated Transformation.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

2021
Rethinking the Representational Continuity: Towards Unsupervised Continual Learning.
CoRR, 2021

Federated Continual Learning with Weighted Inter-client Transfer.
Proceedings of the 38th International Conference on Machine Learning, 2021

Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Rapid Structural Pruning of Neural Networks with Set-based Task-Adaptive Meta-Pruning.
CoRR, 2020

Federated Semi-Supervised Learning with Inter-Client Consistency.
CoRR, 2020

Federated Continual Learning with Adaptive Parameter Communication.
CoRR, 2020

Scalable and Order-robust Continual Learning with Additive Parameter Decomposition.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
ORACLE: Order Robust Adaptive Continual LEarning.
CoRR, 2019

2018
Adaptive Network Sparsification via Dependent Variational Beta-Bernoulli Dropout.
CoRR, 2018

Spatial and Time Domain Feature of ERP Speller System Extracted via Convolutional Neural Network.
Comput. Intell. Neurosci., 2018

Lifelong Learning with Dynamically Expandable Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Combined Group and Exclusive Sparsity for Deep Neural Networks.
Proceedings of the 34th International Conference on Machine Learning, 2017

