Jaehong Yoon

Orcid: 0000-0002-9653-9590

According to our database1, Jaehong Yoon authored at least 59 papers between 2017 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning.
CoRR, July, 2025

MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation.
CoRR, June, 2025

Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models.
CoRR, June, 2025

Movie Facts and Fibs (MF<sup>2</sup>): A Benchmark for Long Movie Understanding.
CoRR, June, 2025

Video-Skill-CoT: Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning.
CoRR, June, 2025

EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance.
CoRR, May, 2025

Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization.
CoRR, April, 2025

Continual Learning: Forget-Free Winning Subnetworks for Video Representations.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2025

RSQ: Learning from Important Tokens Leads to Better Quantized LLMs.
CoRR, March, 2025

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective.
CoRR, February, 2025

CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Adapt-∞: Scalable Continual Multimodal Instruction Tuning via Dynamic Data Selection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Enhancing Thermal Infrared Object Detection Using SimAM-Integrated YOLOX for Improved Feature Representation.
Proceedings of the IEEE International Conference on Consumer Electronics, 2025

VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation.
CoRR, 2024

VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement.
CoRR, 2024

Adapt-∞: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection.
CoRR, 2024

Glider: Global and Local Instruction-Driven Expert Router.
CoRR, 2024

RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives.
CoRR, 2024

EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents.
CoRR, 2024

CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion.
CoRR, 2024

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences.
CoRR, 2024

SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Carpe diem: On the Evaluation of World Knowledge in Lifelong Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

STELLA: Continual Audio-Video Pre-training with SpatioTemporal Localized Alignment.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Analyzing and Mitigating Object Hallucination in Large Vision-Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Progressive Fourier Neural Representation for Sequential Video Compilation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Multimodal Representation Learning by Alternating Unimodal Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models.
CoRR, 2023

STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment.
CoRR, 2023

Progressive Neural Representation for Sequential Video Compilation.
CoRR, 2023

Forget-free Continual Learning with Soft-Winning SubNetworks.
CoRR, 2023

Continual Learners are Incremental Model Generalizers.
Proceedings of the International Conference on Machine Learning, 2023

Personalized Subgraph Federated Learning.
Proceedings of the International Conference on Machine Learning, 2023

On the Soft-Subnetwork for Few-Shot Class Incremental Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Efficient Video Representation Learning via Masked Video Modeling with Motion-centric Token Selection.
CoRR, 2022

Bitwidth Heterogeneous Federated Learning with Progressive Weight Dequantization.
Proceedings of the International Conference on Machine Learning, 2022

Forget-free Continual Learning with Winning Subnetworks.
Proceedings of the International Conference on Machine Learning, 2022

Online Coreset Selection for Rehearsal-based Continual Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Representational Continuity for Unsupervised Continual Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

BiTAT: Neural Network Binarization with Task-Dependent Aggregated Transformation.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

2021
Rethinking the Representational Continuity: Towards Unsupervised Continual Learning.
CoRR, 2021

Federated Continual Learning with Weighted Inter-client Transfer.
Proceedings of the 38th International Conference on Machine Learning, 2021

Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Rapid Structural Pruning of Neural Networks with Set-based Task-Adaptive Meta-Pruning.
CoRR, 2020

Federated Semi-Supervised Learning with Inter-Client Consistency.
CoRR, 2020

Federated Continual Learning with Adaptive Parameter Communication.
CoRR, 2020

Scalable and Order-robust Continual Learning with Additive Parameter Decomposition.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
ORACLE: Order Robust Adaptive Continual LEarning.
CoRR, 2019

2018
Adaptive Network Sparsification via Dependent Variational Beta-Bernoulli Dropout.
CoRR, 2018

Spatial and Time Domain Feature of ERP Speller System Extracted via Convolutional Neural Network.
Comput. Intell. Neurosci., 2018

Lifelong Learning with Dynamically Expandable Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Combined Group and Exclusive Sparsity for Deep Neural Networks.
Proceedings of the 34th International Conference on Machine Learning, 2017


  Loading...