Jaehong Yoon
Orcid: 0000-0002-9653-9590
According to our database1,
Jaehong Yoon authored at least 76 papers
between 2017 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories.
CoRR, February, 2026
When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning.
CoRR, February, 2026
CoRR, February, 2026
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation.
CoRR, January, 2026
Enhanced Thermal-Only Object Detection via LoRA-Guided Thermal-to-Visible Translation and Cross-Modal Distillation.
IEEE Access, 2026
Proceedings of the IEEE International Conference on Consumer Electronics, 2026
DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
DreamRunner: Fine-Grained Compositional Story-to-Video Generation with Retrieval-Augmented Motion Adaptation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
CoRR, December, 2025
CoRR, December, 2025
CoRR, November, 2025
CoRR, October, 2025
Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models.
CoRR, June, 2025
CoRR, June, 2025
CoRR, May, 2025
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization.
CoRR, April, 2025
IEEE Trans. Pattern Anal. Mach. Intell., March, 2025
CoRR, March, 2025
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective.
CoRR, February, 2025
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Adapt-∞: Scalable Continual Multimodal Instruction Tuning via Dynamic Data Selection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Enhancing Thermal Infrared Object Detection Using SimAM-Integrated YOLOX for Improved Feature Representation.
Proceedings of the IEEE International Conference on Consumer Electronics, 2025
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation.
CoRR, 2024
VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion.
CoRR, 2024
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences.
CoRR, 2024
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
CoRR, 2023
CoRR, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
2022
Efficient Video Representation Learning via Masked Video Modeling with Motion-centric Token Selection.
CoRR, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022
2021
CoRR, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021
2020
Rapid Structural Pruning of Neural Networks with Set-based Task-Adaptive Meta-Pruning.
CoRR, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
2019
2018
CoRR, 2018
Spatial and Time Domain Feature of ERP Speller System Extracted via Convolutional Neural Network.
Comput. Intell. Neurosci., 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
2017
Proceedings of the 34th International Conference on Machine Learning, 2017