Jiwan Chung

According to our database1, Jiwan Chung authored at least 27 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation.
CoRR, May, 2025

Explain with Visual Keypoints Like a Real Mentor! A Benchmark for Multimodal Solution Explanation.
CoRR, April, 2025

VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms.
CoRR, March, 2025

GuideDog: A Real-World Egocentric Multimodal Dataset for Blind and Low-Vision Accessibility-Aware Guidance.
CoRR, March, 2025

Teaching Metric Distance to Autoregressive Multimodal Foundational Models.
CoRR, March, 2025

SEAL: Entangled White-box Watermarks on Low-Rank Adaptation.
CoRR, January, 2025

Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Speaking Beyond Language: A Large-Scale Multimodal Dataset for Learning Nonverbal Cues from Video-Grounded Dialogues.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Are Any-to-Any Models More Consistent Across Modality Transfers Than Specialists?
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

MASS: Overcoming Language Bias in Image-Text Matching.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction.
CoRR, 2024

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models.
CoRR, 2024

Towards Visual Text Design Transfer Across Languages.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you!
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Long Story Short: a Summarize-then-Search Method for Long Video Question Answering.
CoRR, 2023

Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Commonsense Norms.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

VLIS: Unimodal Language Models Guide Multimodal Language Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Fusing Pre-Trained Language Models with Multimodal Prompts through Reinforcement Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Long Story Short: a Summarize-then-Search Method for Prompt-Based Long Video Question Answering.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Multimodal Knowledge Alignment with Reinforcement Learning.
CoRR, 2022

2021
Automatic Curation of Large-Scale Datasets for Audio-Visual Representation Learning.
CoRR, 2021

ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Transitional Adaptation of Pretrained Models for Visual Storytelling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Character Grounding and Re-identification in Story of Videos and Text Descriptions.
Proceedings of the Computer Vision - ECCV 2020, 2020


  Loading...