Jiwan Chung

According to our database, Jiwan Chung authored at least 30 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

A11YN: aligning LLMs for accessible web UI code generation.

CoRR, October, 2025

What MLLMs Learn about When they Learn about Multimodal Reasoning: Perception, Reasoning, or their Integration?

CoRR, October, 2025

Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation.

CoRR, May, 2025

Explain with Visual Keypoints Like a Real Mentor! A Benchmark for Multimodal Solution Explanation.

CoRR, April, 2025

VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms.

CoRR, March, 2025

GuideDog: A Real-World Egocentric Multimodal Dataset for Blind and Low-Vision Accessibility-Aware Guidance.

CoRR, March, 2025

Teaching Metric Distance to Autoregressive Multimodal Foundational Models.

CoRR, March, 2025

SEAL: Entangled White-box Watermarks on Low-Rank Adaptation.

CoRR, January, 2025

Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction.

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Speaking Beyond Language: A Large-Scale Multimodal Dataset for Learning Nonverbal Cues from Video-Grounded Dialogues.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Are Any-to-Any Models More Consistent Across Modality Transfers Than Specialists?

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

MASS: Overcoming Language Bias in Image-Text Matching.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction.

CoRR, 2024

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models.

CoRR, 2024

Towards Visual Text Design Transfer Across Languages.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you!

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

Long Story Short: a Summarize-then-Search Method for Long Video Question Answering.

Jiwan Chung

Youngjae Yu

CoRR, 2023

Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Commonsense Norms.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

VLIS: Unimodal Language Models Guide Multimodal Language Generation.

Jiwan Chung

Youngjae Yu

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Fusing Pre-Trained Language Models with Multimodal Prompts through Reinforcement Learning.

Prithviraj Ammanabrolu

Ronan Le Bras

Gunhee Kim

Yejin Choi

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Long Story Short: a Summarize-then-Search Method for Prompt-Based Long Video Question Answering.

Jiwan Chung

Youngjae Yu

Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022

Multimodal Knowledge Alignment with Reinforcement Learning.

Prithviraj Ammanabrolu

CoRR, 2022

2021

Automatic Curation of Large-Scale Datasets for Audio-Visual Representation Learning.

CoRR, 2021

ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Transitional Adaptation of Pretrained Models for Visual Storytelling.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Character Grounding and Re-identification in Story of Videos and Text Descriptions.

Proceedings of the Computer Vision - ECCV 2020, 2020
