Jiyoung Lee

Orcid: 0009-0000-6647-9403

Affiliations:
  • Ewha Womans University, Department of Artificial Intelligence, Seoul, South Korea
  • NAVER AI Lab, Seongnam, South Korea
  • Yonsei University, School of Electrical and Electronic Engineering, Seoul, Korea (PhD)


According to our database1, Jiyoung Lee authored at least 36 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Descriptive Image-Text Matching with Graded Contextual Similarity.
CoRR, May, 2025

Prototype-Guided Attention Distillation for Discriminative Person Search.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2025

Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Read, Watch and Scream! Sound Generation from Text and Video.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Discriminative action tubelet detector for weakly-supervised action detection.
Pattern Recognit., 2024

Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Bridging Vision and Language Spaces with Assignment Prediction.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Panoramic Image-to-Image Translation.
CoRR, 2023

Semi-Parametric Video-Grounded Text Generation.
CoRR, 2023

Language-free Training for Zero-shot Video Grounding.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Robust Camera Pose Refinement for Multi-Resolution Hash Encoding.
Proceedings of the International Conference on Machine Learning, 2023

Hierarchical Visual Primitive Experts for Compositional Zero-Shot Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Dense Text-to-Image Generation with Attention Modulation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Imaginary Voice: Face-Styled Diffusion Model for Text-to-Speech.
Proceedings of the IEEE International Conference on Acoustics, 2023

Dual-Path Adaptation from Image to Video Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Three Recipes for Better 3D Pseudo-GTs of 3D Human Mesh Estimation in the Wild.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MIDMs: Matching Interleaved Diffusion Models for Exemplar-Based Image Translation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Mutual Information Divergence: A Unified Metric for Multimodal Generative Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Multi-Domain Unsupervised Image-to-Image Translation with Appearance Adaptive Convolution.
Proceedings of the IEEE International Conference on Acoustics, 2022

PointFix: Learning to Fix Domain Bias for Robust Online Stereo Adaptation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Probabilistic Representations for Video Contrastive Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Pin the Memory: Learning to Generalize Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CausalCity: Complex Simulations with Agency for Causal Discovery and Reasoning.
Proceedings of the 1st Conference on Causal Learning and Reasoning, 2022

2021
Self-Balanced Learning for Domain Generalization.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Bridge To Answer: Structure-Aware Graph Interaction Network for Video Question Answering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Looking Into Your Speech: Learning Cross-Modal Affinity for Audio-Visual Speech Separation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Wide and Narrow: Video Prediction from Context and Motion.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Multi-Modal Recurrent Attention Networks for Facial Expression Recognition.
IEEE Trans. Image Process., 2020

SumGraph: Video Summarization via Recursive Graph Modeling.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Graph Regularization Network with Semantic Affinity for Weakly-Supervised Temporal Action Localization.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Video Summarization by Learning Relationships between Action and Scene.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Context-Aware Emotion Recognition Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Learning to Detect, Associate, and Recognize Human Actions and Surrounding Scenes in Untrimmed Videos.
Proceedings of the 1st Workshop and Challenge on Comprehensive Video Understanding in the Wild, 2018

Audio-Visual Attention Networks for Emotion Recognition.
Proceedings of the 2018 Workshop on Audio-Visual Scene Understanding for Immersive Multimedia, 2018

Spatiotemporal Attention Based Deep Neural Networks for Emotion Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Automatic 2D-to-3D conversion using multi-scale deep neural network.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017


  Loading...