Sunghoon Im

Orcid: 0000-0001-9776-8101

Affiliations:
  • Daegu Gyeongbuk Institute of Science and Technology (DGIST), Department of Electrical Engineering and Computer Sciences, Daegu, South Korea
  • Korea Advanced Institute of Science and Technology (KAIST), School of Electrical Engineering, Daejeon, South Korea (PhD 2019)


According to our database, Sunghoon Im authored at least 70 papers between 2015 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
CVA: Context-aware Video-text Alignment for Video Temporal Grounding.
CoRR, March, 2026

Scale-Invariant and View-Relational Representation Learning for Full Surround Monocular Depth.
IEEE Robotics Autom. Lett., January, 2026

A Review of Online Diffusion Policy RL Algorithms for Scalable Robotic Control.
CoRR, January, 2026

CascadeOcc: Rethinking 3D Occupancy World Models With Cascaded VQ Representations.
IEEE Signal Process. Lett., 2026

Infinite-Story: A Training-Free Consistent Text-to-Image Generation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
SAMDWICH: Moment-aware Video-text Alignment for Referring Video Object Segmentation.
CoRR, August, 2025

JPEG Processing Neural Operator for Backward-Compatible Coding.
CoRR, July, 2025

Latest Object Memory Management for Temporally Consistent Video Instance Segmentation.
CoRR, July, 2025

A Training-Free Style-Personalization via Scale-wise Autoregressive Model.
CoRR, July, 2025

Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation.
CoRR, May, 2025

Flow4D: Leveraging 4D Voxel Network for LiDAR Scene Flow Estimation.
IEEE Robotics Autom. Lett., April, 2025

A Training-Free Style-aligned Image Generation with Scale-wise Autoregressive Model.
CoRR, April, 2025

Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

CAVIS: Context-Aware Video Instance Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

LOMM: Latest Object Memory Management for Temporally Consistent Video Instance Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

JPEG Processing Neural Operator for Backward-Compatible Coding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Style-Editor: Text-driven Object-centric Style Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Towards Lossless Implicit Neural Representation via Bit Plane Decomposition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
A Study on the Generality of Neural Network Structures for Monocular Depth Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2024

TEXTOC: Text-driven Object-Centric Style Transfer.
CoRR, 2024

Context-Aware Video Instance Segmentation.
CoRR, 2024

Implicit Neural Image Stitching With Enhanced and Blended Feature Reconstruction.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Offline-to-Online Knowledge Distillation for Video Instance Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Density-aware Domain Generalization for LiDAR Semantic Segmentation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Rethinking LiDAR Domain Generalization: Single Source as Multiple Density Domains.
Proceedings of the Computer Vision - ECCV 2024, 2024

BurstM: Deep Burst Multi-scale SR Using Fourier Space with Optical Flow.
Proceedings of the Computer Vision - ECCV 2024, 2024

JDEC: JPEG Decoding via Enhanced Continuous Cosine Coefficients.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Content-Adaptive Style Transfer: A Training-Free Approach with VQ Autoencoders.
Proceedings of the Computer Vision - ACCV 2024, 2024

2023
A Large-Scale Virtual Dataset and Egocentric Localization for Disaster Responses.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Domain Generalization in LiDAR Semantic Segmentation Leveraged by Density Discriminative Feature Embedding.
CoRR, 2023

Rotation Matters: Generalized Monocular 3D Object Detection for Various Camera Systems.
CoRR, 2023

Offline-to-Online Knowledge Distillation for Video Instance Segmentation.
CoRR, 2023

Depth-discriminative Metric Learning for Monocular 3D Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Dynamic Neural Network for Multi-Task Learning Searching across Diverse Network Topologies.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multi-Target Domain Adaptation with Class-Wise Attribute Transfer in Semantic Segmentation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Deep Digging into the Generalization of Self-Supervised Monocular Depth Estimation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
RVMOS: Range-View Moving Object Segmentation Leveraged by Semantic and Motion Features.
IEEE Robotics Autom. Lett., 2022

ProFeat: Unsupervised image clustering via progressive feature refinement.
Pattern Recognit. Lett., 2022

Self-Supervised Monocular Depth and Motion Learning in Dynamic Scenes: Semantic Prior to Rescue.
Int. J. Comput. Vis., 2022

CMSNet: Deep Color and Monochrome Stereo.
Int. J. Comput. Vis., 2022

Facial Depth and Normal Estimation Using Single Dual-Pixel Camera.
Proceedings of the Computer Vision - ECCV 2022, 2022

ADAS: A Direct Adaptation Strategy for Multi-Target Domain Adaptive Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Deep Depth from Uncalibrated Small Motion Clip.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Facial Depth and Normal Estimation using Single Dual-Pixel Camera.
CoRR, 2021

VolumeFusion: Deep Depth Fusion for 3D Scene Reconstruction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

DRANet: Disentangling Representation and Adaptation Networks for Unsupervised Cross-Domain Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

ZeBRA: Precisely Destroying Neural Networks with Zero-Data Based Repeated Bit Flip Attack.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Ring Difference Filter for Fast and Noise Robust Depth From Focus.
IEEE Trans. Image Process., 2020

Learning Shape-based Representation for Visual Localization in Extremely Changing Conditions.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

2019
Robust Depth Estimation Using Auto-Exposure Bracketing.
IEEE Trans. Image Process., 2019

Accurate 3D Reconstruction from Small Motion Clip for Rolling Shutter Cameras.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Instance-wise Depth and Motion Learning from Monocular Videos.
CoRR, 2019

Learning Residual Flow as Dynamic Motion from Stereo Videos.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

DISC: A Large-scale Virtual Dataset for Simulating Disaster Scenarios.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Depth Completion with Deep Geometry and Context Guidance.
Proceedings of the International Conference on Robotics and Automation, 2019

DPSNet: End-to-end Deep Plane Sweep Stereo.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
RANUS: RGB and NIR Urban Scene Dataset for Deep Scene Parsing.
IEEE Robotics Autom. Lett., 2018

Robust Depth Estimation From Auto Bracketed Images.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Geometry Guided Three-Dimensional Propagation for Depth From Small Motion.
IEEE Signal Process. Lett., 2017

Noise Robust Depth from Focus Using a Ring Difference Filter.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
All-Around Depth from Small Motion with a Spherical Panoramic Camera.
Proceedings of the Computer Vision - ECCV 2016, 2016

Stereo Matching with Color and Monochrome Cameras in Low-Light Conditions.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

High-Quality Depth from Uncalibrated Small Motion Clip.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Relative attributes with deep Convolutional Neural Network.
Proceedings of the 12th International Conference on Ubiquitous Robots and Ambient Intelligence, 2015

Depth estimation from light field cameras.
Proceedings of the 12th International Conference on Ubiquitous Robots and Ambient Intelligence, 2015

Depth from accidental motion using geometry prior.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

High Quality Structure from Small Motion for Rolling Shutter Cameras.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

