Xin Jin

Orcid: 0000-0002-1820-8358

Affiliations:

Eastern Institute of Technology, Ningbo Institute of Digital Twin, Ningbo, China
University of Science and Technology of China (USTC), Intelligent Media Computing Lab, Hefei, China (PhD)

According to our database¹, Xin Jin authored at least 124 papers between 2005 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

Diffusion Models for Image Restoration and Enhancement: A Comprehensive Survey.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., November, 2025

Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method.

[BibT_eX]

[DOI]

CoRR, October, 2025

OmniNWM: Omniscient Driving Navigation World Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

Vision-Centric Activation and Coordination for Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation.

[BibT_eX]

[DOI]

CoRR, October, 2025

When MLLMs Meet Compression Distortion: A Coding Paradigm Tailored to MLLMs.

[BibT_eX]

[DOI]

CoRR, September, 2025

The 1st International Workshop on Disentangled Representation Learning for Controllable Generation (DRL4Real): Methods and Results.

[BibT_eX]

[DOI]

CoRR, September, 2025

Revisiting MLLM Token Technology through the Lens of Classical Visual Coding.

[BibT_eX]

[DOI]

CoRR, August, 2025

ImagiDrive: A Unified Imagination-and-Planning Framework for Autonomous Driving.

[BibT_eX]

[DOI]

CoRR, August, 2025

Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions.

[BibT_eX]

[DOI]

CoRR, August, 2025

Knowledge Regularized Negative Feature Tuning of Vision-Language Models for Out-of-Distribution Detection.

[BibT_eX]

[DOI]

CoRR, July, 2025

DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge.

[BibT_eX]

[DOI]

CoRR, July, 2025

MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., June, 2025

Behavior Foundation Model: Towards Next-Generation Whole-Body Control System of Humanoid Robots.

[BibT_eX]

[DOI]

CoRR, June, 2025

EDBench: Large-Scale Electron Density Data for Molecular Modeling.

[BibT_eX]

[DOI]

CoRR, May, 2025

Exploring Contrastive Pre-Training for Domain Connections in Medical Image Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Medical Imaging, April, 2025

Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, April, 2025

Interpretable Single-View 3D Gaussian Splatting using Unsupervised Hierarchical Disentangled Representation Learning.

[BibT_eX]

[DOI]

CoRR, April, 2025

Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, March, 2025

ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, March, 2025

SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation.

[BibT_eX]

[DOI]

CoRR, February, 2025

Deep Reinforcement Learning with Hybrid Intrinsic Reward Model.

[BibT_eX]

[DOI]

CoRR, January, 2025

Adaptive Data Exploitation in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, January, 2025

Unleash the Power of Vision-Language Models by Visual Attention Prompt and Multimodal Interaction.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2025

RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning.

[BibT_eX]

[DOI]

Mingqi Yuan

Roger Creus Castanyer

Trans. Mach. Learn. Res., 2025

Multi-Attribute Continual Learning for Blind Image Quality Assessment.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2025

Electron Density-enhanced Molecular Geometry Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Representation Disentanglement for Semantic Coding.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Open-World Reinforcement Learning over Long Short-Term Imagination.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

UniScene: Unified Occupancy-centric Driving Scene Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

RLLTE: Long-Term Evolution Project of Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Structure-preserving feature alignment for old photo colorization.

[BibT_eX]

[DOI]

Pattern Recognit., January, 2024

Domain Prompt Tuning via Meta Relabeling for Unsupervised Adversarial Adaptation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2024

TINQ: Temporal Inconsistency Guided Blind Video Quality Assessment.

[BibT_eX]

[DOI]

CoRR, 2024

Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task.

[BibT_eX]

[DOI]

CoRR, 2024

OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation.

[BibT_eX]

[DOI]

CoRR, 2024

Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction.

[BibT_eX]

[DOI]

CoRR, 2024

MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations.

[BibT_eX]

[DOI]

CoRR, 2024

MaskMol: Knowledge-guided Molecular Image Pre-Training Framework for Activity Cliffs.

[BibT_eX]

[DOI]

CoRR, 2024

StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization.

[BibT_eX]

[DOI]

CoRR, 2024

RailPC: A large-scale railway point cloud semantic segmentation dataset.

[BibT_eX]

[DOI]

CAAI Trans. Intell. Technol., 2024

Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024

Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Bridging Stereo Geometry and BEV Representation with Reliable Mutual Interaction for Semantic Scene Completion.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

An Explainable Spectral Analysis For Light Field Image Quality Assessment.

[BibT_eX]

[DOI]

Shengyang Zhao

Xin Jin

Proceedings of the IEEE International Conference on Image Processing, 2024

Rethinking Domain Adaptation and Generalization in the ERA Of Clip.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2024

Architecture-Agnostic Unsupervised Gradient Regularization for Parameter-Efficient Transfer Learning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

DreamLIP: Language-Image Pre-training with Long Captions.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Rate-Distortion-Cognition Controllable Versatile Neural Image Compression.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Hierarchical Temporal Context Learning for Camera-Based Semantic Scene Completion.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Closed-Loop Unsupervised Representation Disentanglement with β-VAE Distillation and Diffusion Probabilistic Feedback.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

ReGenNet: Towards Human Action-Reaction Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Inter-X: Towards Versatile Human-Human Interaction Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Consistency Prior Matters: Biomedical-Prompting Dual Augmentation for Domain Adaptive Medical Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024

Multi-Prompts Learning with Cross-Modal Alignment for Attribute-Based Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

SwiftPillars: High-Efficiency Pillar Encoder for Lidar-Based 3D Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Semantical video coding: Instill static-dynamic clues into structured bitstream for AI tasks.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., May, 2023

Learning Cross-Scale Weighted Prediction for Efficient Neural Video Compression.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2023

RailSeg: Learning Local-Global Feature Aggregation With Contextual Information for Railway Point Cloud Semantic Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2023

Extracting 3-D Structural Lines of Building From ALS Point Clouds Using Graph Neural Network Embedded With Corner Information.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2023

Breathing Life into Faces: Speech-driven 3D Facial Animation with Natural Head Pose and Detailed Shape.

[BibT_eX]

[DOI]

CoRR, 2023

One at A Time: Multi-step Volumetric Probability Distribution Diffusion for Depth Estimation.

[BibT_eX]

[DOI]

CoRR, 2023

EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model.

[BibT_eX]

[DOI]

CoRR, 2023

Collaborative World Models: An Online-Offline Transfer RL Approach.

[BibT_eX]

[DOI]

CoRR, 2023

Prompt-ICM: A Unified Framework towards Image Coding for Machines with Task-driven Prompts.

[BibT_eX]

[DOI]

CoRR, 2023

Inpaint Anything: Segment Anything Meets Image Inpainting.

[BibT_eX]

[DOI]

CoRR, 2023

[CLS] Token is All You Need for Zero-Shot Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

StereoScene: BEV-Assisted Stereo Matching Empowers 3D Semantic Scene Completion.

[BibT_eX]

[DOI]

CoRR, 2023

Composable Image Coding for Machine via Task-oriented Internal Adaptor and External Prior.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

ActFormer: A GAN-based Transformer towards General Action-Conditioned 3D Human Motion Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NaviNeRF: NeRF-based 3D Representation Disentanglement by Latent Semantic Navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Semantically Structured Image Compression via Irregular Group-Based Decoupling.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Distortion Invariant Representation for Image Restoration from a Causality Perspective.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Task Residual for Tuning Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Style Normalization and Restitution for Domain Generalization and Adaptation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

Dual Prior Learning for Blind and Blended Image Restoration.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Learned Block-Based Hybrid Image Compression.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Tackling Visual Control via Multi-View Exploration Maximization.

[BibT_eX]

[DOI]

CoRR, 2022

Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Semantically Video Coding: Instill Static-Dynamic Clues into Structured Bitstream for AI Tasks.

[BibT_eX]

[DOI]

CoRR, 2022

Deliberated Domain Bridging for Domain Adaptive Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Meta Clustering Learning for Large-scale Unsupervised Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Learning with Recoverable Forgetting.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Unleashing the Potential of Adaptation Models via Go-getting Domain Labels.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Image Coding for Machines with Omnipotent Feature Learning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Unleashing Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Cloth-Changing Person Re-identification from A Single Image with Gait Prediction and Regularization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Reusing the Task-specific Classifier as a Discriminator: Discriminator-free Adversarial Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Multi-task Learning-based All-in-one Collaboration Framework for Degraded Image Super-resolution.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2021

CASINet: Content-Adaptive Scale Interaction Networks for scene parsing.

[BibT_eX]

[DOI]

Neurocomputing, 2021

Learning Cross-Scale Prediction for Efficient Neural Video Compression.

[BibT_eX]

[DOI]

CoRR, 2021

Confounder Identification-free Causal Visual Feature Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Few-Shot Real Image Restoration via Distortion-Relation Guided Transfer Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Meta Clustering Learning for Large-scale Unsupervised Person Re-identification.

[BibT_eX]

[DOI]

CoRR, 2021

Style Normalization and Restitution for DomainGeneralization and Adaptation.

[BibT_eX]

[DOI]

CoRR, 2021

Dense Interaction Learning for Video-based Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Re-energizing Domain Discriminator with Sample Relabeling for Adversarial Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning Omni-Frequency Region-adaptive Representations for Real Image Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Learning for Video Compression.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2020

AI-GAN: Asynchronous interactive generative adversarial network for single image rain removal.

[BibT_eX]

[DOI]

Xin Jin

Zhibo Chen

Weiping Li

Pattern Recognit., 2020

Feature Alignment and Restoration for Domain Generalization and Adaptation.

[BibT_eX]

[DOI]

CoRR, 2020

AIM 2020 Challenge on Real Image Super-Resolution: Methods and Results.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

FAN: Frequency Aggregation Network for Real Image Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Learning Disentangled Feature Representation for Hybrid-Distorted Image Restoration.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Global Distance-Distributions Separation for Unsupervised Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Relation-Aware Global Attention for Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Style Normalization and Restitution for Generalizable Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Region Normalization for Image Inpainting.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Semantics-Aligned Representation Learning for Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Uncertainty-Aware Multi-Shot Knowledge Distillation for Image-Based Object Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

CaseNet: Content-Adaptive Scale Interaction Networks for Scene Parsing.

[BibT_eX]

[DOI]

CoRR, 2019

Relation-Aware Global Attention.

[BibT_eX]

[DOI]

CoRR, 2019

AI-GAN: Signal De-Interference via Asynchronous Interactive Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Unsupervised Single Image Deraining with Self-Supervised Constraints.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

2018

Multiscale Progressive Image Compression Network Guided by Learnable Just Noticeable Distortion.

[BibT_eX]

[DOI]

Xin Jin

Runchun Ye

Zhibo Chen

Proceedings of the IEEE Visual Communications and Image Processing, 2018

Augmented Coarse-to-Fine Video Frame Synthesis with Semantic Loss.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - First Chinese Conference, 2018

A Decomposed Dual-Cross Generative Adversarial Network for Image Rain Removal.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2018, 2018

2005

H.264-compatible spatially scalable video coding with in-band prediction.

[BibT_eX]

[DOI]

Proceedings of the 2005 International Conference on Image Processing, 2005

Xin Jin

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...