Xin Jin

Orcid: 0000-0002-1820-8358

Affiliations:
  • Eastern Institute of Technology, Ningbo Institute of Digital Twin, Ningbo, China
  • University of Science and Technology of China (USTC), Intelligent Media Computing Lab, Hefei, China (PhD)


According to our database1, Xin Jin authored at least 105 papers between 2005 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Revisiting MLLM Token Technology through the Lens of Classical Visual Coding.
CoRR, August, 2025

Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions.
CoRR, August, 2025

Knowledge Regularized Negative Feature Tuning of Vision-Language Models for Out-of-Distribution Detection.
CoRR, July, 2025

DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge.
CoRR, July, 2025

MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving.
IEEE Trans. Intell. Transp. Syst., June, 2025

Exploring Contrastive Pre-Training for Domain Connections in Medical Image Segmentation.
IEEE Trans. Medical Imaging, April, 2025

Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.
CoRR, April, 2025

Interpretable Single-View 3D Gaussian Splatting using Unsupervised Hierarchical Disentangled Representation Learning.
CoRR, April, 2025

Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning.
CoRR, March, 2025

ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning.
CoRR, March, 2025

SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation.
CoRR, February, 2025

Deep Reinforcement Learning with Hybrid Intrinsic Reward Model.
CoRR, January, 2025

Adaptive Data Exploitation in Deep Reinforcement Learning.
CoRR, January, 2025

Unleash the Power of Vision-Language Models by Visual Attention Prompt and Multimodal Interaction.
IEEE Trans. Multim., 2025

RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning.
Trans. Mach. Learn. Res., 2025

Open-World Reinforcement Learning over Long Short-Term Imagination.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

UniScene: Unified Occupancy-centric Driving Scene Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

RLLTE: Long-Term Evolution Project of Reinforcement Learning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Structure-preserving feature alignment for old photo colorization.
Pattern Recognit., January, 2024

Domain Prompt Tuning via Meta Relabeling for Unsupervised Adversarial Adaptation.
IEEE Trans. Multim., 2024

Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task.
CoRR, 2024

OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation.
CoRR, 2024

Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction.
CoRR, 2024

MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations.
CoRR, 2024

RailPC: A large-scale railway point cloud semantic segmentation dataset.
CAAI Trans. Intell. Technol., 2024

Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024

Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Bridging Stereo Geometry and BEV Representation with Reliable Mutual Interaction for Semantic Scene Completion.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

An Explainable Spectral Analysis For Light Field Image Quality Assessment.
Proceedings of the IEEE International Conference on Image Processing, 2024

Rethinking Domain Adaptation and Generalization in the ERA Of Clip.
Proceedings of the IEEE International Conference on Image Processing, 2024

Architecture-Agnostic Unsupervised Gradient Regularization for Parameter-Efficient Transfer Learning.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects.
Proceedings of the Computer Vision - ECCV 2024, 2024

Rate-Distortion-Cognition Controllable Versatile Neural Image Compression.
Proceedings of the Computer Vision - ECCV 2024, 2024

Hierarchical Temporal Context Learning for Camera-Based Semantic Scene Completion.
Proceedings of the Computer Vision - ECCV 2024, 2024

Closed-Loop Unsupervised Representation Disentanglement with β-VAE Distillation and Diffusion Probabilistic Feedback.
Proceedings of the Computer Vision - ECCV 2024, 2024

ReGenNet: Towards Human Action-Reaction Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Inter-X: Towards Versatile Human-Human Interaction Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Consistency Prior Matters: Biomedical-Prompting Dual Augmentation for Domain Adaptive Medical Image Segmentation.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024

One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

SwiftPillars: High-Efficiency Pillar Encoder for Lidar-Based 3D Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Semantical video coding: Instill static-dynamic clues into structured bitstream for AI tasks.
J. Vis. Commun. Image Represent., May, 2023

Learning Cross-Scale Weighted Prediction for Efficient Neural Video Compression.
IEEE Trans. Image Process., 2023

RailSeg: Learning Local-Global Feature Aggregation With Contextual Information for Railway Point Cloud Semantic Segmentation.
IEEE Trans. Geosci. Remote. Sens., 2023

Extracting 3-D Structural Lines of Building From ALS Point Clouds Using Graph Neural Network Embedded With Corner Information.
IEEE Trans. Geosci. Remote. Sens., 2023

Diffusion Models for Image Restoration and Enhancement - A Comprehensive Survey.
CoRR, 2023

One at A Time: Multi-step Volumetric Probability Distribution Diffusion for Depth Estimation.
CoRR, 2023

EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model.
CoRR, 2023

Collaborative World Models: An Online-Offline Transfer RL Approach.
CoRR, 2023

Prompt-ICM: A Unified Framework towards Image Coding for Machines with Task-driven Prompts.
CoRR, 2023

Inpaint Anything: Segment Anything Meets Image Inpainting.
CoRR, 2023

[CLS] Token is All You Need for Zero-Shot Semantic Segmentation.
CoRR, 2023

StereoScene: BEV-Assisted Stereo Matching Empowers 3D Semantic Scene Completion.
CoRR, 2023

Composable Image Coding for Machine via Task-oriented Internal Adaptor and External Prior.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

ActFormer: A GAN-based Transformer towards General Action-Conditioned 3D Human Motion Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NaviNeRF: NeRF-based 3D Representation Disentanglement by Latent Semantic Navigation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Semantically Structured Image Compression via Irregular Group-Based Decoupling.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Distortion Invariant Representation for Image Restoration from a Causality Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Task Residual for Tuning Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Style Normalization and Restitution for Domain Generalization and Adaptation.
IEEE Trans. Multim., 2022

Dual Prior Learning for Blind and Blended Image Restoration.
IEEE Trans. Image Process., 2022

Learned Block-Based Hybrid Image Compression.
IEEE Trans. Circuits Syst. Video Technol., 2022

Tackling Visual Control via Multi-View Exploration Maximization.
CoRR, 2022

Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning.
CoRR, 2022

Semantically Video Coding: Instill Static-Dynamic Clues into Structured Bitstream for AI Tasks.
CoRR, 2022

Deliberated Domain Bridging for Domain Adaptive Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Meta Clustering Learning for Large-scale Unsupervised Person Re-identification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Unleashing the Potential of Adaptation Models via Go-getting Domain Labels.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Image Coding for Machines with Omnipotent Feature Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Unleashing Potential of Unsupervised Pre-Training with Intra-Identity Regularization for Person Re-Identification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Cloth-Changing Person Re-identification from A Single Image with Gait Prediction and Regularization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Reusing the Task-specific Classifier as a Discriminator: Discriminator-free Adversarial Domain Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Multi-task Learning-based All-in-one Collaboration Framework for Degraded Image Super-resolution.
ACM Trans. Multim. Comput. Commun. Appl., 2021

CASINet: Content-Adaptive Scale Interaction Networks for scene parsing.
Neurocomputing, 2021

Learning Cross-Scale Prediction for Efficient Neural Video Compression.
CoRR, 2021

Confounder Identification-free Causal Visual Feature Learning.
CoRR, 2021

Few-Shot Real Image Restoration via Distortion-Relation Guided Transfer Learning.
CoRR, 2021

Meta Clustering Learning for Large-scale Unsupervised Person Re-identification.
CoRR, 2021

Style Normalization and Restitution for DomainGeneralization and Adaptation.
CoRR, 2021

Dense Interaction Learning for Video-based Person Re-identification.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Re-energizing Domain Discriminator with Sample Relabeling for Adversarial Domain Adaptation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning Omni-Frequency Region-adaptive Representations for Real Image Super-Resolution.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Learning for Video Compression.
IEEE Trans. Circuits Syst. Video Technol., 2020

AI-GAN: Asynchronous interactive generative adversarial network for single image rain removal.
Pattern Recognit., 2020

Feature Alignment and Restoration for Domain Generalization and Adaptation.
CoRR, 2020


FAN: Frequency Aggregation Network for Real Image Super-Resolution.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Learning Disentangled Feature Representation for Hybrid-Distorted Image Restoration.
Proceedings of the Computer Vision - ECCV 2020, 2020

Global Distance-Distributions Separation for Unsupervised Person Re-identification.
Proceedings of the Computer Vision - ECCV 2020, 2020

Relation-Aware Global Attention for Person Re-Identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Style Normalization and Restitution for Generalizable Person Re-Identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Region Normalization for Image Inpainting.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Semantics-Aligned Representation Learning for Person Re-Identification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Uncertainty-Aware Multi-Shot Knowledge Distillation for Image-Based Object Re-Identification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
CaseNet: Content-Adaptive Scale Interaction Networks for Scene Parsing.
CoRR, 2019

Relation-Aware Global Attention.
CoRR, 2019

AI-GAN: Signal De-Interference via Asynchronous Interactive Generative Adversarial Network.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Unsupervised Single Image Deraining with Self-Supervised Constraints.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

2018
Multiscale Progressive Image Compression Network Guided by Learnable Just Noticeable Distortion.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Augmented Coarse-to-Fine Video Frame Synthesis with Semantic Loss.
Proceedings of the Pattern Recognition and Computer Vision - First Chinese Conference, 2018

A Decomposed Dual-Cross Generative Adversarial Network for Image Rain Removal.
Proceedings of the British Machine Vision Conference 2018, 2018

2005
H.264-compatible spatially scalable video coding with in-band prediction.
Proceedings of the 2005 International Conference on Image Processing, 2005


  Loading...