Xin Tan

Orcid: 0000-0001-9346-1196

Affiliations:
  • East China Normal University, Shanghai, China
  • City University of Hong Kong, Hong Kong (former)
  • Shanghai Jiao Tong University, China (PhD 2022)


According to our database, Xin Tan authored at least 114 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
MMoFusion: Multi-modal co-speech motion generation with diffusion model.
Pattern Recognit., 2026

2025
Bias to Balance: New-Knowledge-Preferred Few-Shot Class-Incremental Learning via Transition Calibration.
IEEE Trans. Neural Networks Learn. Syst., August, 2025

NaviMaster: Learning a Unified Policy for GUI and Embodied Navigation Tasks.
CoRR, August, 2025

T2S: Tokenized Skill Scaling for Lifelong Imitation Learning.
CoRR, August, 2025

LidarPainter: One-Step Away From Any Lidar View To Novel Guidance.
CoRR, July, 2025

EyeSeg: An Uncertainty-Aware Eye Segmentation Framework for AR/VR.
CoRR, July, 2025

From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning.
CoRR, July, 2025

YouTube-Occ: Learning Indoor 3D Semantic Occupancy Prediction from YouTube Videos.
CoRR, June, 2025

UniForward: Unified 3D Scene and Semantic Field Reconstruction via Feed-Forward Gaussian Splatting from Only Sparse-View Images.
CoRR, June, 2025

SHTOcc: Effective 3D Occupancy Prediction with Sparse Head and Tail Voxels.
CoRR, May, 2025

DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation.
CoRR, May, 2025

GEOcc: Geometrically Enhanced 3D Occupancy Network With Implicit-Explicit Depth Fusion and Contextual Self-Supervision.
IEEE Trans. Intell. Transp. Syst., April, 2025

FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents.
CoRR, April, 2025

IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval.
CoRR, April, 2025

Learnable scene prior for point cloud semantic segmentation.
Vis. Comput., January, 2025

WV-LUT: Wide Vision Lookup Tables for Real-Time Low-Light Image Enhancement.
IEEE Trans. Multim., 2025

DHP-SLAM: A real-time visual slam system with high positioning accuracy under dynamic environment.
Displays, 2025

Domain-Incremental Learning Paradigm for scene understanding via Pseudo-Replay Generation.
Graph. Model., 2025

Semi-supervised Lip-Tongue segmentation with Boundary Region Contrast Sampling.
Appl. Soft Comput., 2025

Prototype Alignment with LoRA Fusion for Class-Incremental Learning.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Efficient Prototypical Classifier for Class-Incremental Learning.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

MOS: Modeling Object-Scene Associations in Generalized Category Discovery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

One-for-More: Continual Diffusion Model for Anomaly Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

TAD: A Plug-and-Play Task Arithmetic Approach for Augmenting Diffusion Models.
Proceedings of the Computational Visual Media - 13th International Conference, 2025

DepthFisheye: Efficient Fine-Tuning of Depth Estimation Models for Fisheye Cameras.
Proceedings of the Computational Visual Media - 13th International Conference, 2025

DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view Input.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

FastLGS: Speeding Up Language Embedded Gaussians with Feature Grid Mapping.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Uni-to-Multi Modal Knowledge Distillation for Bidirectional LiDAR-Camera Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

MSPAN: Multi-scale pyramid attention network for efficient skin cancer lesion segmentation.
IET Image Process., May, 2024

Glass Makes Blurs: Learning the Visual Blurriness for Glass Surface Detection.
IEEE Trans. Ind. Informatics, April, 2024

CSFwinformer: Cross-Space-Frequency Window Transformer for Mirror Detection.
IEEE Trans. Image Process., 2024

PIG: Prompt Images Guidance for Night-Time Scene Parsing.
IEEE Trans. Image Process., 2024

Image Understands Point Cloud: Weakly Supervised 3D Semantic Segmentation via Association Learning.
IEEE Trans. Image Process., 2024

Cross-coupled prompt learning for few-shot image recognition.
Displays, 2024

Label-aware aggregation on heterophilous graphs for node representation learning.
Displays, 2024

Diffusion Implicit Policy for Unpaired Scene-aware Motion Synthesis.
CoRR, 2024

Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation.
CoRR, 2024

AttentionPainter: An Efficient and Adaptive Stroke Predictor for Scene Painting.
CoRR, 2024

LLaCA: Multimodal Large Language Continual Assistant.
CoRR, 2024

Mutual Information Guided Optimal Transport for Unsupervised Visible-Infrared Person Re-identification.
CoRR, 2024

Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining.
CoRR, 2024

FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping.
CoRR, 2024

Gradient Projection For Parameter-Efficient Continual Learning.
CoRR, 2024

Efficient Multimodal Large Language Models: A Survey.
CoRR, 2024

Exploring Safety Generalization Challenges of Large Language Models via Code.
CoRR, 2024

MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model.
CoRR, 2024

Harmonizing Visual Text Comprehension and Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Mutual Positive and Negative Learning for Weakly-supervised Point Cloud Semantic Segmentation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Prompt Gradient Projection for Continual Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

COTR: Compact Occupancy TRansformer for Vision-Based 3D Occupancy Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Isolation and Integration: A Strong Pre-trained Model-Based Paradigm for Class-Incremental Learning.
Proceedings of the Computational Visual Media - 12th International Conference, 2024

Explore and Enhance the Generalization of Anomaly DeepFake Detection.
Proceedings of the Computational Visual Media - 12th International Conference, 2024

Leveraging Panoptic Prior for 3D Zero-Shot Semantic Understanding Within Language Embedded Radiance Fields.
Proceedings of the Computational Visual Media - 12th International Conference, 2024

Image-text Retrieval with Main Semantics Consistency.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Domain Alignment with Large Vision-language Models for Cross-domain Remote Sensing Image Retrieval.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Learning Task-Aware Language-Image Representation for Class-Incremental Object Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Continuous Piecewise-Affine Based Motion Model for Image Animation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Domain-Hallucinated Updating for Multi-Domain Face Anti-spoofing.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Positive-Negative Receptive Field Reasoning for Omni-Supervised 3D Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Multi-domain mixup for scenario-universal face anti-spoofing.
Comput. Graph., November, 2023

HSNet: hierarchical semantics network for scene parsing.
Vis. Comput., July, 2023

Mirror Detection With the Visual Chirality Cue.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

A new method proposed to Melanoma-skin cancer lesion detection and segmentation based on hybrid convolutional neural network.
Multim. Tools Appl., March, 2023

LW-CovidNet: Automatic covid-19 lung infection detection from chest X-ray images.
IET Image Process., February, 2023

Frequency-aware Camouflaged Object Detection.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Boosting Night-Time Scene Parsing With Learnable Frequency.
IEEE Trans. Image Process., 2023

Semantic-Aware Dehazing Network With Adaptive Feature Fusion.
IEEE Trans. Cybern., 2023

Generalized Category Discovery in Semantic Segmentation.
CoRR, 2023

Unveiling the Power of CLIP in Unsupervised Visible-Infrared Person Re-Identification.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Instance and Category Supervision are Alternate Learners for Continual Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Gradient Projection Continual Learning: Stability/Plasticity Feature Space Decoupling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning to Detect Mirrors from Videos via Dual Correspondences.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Boosting Semi-Supervised Learning by Exploiting All Unlabeled Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multi-Centroid Task Descriptor for Dynamic Class Incremental Inference.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Self-supervised Contrastive Feature Refinement for Few-Shot Class-Incremental Learning.
Proceedings of the Computer-Aided Design and Computer Graphics, 2023

2022
Sketch-to-photo face generation based on semantic consistency preserving and similar connected component refinement.
Vis. Comput., 2022

DMT: Dynamic mutual training for semi-supervised learning.
Pattern Recognit., 2022

Image Understands Point Cloud: Weakly Supervised 3D Semantic Segmentation via Association Learning.
CoRR, 2022

Dual Windows Are Significant: Learning from Mediastinal Window and Focusing on Lung Window.
CoRR, 2022

Optimization over Disentangled Encoding: Unsupervised Cross-Domain Point Cloud Completion via Occlusion Factor Manipulation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Rethinking Efficient Lane Detection via Curve Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Dual Windows Are Significant: Learning from Mediastinal Window and Focusing on Lung Window.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Understanding Geometry for Point Cloud Segmentation via Covariance.
Proceedings of the 15th International Congress on Image and Signal Processing, 2022

2021
Night-Time Scene Parsing With a Large Real Dataset.
IEEE Trans. Image Process., 2021

Weakly-Supervised Saliency Detection via Salient Object Subitizing.
IEEE Trans. Circuits Syst. Video Technol., 2021

Single Image Deraining via detail-guided Efficient Channel Attention Network.
Comput. Graph., 2021

Novelty Detection via Contrastive Learning with Negative Data Augmentation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Self-supervised Compressed Video Action Recognition via Temporal-Consistent Sampling.
Proceedings of the Neural Information Processing - 28th International Conference, 2021

Confident Semantic Ranking Loss for Part Parsing.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Omni-Supervised Point Cloud Segmentation via Gradual Receptive Field Component Reasoning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Boundary-Aware Geometric Encoding for Semantic Segmentation of Point Clouds.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Deep multi-center learning for face alignment.
Neurocomputing, 2020

Uncertainty-Aware Consistency Regularization for Cross-Domain Semantic Segmentation.
CoRR, 2020

Semi-Supervised Semantic Segmentation via Dynamic Self-Training and Class-Balanced Curriculum.
CoRR, 2020

Night-time Semantic Segmentation with a Large Real Dataset.
CoRR, 2020

SceneEncoder: Scene-Aware Semantic Segmentation of Point Clouds with A Learnable Scene Descriptor.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Learning Object Deformation and Motion Adaption for Semi-supervised Video Object Segmentation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

A Shape-Aware Feature Extraction Module for Semantic Segmentation of 3D Point Clouds.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

2019
MCCH: A novel convex hull prior based solution for saliency detection.
Inf. Sci., 2019

FVNet: 3D Front-View Proposal Generation for Real-Time Object Detection from Point Clouds.
CoRR, 2019

Accurate and Efficient Object Detection with Context Enhancement Block.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Re-ID Driven Localization Refinement for Person Search.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning the Spiral Sharing Network with Minimum Salient Region Regression for Saliency Detection.
Proceedings of the IEEE International Conference on Acoustics, 2019

Object-Level Salience Detection by Progressively Enhanced Network.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Image Processing, 2019

FVNet: 3D Front-View Proposal Generation for Real-Time Object Detection from Point Clouds.
Proceedings of the 12th International Congress on Image and Signal Processing, 2019

2018
Facial Landmark Detection Under Large Pose.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

Multi-Path Feature Fusion Network for Saliency Detection.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Saliency Detection by Deep Network with Boundary Refinement and Global Context.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

