Xin Tan

Vis. Comput., September, 2025

Optimal Transport with Arbitrary Prior for Dynamic Resolution Network.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., September, 2025

PFDepth: Heterogeneous Pinhole-Fisheye Joint Depth Estimation via Distortion-aware Gaussian-Splatted Volumetric Fusion.

[BibT_eX]

[DOI]

CoRR, September, 2025

Bias to Balance: New-Knowledge-Preferred Few-Shot Class-Incremental Learning via Transition Calibration.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., August, 2025

NaviMaster: Learning a Unified Policy for GUI and Embodied Navigation Tasks.

[BibT_eX]

[DOI]

CoRR, August, 2025

T2S: Tokenized Skill Scaling for Lifelong Imitation Learning.

[BibT_eX]

[DOI]

CoRR, August, 2025

LidarPainter: One-Step Away From Any Lidar View To Novel Guidance.

[BibT_eX]

[DOI]

CoRR, July, 2025

From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning.

[BibT_eX]

[DOI]

CoRR, July, 2025

YouTube-Occ: Learning Indoor 3D Semantic Occupancy Prediction from YouTube Videos.

[BibT_eX]

[DOI]

CoRR, June, 2025

UniForward: Unified 3D Scene and Semantic Field Reconstruction via Feed-Forward Gaussian Splatting from Only Sparse-View Images.

[BibT_eX]

[DOI]

CoRR, June, 2025

SHTOcc: Effective 3D Occupancy Prediction with Sparse Head and Tail Voxels.

[BibT_eX]

[DOI]

Qiucheng Yu

Yuan Xie

CoRR, May, 2025

DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation.

[BibT_eX]

[DOI]

CoRR, May, 2025

GEOcc: Geometrically Enhanced 3D Occupancy Network With Implicit-Explicit Depth Fusion and Contextual Self-Supervision.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., April, 2025

FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents.

[BibT_eX]

[DOI]

CoRR, April, 2025

IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval.

[BibT_eX]

[DOI]

CoRR, April, 2025

Learnable scene prior for point cloud semantic segmentation.

[BibT_eX]

[DOI]

Vis. Comput., January, 2025

AttentionPainter: An Efficient and Adaptive Stroke Predictor for Scene Painting.

[BibT_eX]

[DOI]

IEEE Trans. Vis. Comput. Graph., 2025

WV-LUT: Wide Vision Lookup Tables for Real-Time Low-Light Image Enhancement.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2025

DHP-SLAM: A real-time visual slam system with high positioning accuracy under dynamic environment.

[BibT_eX]

[DOI]

Displays, 2025

Domain-Incremental Learning Paradigm for scene understanding via Pseudo-Replay Generation.

[BibT_eX]

[DOI]

Graph. Model., 2025

Semi-supervised Lip-Tongue segmentation with Boundary Region Contrast Sampling.

[BibT_eX]

[DOI]

Appl. Soft Comput., 2025

EyeSeg: An Uncertainty-Aware Eye Segmentation Framework for AR/VR.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Large Continual Instruction Assistant.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

LFNet: Cross-Modal LiDAR-Fisheye Fusion Network for 3D Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Prototype Alignment with LoRA Fusion for Class-Incremental Learning.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Efficient Prototypical Classifier for Class-Incremental Learning.

[BibT_eX]

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

MOS: Modeling Object-Scene Associations in Generalized Category Discovery.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

One-for-More: Continual Diffusion Model for Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

TAD: A Plug-and-Play Task Arithmetic Approach for Augmenting Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Computational Visual Media - 13th International Conference, 2025

DepthFisheye: Efficient Fine-Tuning of Depth Estimation Models for Fisheye Cameras.

[BibT_eX]

[DOI]

Proceedings of the Computational Visual Media - 13th International Conference, 2025

DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view Input.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

FastLGS: Speeding Up Language Embedded Gaussians with Feature Grid Mapping.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Uni-to-Multi Modal Knowledge Distillation for Bidirectional LiDAR-Camera Semantic Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

MSPAN: Multi-scale pyramid attention network for efficient skin cancer lesion segmentation.

[BibT_eX]

[DOI]

IET Image Process., May, 2024

Glass Makes Blurs: Learning the Visual Blurriness for Glass Surface Detection.

[BibT_eX]

[DOI]

IEEE Trans. Ind. Informatics, April, 2024

CSFwinformer: Cross-Space-Frequency Window Transformer for Mirror Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2024

PIG: Prompt Images Guidance for Night-Time Scene Parsing.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2024

Image Understands Point Cloud: Weakly Supervised 3D Semantic Segmentation via Association Learning.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2024

Cross-coupled prompt learning for few-shot image recognition.

[BibT_eX]

[DOI]

Displays, 2024

Label-aware aggregation on heterophilous graphs for node representation learning.

[BibT_eX]

[DOI]

Displays, 2024

Diffusion Implicit Policy for Unpaired Scene-aware Motion Synthesis.

[BibT_eX]

[DOI]

CoRR, 2024

Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation.

[BibT_eX]

[DOI]

CoRR, 2024

LLaCA: Multimodal Large Language Continual Assistant.

[BibT_eX]

[DOI]

CoRR, 2024

Mutual Information Guided Optimal Transport for Unsupervised Visible-Infrared Person Re-identification.

[BibT_eX]

[DOI]

CoRR, 2024

Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining.

[BibT_eX]

[DOI]

CoRR, 2024

FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping.

[BibT_eX]

[DOI]

CoRR, 2024

Gradient Projection For Parameter-Efficient Continual Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Efficient Multimodal Large Language Models: A Survey.

[BibT_eX]

[DOI]

CoRR, 2024

Exploring Safety Generalization Challenges of Large Language Models via Code.

[BibT_eX]

[DOI]

CoRR, 2024

MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model.

[BibT_eX]

[DOI]

CoRR, 2024

Harmonizing Visual Text Comprehension and Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Mutual Positive and Negative Learning for Weakly-supervised Point Cloud Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Prompt Gradient Projection for Continual Learning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

COTR: Compact Occupancy TRansformer for Vision-Based 3D Occupancy Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Isolation and Integration: A Strong Pre-trained Model-Based Paradigm for Class-Incremental Learning.

[BibT_eX]

[DOI]

Proceedings of the Computational Visual Media - 12th International Conference, 2024

Explore and Enhance the Generalization of Anomaly DeepFake Detection.

[BibT_eX]

[DOI]

Proceedings of the Computational Visual Media - 12th International Conference, 2024

Leveraging Panoptic Prior for 3D Zero-Shot Semantic Understanding Within Language Embedded Radiance Fields.

[BibT_eX]

[DOI]

Proceedings of the Computational Visual Media - 12th International Conference, 2024

Image-text Retrieval with Main Semantics Consistency.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Domain Alignment with Large Vision-language Models for Cross-domain Remote Sensing Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Learning Task-Aware Language-Image Representation for Class-Incremental Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Continuous Piecewise-Affine Based Motion Model for Image Animation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Domain-Hallucinated Updating for Multi-Domain Face Anti-spoofing.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Positive-Negative Receptive Field Reasoning for Omni-Supervised 3D Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Multi-domain mixup for scenario-universal face anti-spoofing.

[BibT_eX]

[DOI]

Comput. Graph., November, 2023

HSNet: hierarchical semantics network for scene parsing.

[BibT_eX]

[DOI]

Vis. Comput., July, 2023

Mirror Detection With the Visual Chirality Cue.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

A new method proposed to Melanoma-skin cancer lesion detection and segmentation based on hybrid convolutional neural network.

[BibT_eX]

[DOI]

Multim. Tools Appl., March, 2023

LW-CovidNet: Automatic covid-19 lung infection detection from chest X-ray images.

[BibT_eX]

[DOI]

IET Image Process., February, 2023

Frequency-aware Camouflaged Object Detection.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2023

Boosting Night-Time Scene Parsing With Learnable Frequency.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2023

Semantic-Aware Dehazing Network With Adaptive Feature Fusion.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2023

Generalized Category Discovery in Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

Unveiling the Power of CLIP in Unsupervised Visible-Infrared Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Instance and Category Supervision are Alternate Learners for Continual Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Gradient Projection Continual Learning: Stability/Plasticity Feature Space Decoupling.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning to Detect Mirrors from Videos via Dual Correspondences.

[BibT_eX]

[DOI]

Jiaying Lin

Rynson W. H. Lau

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Boosting Semi-Supervised Learning by Exploiting All Unlabeled Data.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multi-Centroid Task Descriptor for Dynamic Class Incremental Inference.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Self-supervised Contrastive Feature Refinement for Few-Shot Class-Incremental Learning.

[BibT_eX]

[DOI]

Proceedings of the Computer-Aided Design and Computer Graphics, 2023

2022

Sketch-to-photo face generation based on semantic consistency preserving and similar connected component refinement.

[BibT_eX]

[DOI]

Vis. Comput., 2022

DMT: Dynamic mutual training for semi-supervised learning.

[BibT_eX]

[DOI]

Pattern Recognit., 2022

Image Understands Point Cloud: Weakly Supervised 3D Semantic Segmentation via Association Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Dual Windows Are Significant: Learning from Mediastinal Window and Focusing on Lung Window.

[BibT_eX]

[DOI]

Qiuli Wang