Xi Chen

Orcid: 0000-0003-0165-6426

Affiliations:

University of Hong Kong, Department of Computer Science, Hong Kong

According to our database, Xi Chen authored at least 67 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

Causal Prompts for Open-Vocabulary Video Instance Segmentation.

IEEE Trans. Pattern Anal. Mach. Intell., July, 2026

PanoWorld: Towards Spatial Supersensing in 360° Panorama World.

CoRR, May, 2026

SURF: Signature-Retained Fast Video Generation.

CoRR, March, 2026

SaFeR-ToolKit: Structured Reasoning via Virtual Tool Calling for Multimodal Safety.

CoRR, March, 2026

SAM4Dcap: Training-free Biomechanical Twin System from Monocular Video.

CoRR, February, 2026

RareAlert: Aligning heterogeneous large language model reasoning for early rare disease risk screening.

CoRR, January, 2026

Evaluating the Diagnostic Classification Ability of Multimodal Large Language Models: Insights from the Osteoarthritis Initiative.

CoRR, January, 2026

GDRO: Group-level Reward Post-training Suitable for Diffusion Models.

CoRR, January, 2026

2025

Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection.

CoRR, December, 2025

MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives.

CoRR, December, 2025

From Illusion to Intention: Visual Rationale Learning for Vision-Language Reasoning.

CoRR, November, 2025

KOM: A Multi-Agent Artificial Intelligence System for Precision Management of Knee Osteoarthritis (KOA).

CoRR, November, 2025

PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning.

CoRR, October, 2025

Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents.

CoRR, October, 2025

From Noisy Traces to Stable Gradients: Bias-Variance Optimized Preference Optimization for Aligning Large Reasoning Models.

CoRR, October, 2025

AnyDoor: Zero-Shot Image Customization With Region-to-Region Reference.

IEEE Trans. Pattern Anal. Mach. Intell., August, 2025

Animate-X++: Universal Character Image Animation with Dynamic Backgrounds.

CoRR, August, 2025

UniDetector: Towards Universal Object Detection With Heterogeneous Supervision.

IEEE Trans. Pattern Anal. Mach. Intell., July, 2025

MiCo: Multi-image Contrast for Reinforcement Visual Reasoning.

CoRR, June, 2025

FocalClick-XL: Towards Unified and High-quality Interactive Segmentation.

Xi Chen

Hengshuang Zhao

CoRR, June, 2025

Modular Customization of Diffusion Models via Blockwise-Parameterized Low-Rank Adaptation.

CoRR, March, 2025

Effective LLM Knowledge Learning via Model Generalization.

CoRR, March, 2025

Enhancing diagnostic capability with multi-agents conversational large language models.

npj Digit. Medicine, 2025

DiffCamera: Arbitrary Refocusing on Images.

Proceedings of the SIGGRAPH Asia 2025 Conference Papers, 2025

VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control.

Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

DreamMask: Boosting Open-vocabulary Panoptic Segmentation with Synthetic Data.

Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

FashionComposer: Compositional Fashion Image Generation.

Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

LayerFlow: A Unified Model for Layer-aware Video Generation.

Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

PlayerOne: Egocentric World Simulator.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

ROSE: Remove Objects with Side Effects in Videos.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

iDPA: Instance Decoupled Prompt Attention for Incremental Medical Object Detection.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

DiffDoctor: Diagnosing Image Diffusion Models Before Treating.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Enhancing LLM Knowledge Learning through Generalization.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

ObjectMover: Generative Object Movement with Video Prior.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

AHOR: Online Multi-Object Tracking With Authenticity Hierarchizing and Occlusion Recovery.

IEEE Trans. Circuits Syst. Video Technol., September, 2024

ScopeViT: Scale-Aware Vision Transformer.

Pattern Recognit., 2024

Prompt engineering in consistency and reliability with the evidence-based guideline for LLMs.

npj Digit. Medicine, 2024

Continuous-Time Digital Twin with Analogue Memristive Neural Ordinary Differential Equation Solver.

CoRR, 2024

Efficient and accurate neural field reconstruction using resistive memory.

CoRR, 2024

Resistive Memory-based Neural Differential Equation Solver for Score-based Diffusion Model.

CoRR, 2024

OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding.

CoRR, 2024

Evaluating and Enhancing Large Language Models Performance in Domain-specific Medicine: Osteoarthritis Management with DocOA.

CoRR, 2024

Triplet Attention Transformer for Spatiotemporal Predictive Learning.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Zero-shot Image Editing with Reference Imitation.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Object-Level Pseudo-3D Lifting for Distance-Aware Tracking.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SAMP: Adapting Segment Anything Model for Pose Estimation.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

LogoSticker: Inserting Logos Into Diffusion Models for Customized Generation.

Proceedings of the Computer Vision - ECCV 2024, 2024

OpenIns3D: Snap and Lookup for 3D Open-Vocabulary Instance Segmentation.

Proceedings of the Computer Vision - ECCV 2024, 2024

LivePhoto: Real Image Animation with Text-Guided Motion Control.

Proceedings of the Computer Vision - ECCV 2024, 2024

PredToken: Predicting Unknown Tokens and Beyond with Coarse-to-Fine Iterative Decoding.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AnyDoor: Zero-shot Object-level Image Customization.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Wavelet-Driven Spatiotemporal Predictive Learning: Bridging Frequency and Time Variations.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Echo state graph neural networks with analogue random resistive memory arrays.

Nat. Mach. Intell., February, 2023

A Lightweight Clustering Framework for Unsupervised Semantic Segmentation.

Yau Shing Jonathan Cheung

Xi Chen

Lihe Yang

Hengshuang Zhao

CoRR, 2023

Pruning random resistive memory for optimizing analogue AI.

CoRR, 2023

ScribbleSeg: Scribble-based Interactive Image Segmentation.

Xi Chen

Yau Shing Jonathan Cheung

Ser-Nam Lim

Hengshuang Zhao

CoRR, 2023

Uni3DETR: Unified 3D Detection Transformer.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Open-vocabulary Panoptic Segmentation with Embedding Modulation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Detecting Everything in the Open World: Towards Universal Object Detection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

iNL: Implicit non-local network.

Neurocomputing, 2022

Convolutional Echo-State Network with Random Memristors for Spatiotemporal Signal Classification.

Adv. Intell. Syst., 2022

FocalClick: Towards Practical Interactive Image Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Echo state graph neural networks with analogue random resistor arrays.

CoRR, 2021

2020

State-Aware Tracker for Real-Time Video Object Segmentation.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Boundary-Aware Network for Fast and High-Accuracy Portrait Segmentation.

Xi Chen

Donglian Qi

Jianxin Shen

CoRR, 2019
