Xi Chen

Orcid: 0009-0008-5008-4720

Affiliations:
  • University of Hong Kong, Department of Computer Science, Hong Kong


According to our database, Xi Chen authored at least 56 papers between 2019 and 2026.

Bibliography

2026
SURF: Signature-Retained Fast Video Generation.
CoRR, March, 2026

GDRO: Group-level Reward Post-training Suitable for Diffusion Models.
CoRR, January, 2026

2025
Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection.
CoRR, December, 2025

MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives.
CoRR, December, 2025

From Illusion to Intention: Visual Rationale Learning for Vision-Language Reasoning.
CoRR, November, 2025

PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning.
CoRR, October, 2025

Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents.
CoRR, October, 2025

From Noisy Traces to Stable Gradients: Bias-Variance Optimized Preference Optimization for Aligning Large Reasoning Models.
CoRR, October, 2025

AnyDoor: Zero-Shot Image Customization With Region-to-Region Reference.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2025

ROSE: Remove Objects with Side Effects in Videos.
CoRR, August, 2025

Animate-X++: Universal Character Image Animation with Dynamic Backgrounds.
CoRR, August, 2025

UniDetector: Towards Universal Object Detection With Heterogeneous Supervision.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2025

MiCo: Multi-image Contrast for Reinforcement Visual Reasoning.
CoRR, June, 2025

FocalClick-XL: Towards Unified and High-quality Interactive Segmentation.
CoRR, June, 2025

PlayerOne: Egocentric World Simulator.
CoRR, June, 2025

Modular Customization of Diffusion Models via Blockwise-Parameterized Low-Rank Adaptation.
CoRR, March, 2025

Effective LLM Knowledge Learning via Model Generalization.
CoRR, March, 2025

DiffCamera: Arbitrary Refocusing on Images.
Proceedings of the SIGGRAPH Asia 2025 Conference Papers, 2025

VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

DreamMask: Boosting Open-vocabulary Panoptic Segmentation with Synthetic Data.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

FashionComposer: Compositional Fashion Image Generation.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

LayerFlow: A Unified Model for Layer-aware Video Generation.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

DiffDoctor: Diagnosing Image Diffusion Models Before Treating.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Enhancing LLM Knowledge Learning through Generalization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

ObjectMover: Generative Object Movement with Video Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
AHOR: Online Multi-Object Tracking With Authenticity Hierarchizing and Occlusion Recovery.
IEEE Trans. Circuits Syst. Video Technol., September, 2024

ScopeViT: Scale-Aware Vision Transformer.
Pattern Recognit., 2024

Continuous-Time Digital Twin with Analogue Memristive Neural Ordinary Differential Equation Solver.
CoRR, 2024

Efficient and accurate neural field reconstruction using resistive memory.
CoRR, 2024

Resistive Memory-based Neural Differential Equation Solver for Score-based Diffusion Model.
CoRR, 2024

OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding.
CoRR, 2024

Triplet Attention Transformer for Spatiotemporal Predictive Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Zero-shot Image Editing with Reference Imitation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Object-Level Pseudo-3D Lifting for Distance-Aware Tracking.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SAMP: Adapting Segment Anything Model for Pose Estimation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

LogoSticker: Inserting Logos Into Diffusion Models for Customized Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

OpenIns3D: Snap and Lookup for 3D Open-Vocabulary Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

LivePhoto: Real Image Animation with Text-Guided Motion Control.
Proceedings of the Computer Vision - ECCV 2024, 2024

PredToken: Predicting Unknown Tokens and Beyond with Coarse-to-Fine Iterative Decoding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AnyDoor: Zero-shot Object-level Image Customization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Wavelet-Driven Spatiotemporal Predictive Learning: Bridging Frequency and Time Variations.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Echo state graph neural networks with analogue random resistive memory arrays.
Nat. Mac. Intell., February, 2023

A Lightweight Clustering Framework for Unsupervised Semantic Segmentation.
CoRR, 2023

Pruning random resistive memory for optimizing analogue AI.
CoRR, 2023

ScribbleSeg: Scribble-based Interactive Image Segmentation.
CoRR, 2023

Uni3DETR: Unified 3D Detection Transformer.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Open-vocabulary Panoptic Segmentation with Embedding Modulation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Detecting Everything in the Open World: Towards Universal Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
iNL: Implicit non-local network.
Neurocomputing, 2022

Convolutional Echo-State Network with Random Memristors for Spatiotemporal Signal Classification.
Adv. Intell. Syst., 2022

FocalClick: Towards Practical Interactive Image Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Echo state graph neural networks with analogue random resistor arrays.
CoRR, 2021

2020
State-Aware Tracker for Real-Time Video Object Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Boundary-Aware Network for Fast and High-Accuracy Portrait Segmentation.
CoRR, 2019

