Xi Chen

Orcid: 0009-0008-5008-4720

Affiliations:
  • University of Hong Kong, Department of Computer Science, Hong Kong


According to our database, Xi Chen authored at least 49 papers between 2019 and 2025.

Bibliography

2025
PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning.
CoRR, October, 2025

Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents.
CoRR, October, 2025

From Noisy Traces to Stable Gradients: Bias-Variance Optimized Preference Optimization for Aligning Large Reasoning Models.
CoRR, October, 2025

DiffCamera: Arbitrary Refocusing on Images.
CoRR, September, 2025

AnyDoor: Zero-Shot Image Customization With Region-to-Region Reference.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2025

ROSE: Remove Objects with Side Effects in Videos.
CoRR, August, 2025

Animate-X++: Universal Character Image Animation with Dynamic Backgrounds.
CoRR, August, 2025

UniDetector: Towards Universal Object Detection With Heterogeneous Supervision.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2025

MiCo: Multi-image Contrast for Reinforcement Visual Reasoning.
CoRR, June, 2025

FocalClick-XL: Towards Unified and High-quality Interactive Segmentation.
CoRR, June, 2025

TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization.
CoRR, June, 2025

PlayerOne: Egocentric World Simulator.
CoRR, June, 2025

LayerFlow: A Unified Model for Layer-aware Video Generation.
CoRR, June, 2025

Modular Customization of Diffusion Models via Blockwise-Parameterized Low-Rank Adaptation.
CoRR, March, 2025

Effective LLM Knowledge Learning via Model Generalization.
CoRR, March, 2025

DiffDoctor: Diagnosing Image Diffusion Models Before Treating.
CoRR, January, 2025

DreamMask: Boosting Open-vocabulary Panoptic Segmentation with Synthetic Data.
CoRR, January, 2025

VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control.
CoRR, January, 2025

ObjectMover: Generative Object Movement with Video Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
AHOR: Online Multi-Object Tracking With Authenticity Hierarchizing and Occlusion Recovery.
IEEE Trans. Circuits Syst. Video Technol., September, 2024

ScopeViT: Scale-Aware Vision Transformer.
Pattern Recognit., 2024

FashionComposer: Compositional Fashion Image Generation.
CoRR, 2024

Continuous-Time Digital Twin with Analogue Memristive Neural Ordinary Differential Equation Solver.
CoRR, 2024

Efficient and accurate neural field reconstruction using resistive memory.
CoRR, 2024

Resistive Memory-based Neural Differential Equation Solver for Score-based Diffusion Model.
CoRR, 2024

OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding.
CoRR, 2024

Triplet Attention Transformer for Spatiotemporal Predictive Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Zero-shot Image Editing with Reference Imitation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Object-Level Pseudo-3D Lifting for Distance-Aware Tracking.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SAMP: Adapting Segment Anything Model for Pose Estimation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

LogoSticker: Inserting Logos Into Diffusion Models for Customized Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

OpenIns3D: Snap and Lookup for 3D Open-Vocabulary Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

LivePhoto: Real Image Animation with Text-Guided Motion Control.
Proceedings of the Computer Vision - ECCV 2024, 2024

PredToken: Predicting Unknown Tokens and Beyond with Coarse-to-Fine Iterative Decoding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AnyDoor: Zero-shot Object-level Image Customization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Wavelet-Driven Spatiotemporal Predictive Learning: Bridging Frequency and Time Variations.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Echo state graph neural networks with analogue random resistive memory arrays.
Nat. Mac. Intell., February, 2023

A Lightweight Clustering Framework for Unsupervised Semantic Segmentation.
CoRR, 2023

Pruning random resistive memory for optimizing analogue AI.
CoRR, 2023

ScribbleSeg: Scribble-based Interactive Image Segmentation.
CoRR, 2023

Uni3DETR: Unified 3D Detection Transformer.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Open-vocabulary Panoptic Segmentation with Embedding Modulation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Detecting Everything in the Open World: Towards Universal Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
iNL: Implicit non-local network.
Neurocomputing, 2022

FocalClick: Towards Practical Interactive Image Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Echo state graph neural networks with analogue random resistor arrays.
CoRR, 2021

2020
State-Aware Tracker for Real-Time Video Object Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Boundary-Aware Network for Fast and High-Accuracy Portrait Segmentation.
CoRR, 2019

