Xin Wen

Orcid: 0000-0003-3898-0406

Affiliations:

University of Hong Kong, CVMI Lab, Hong Kong
Tongji University, Shanghai, China (former)

According to our database, Xin Wen authored at least 26 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Vision Foundation Models as Generalist Tokenizers for Image Generation.

CoRR, May, 2026

ComSim: Building Scalable Real-World Robot Data Generation via Compositional Simulation.

CoRR, April, 2026

Referring-Aware Visuomotor Policy Learning for Closed-Loop Manipulation.

CoRR, April, 2026

TouchGuide: Inference-Time Steering of Visuomotor Policies via Touch Guidance.

CoRR, January, 2026

2025

Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation.

CoRR, July, 2025

Equipping Vision Foundation Model with Mixture of Experts for Out-of-Distribution Detection.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

"Principal Components" Enable a New Language of Images.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Learning from Neighbors: Category Extrapolation for Long-Tail Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Granularity Matters in Long-Tail Learning.

CoRR, 2024

Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights.

CoRR, 2024

What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Can OOD Object Detectors Learn from Foundation Models?

Proceedings of the Computer Vision - ECCV 2024, 2024

What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Large-Scale 3D Representation Learning with Multi-Dataset Point Prompt Training.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Classes Are Not Equal: An Empirical Study on Image Recognition Fairness.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

CoDet: Co-occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Semi-supervised Gaussian Mixture Models for Generalized Category Discovery.

Bingchen Zhao

Xin Wen

Kai Han

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Parametric Classification for Generalized Category Discovery: A Baseline Study.

Xin Wen

Bingchen Zhao

Xiaojuan Qi

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

A Simple Parametric Classification Baseline for Generalized Category Discovery.

Xin Wen

Bingchen Zhao

Xiaojuan Qi

CoRR, 2022

Self-Supervised Visual Representation Learning with Semantic Grouping.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

Temporal Context Aggregation for Video Retrieval with Contrastive Learning.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

2020

Context Encoding for Video Retrieval with Contrastive Learning.

CoRR, 2020

Distilling Visual Priors from Self-Supervised Learning.

Bingchen Zhao

Xin Wen

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020
