Xiaoxin Chen

Affiliations:

Vivo AI Lab, Shenzhen, China

According to our database, Xiaoxin Chen authored at least 36 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

SOLAR-RL: Semi-Online Long-horizon Assignment Reinforcement Learning.

CoRR, April, 2026

Caption First, VQA Second: Knowledge Density, Not Task Format, Drives Multimodal Scaling.

CoRR, April, 2026

Skill-SD: Skill-Conditioned Self-Distillation for Multi-turn LLM Agents.

CoRR, April, 2026

UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI Agents.

CoRR, February, 2026

Bi-directional Bias Attribution: Debiasing Large Language Models without Modifying Prompts.

CoRR, February, 2026

LENS: Learning to Segment Anything with Unified Reinforced Reasoning.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

VideoChat-M1: Collaborative Policy Planning for Video Understanding via Multi-Agent Reinforcement Learning.

CoRR, November, 2025

BlueLM-2.5-3B Technical Report.

CoRR, July, 2025

PixelHacker: Image Inpainting with Structural and Semantic Consistency.

CoRR, April, 2025

Progressive Visual Prompt Learning with Contrastive Feature Re-formation.

Int. J. Comput. Vis., February, 2025

UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

FCGhead: Fully Controllable Gaussian Human Heads from Monocular Videos.

Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2025

Predictive Data Selection: The Data That Predicts Is the Data That Teaches.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

ControlAR: Controllable Image Generation with Autoregressive Models.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

GenieBlue: Integrating Both Linguistic and Multimodal Capabilities for Large Language Models on Mobile Devices.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

GTA: Supervised-Guided Reinforcement Learning for Text Classification with Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

SmartBench: Is Your LLM Truly a Good Chinese Smartphone Assistant?

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Data Quality Enhancement on the Basis of Diversity with Large Language Models for Text Classification: Uncovered, Difficult, and Noisy.

Proceedings of the 31st International Conference on Computational Linguistics, 2025

EdgeInfinite: A Memory-Efficient Infinite-Context Transformer for Edge Devices.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), 2025

2024

A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models.

CoRR, 2024

Efficient Test-Time Prompt Tuning for Vision-Language Models.

CoRR, 2024

EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model.

CoRR, 2024

FAGhead: Fully Animate Gaussian Head from Monocular Videos.

CoRR, 2024

DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

ImageBind-LLM: Multi-modality Instruction Tuning.

CoRR, 2023

DPL: Decoupled Prompt Learning for Vision-Language Models.

CoRR, 2023

Progressive Visual Prompt Learning with Contrastive Feature Re-formation.

CoRR, 2023

Real-Time Image Demoiréing on Mobile Devices.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2021

EEM: An End-to-end Evaluation Metric for Scene Text Detection and Recognition.

Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Weakly-Supervised Instance Segmentation via Class-Agnostic Learning With Salient Images.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2019

Hierarchical Reinforcement Learning for Multi-agent MOBA Game.

CoRR, 2019
