Xiaobo Xia

Orcid: 0000-0003-3615-0919

According to our database1, Xiaobo Xia authored at least 95 papers between 2019 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
QuantClaw: Precision Where It Matters for OpenClaw.
CoRR, April, 2026

Omnimodal Dataset Distillation via High-order Proxy Alignment.
CoRR, April, 2026

Walk the Talk: Bridging the Reasoning-Action Gap for Thinking with Images via Multimodal Agentic Policy Optimization.
CoRR, April, 2026

BATQuant: Outlier-resilient MXFP4 Quantization via Learnable Block-wise Optimization.
CoRR, March, 2026

FreeAct: Freeing Activations for LLM Quantization.
CoRR, March, 2026

AUHead: Realistic Emotional Talking Head Generation via Action Units Control.
CoRR, February, 2026

SPD-Faith Bench: Diagnosing and Improving Faithfulness in Chain-of-Thought for Multimodal Large Language Models.
CoRR, February, 2026

Do All Individual Layers Help? An Empirical Study of Task-Interfering Layers in Vision-Language Models.
CoRR, February, 2026

APEX: A Decoupled Memory-based Explorer for Asynchronous Aerial Object Goal Navigation.
CoRR, February, 2026

Inject Once Survive Later: Backdooring Vision-Language-Action Models to Persist Through Downstream Fine-tuning.
CoRR, February, 2026

Lingua-SafetyBench: A Benchmark for Safety Evaluation of Multilingual Vision-Language Models.
CoRR, January, 2026

Positive-Unlabeled Reinforcement Learning Distillation for On-Premise Small Models.
CoRR, January, 2026

Generalizable Multimodal Large Language Model Editing via Invariant Trajectory Learning.
CoRR, January, 2026

What Makes Low-Bit Quantization-Aware Training Work for Reasoning LLMs? A Systematic Study.
CoRR, January, 2026

Logic Unseen: Revealing the Logical Blindspots of Vision-Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Potent but Stealthy: Rethink Profile Pollution Against Sequential Recommendation via Bi-Level Constrained Reinforcement Paradigm.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Taming Camera-Controlled Video Generation with Verifiable Geometry Reward.
CoRR, December, 2025

ITS3D: Inference-Time Scaling for Text-Guided 3D Diffusion Models.
CoRR, November, 2025

AnchorFlow: Training-Free 3D Editing via Latent Anchor-Aligned Flows.
CoRR, November, 2025

Calibrated Multimodal Representation Learning with Missing Modalities.
CoRR, November, 2025

Potent but Stealthy: Rethink Profile Pollution against Sequential Recommendation via Bi-level Constrained Reinforcement Paradigm.
CoRR, November, 2025

UtilGen: Utility-Centric Generative Data Augmentation with Dual-Level Task Adaptation.
CoRR, October, 2025

OFFSIDE: Benchmarking Unlearning Misinformation in Multimodal Large Language Models.
CoRR, October, 2025

NExT-OMNI: Towards Any-to-Any Omnimodal Foundation Models with Discrete Flow Matching.
CoRR, October, 2025

Principled Multimodal Representation Learning.
CoRR, July, 2025

LearnAlign: Reasoning Data Selection for Reinforcement Learning in Large Language Models Based on Improved Gradient Alignment.
CoRR, June, 2025

Semi-Supervised Conformal Prediction With Unlabeled Nonconformity Score.
CoRR, May, 2025

L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models.
CoRR, May, 2025

VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning.
CoRR, April, 2025

GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents.
CoRR, April, 2025

Continual Multimodal Contrastive Learning.
CoRR, March, 2025

Identifying Trustworthiness Challenges in Deep Learning Models for Continental-Scale Water Quality Prediction.
CoRR, March, 2025

A three-tier AI solution for equitable glaucoma diagnosis across China's hierarchical healthcare system.
npj Digit. Medicine, 2025

Towards Modality Generalization: A Benchmark and Prospective Analysis.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

DEEM: Diffusion models serve as the eyes of large language models for image perception.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Where, What, Why: Towards Explainable Driver Attention Prediction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

LaVin-DiT: Large Vision Diffusion Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Transferring Annotator- and Instance-Dependent Transition Matrix for Learning From Crowds.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2024

Tackling Noisy Labels With Network Parameter Additive Decomposition.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

Regularly Truncated M-Estimators for Learning With Noisy Labels.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Conditional Consistency Regularization for Semi-Supervised Multi-Label Image Classification.
IEEE Trans. Multim., 2024

Towards Modality Generalization: A Benchmark and Prospective Analysis.
CoRR, 2024

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct.
CoRR, 2024

Resultant: Incremental Effectiveness on Likelihood for Unsupervised Out-of-Distribution Detection.
CoRR, 2024

Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs.
CoRR, 2024

DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception.
CoRR, 2024

Mitigating Label Noise on Graph via Topological Sample Selection.
CoRR, 2024

Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision.
CoRR, 2024

Few-Shot Adversarial Prompt Learning on Vision-Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Refined Coreset Selection: Towards Minimal Coreset Size under Model Performance Constraints.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Mitigating Label Noise on Graphs via Topological Sample Selection.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Towards Realistic Model Selection for Semi-supervised Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

One-Shot Learning as Instruction Data Prospector for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Dynamics-aware loss for learning with label noise.
Pattern Recognit., December, 2023

Extended $T$T: Learning With Mixed Closed-Set and Open-Set Noisy Labels.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

One Shot Learning as Instruction Data Prospector for Large Language Models.
CoRR, 2023

Coreset Selection with Prioritized Multiple Objectives.
CoRR, 2023

VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence.
CoRR, 2023

Multi-Label Noise Transition Matrix Estimation with Label Correlations: Theory and Algorithm.
CoRR, 2023

Making Binary Classification from Multiple Unlabeled Datasets Almost Free of Supervision.
CoRR, 2023

Out-of-distribution Detection Learning with Unreliable Out-of-distribution Sources.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Moderate Coreset: A Universal Method of Data Selection for Real-world Data-efficient Deep Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

A Holistic View of Label Noise Transition Matrix in Deep Learning and Beyond.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Harnessing Out-Of-Distribution Examples via Augmenting Content and Style.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Holistic Label Correction for Noisy Multi-Label Classification.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Combating Noisy Labels with Sample Selection by Mining High-Discrepancy Examples.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

HumanMAC: Masked Motion Completion for Human Motion Prediction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Robust Generalization Against Photon-Limited Corruptions via Worst-Case Sharpness Minimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
A machine learning approach for predicting human shortest path task performance.
Vis. Informatics, 2022

LR-SVM+: Learning Using Privileged Information with Noisy Labels.
IEEE Trans. Multim., 2022

Pluralistic Image Completion with Probabilistic Mixture-of-Experts.
CoRR, 2022

Pluralistic Image Completion with Gaussian Mixture Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Estimating Noise Transition Matrix with Label Correlations for Noisy Multi-Label Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Out-of-Distribution Detection with An Adaptive Likelihood Ratio on Informative Hierarchical VAE.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sample-Efficient Kernel Mean Estimator with Marginalized Corrupted Data.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Objects in Semantic Topology.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Sample Selection with Uncertainty of Losses for Learning with Noisy Labels.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Selective-Supervised Contrastive Learning with Noisy Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Learning lightweight super-resolution networks with weight pruning.
Neural Networks, 2021

Kernel Mean Estimation by Marginalized Corrupted Distributions.
CoRR, 2021

Instance Correction for Learning with Open-set Noisy Labels.
CoRR, 2021

BloodCaps: A capsule network based model for the multiclassification of human peripheral blood cells.
Comput. Methods Programs Biomed., 2021

Class2Simi: A Noise Reduction Perspective on Learning with Noisy Labels.
Proceedings of the 38th International Conference on Machine Learning, 2021

Robust early-learning: Hindering the memorization of noisy labels.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Extended T: Learning with Mixed Closed-set and Open-set Noisy Labels.
CoRR, 2020

Parts-dependent Label Noise: Towards Instance-dependent Label Noise.
CoRR, 2020

Class2Simi: A New Perspective on Learning with Label Noise.
CoRR, 2020

Multi-Class Classification from Noisy-Similarity-Labeled Data.
CoRR, 2020

Part-dependent Label Noise: Towards Instance-dependent Label Noise.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Are Anchor Points Really Indispensable in Label-Noise Learning?
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019


  Loading...