Weihao Xuan

Orcid: 0009-0001-4271-9035

According to our database1, Weihao Xuan authored at least 53 papers between 2019 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
MixSD: Mixed Contextual Self-Distillation for Knowledge Injection.
CoRR, May, 2026

Can LLM Agents Respond to Disasters? Benchmarking Heterogeneous Geospatial Reasoning in Emergency Operations.
CoRR, May, 2026

A Versatile AI Agent for Rare Disease Diagnosis and Risk Gene Prioritization.
CoRR, May, 2026

Proteo-R1: Reasoning Foundation Models for De Novo Protein Design.
CoRR, May, 2026

The Chameleon's Limit: Investigating Persona Collapse and Homogenization in Large Language Models.
CoRR, April, 2026

Code-Switching Information Retrieval: Benchmarks, Analysis, and the Limits of Current Retrievers.
CoRR, April, 2026

Say Something Else: Rethinking Contextual Privacy as Information Sufficiency.
CoRR, April, 2026

OpenEarth-Agent: From Tool Calling to Tool Creation for Open-Environment Earth Observation.
CoRR, March, 2026

Direction-aware 3D Large Multimodal Models.
CoRR, February, 2026

Experience-Driven Multi-Agent Systems Are Training-free Context-aware Earth Observers.
CoRR, February, 2026

Sentipolis: Emotion-Aware Agents for Social Simulations.
CoRR, January, 2026

The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents.
CoRR, January, 2026

Towards Valid Student Simulation with Large Language Models.
CoRR, January, 2026

Toward Global Large Language Models in Medicine.
CoRR, January, 2026

Geo3DVQA: Evaluating Vision-Language Models for 3D Geospatial Reasoning from Aerial Imagery.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

Taming Object Hallucinations with Verified Atomic Confidence Estimation.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

LandCraft: Designing the Structured 3D Landscapes via Text Guidance.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
TeamPath: Building MultiModal Pathology Experts with Reasoning AI Copilots.
CoRR, November, 2025

Retrieval-Augmented Generation in Medicine: A Scoping Review of Technical Implementations, Clinical Applications, and Ethical Considerations.
CoRR, November, 2025

BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response.
Dataset, November, 2025

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search.
CoRR, September, 2025

Multiplayer Nash Preference Optimization.
CoRR, September, 2025

Position: The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards.
CoRR, September, 2025

VeriGUI: Verifiable Long-Chain GUI Dataset.
CoRR, August, 2025

The Invisible Leash: Why RLVR May Not Escape Its Origin.
CoRR, July, 2025

DynamicVL: Benchmarking Multimodal Large Language Models for Dynamic City Understanding.
CoRR, May, 2025

BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response.
Dataset, May, 2025

BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response.
Dataset, May, 2025

BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response.
Dataset, May, 2025

Do Reasoning Models Show Better Verbalized Calibration?
CoRR, April, 2025

BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response.
Dataset, April, 2025

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation.
CoRR, March, 2025

Is Pre-training Applicable to the Decoder for Dense Prediction?
CoRR, March, 2025

BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response.
Dataset, March, 2025

BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response.
CoRR, January, 2025

BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response.
Dataset, January, 2025

DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

LR<sup>2</sup>Depth: Large-Region Aggregation at Low Resolution for Efficient Monocular Depth Estimation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

Thinking Out Loud: Do Reasoning Models Know When They're Right?
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Seeing is Believing, but How Much? A Comprehensive Analysis of Verbalized Calibration in Vision-Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
SynRS3D : A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing Imagery.
Dataset, October, 2024

Foundation Models for Remote Sensing and Earth Observation: A Survey.
CoRR, 2024

Segment Anything with Multiple Modalities.
CoRR, 2024

Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model.
CoRR, 2024

SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing Imagery.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
MaskVO: Self-Supervised Visual Odometry with a Learnable Dynamic Mask.
Proceedings of the IEEE/SICE International Symposium on System Integration, 2022

2021
Improving Geographically Weighted Regression Considering Directional Nonstationary for Ground-Level PM2.5 Estimation.
ISPRS Int. J. Geo Inf., 2021

On a Discrete-Time Network SIS Model with Opinion Dynamics.
Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), 2021

2019
Multi-agent Interactive Prediction under Challenging Driving Scenarios.
CoRR, 2019


  Loading...