Yihua Shao

Orcid: 0009-0002-0475-7142

According to our database¹, Yihua Shao authored at least 34 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

OralGPT-Plus: Learning to Use Visual Tools via Reinforcement Learning for Panoramic X-ray Analysis.

[BibT_eX]

[DOI]

CoRR, March, 2026

Geometry OR Tracker: Universal Geometric Operating Room Tracking.

[BibT_eX]

[DOI]

CoRR, March, 2026

Do MLLMs Really Understand Space? A Mathematical Reasoning Evaluation.

[BibT_eX]

[DOI]

CoRR, February, 2026

Nüwa: Mending the Spatial Integrity Torn by VLM Token Pruning.

[BibT_eX]

[DOI]

CoRR, February, 2026

StyMam: A Mamba-Based Generator for Artistic Style Transfer.

[BibT_eX]

[DOI]

CoRR, January, 2026

Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation.

[BibT_eX]

[DOI]

CoRR, January, 2026

Enhancing point cloud feature representation via historical node state increments in graph neural networks.

[BibT_eX]

[DOI]

Pattern Recognit., 2026

3DSceneEditor: Controllable 3D Scene Editing with Gaussian Splatting.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

TR-DQ: Time-Rotation Diffusion Quantization.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

ICM-Fusion: In-Context Meta-Optimized LoRA Fusion for Multi-Task Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

RAGAR: Retrieval Augmented Personalized Image Generation Guided by Recommendation.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

MoFu: Scale-Aware Modulation and Fourier Fusion for Multi-Subject Video Generation.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

1 + 1 > 2: Detector-Empowered Video Large Language Model for Spatio-Temporal Grounding and Reasoning.

[BibT_eX]

[DOI]

CoRR, December, 2025

CurriFlow: Curriculum-Guided Depth Fusion with Optical Flow-Based Temporal Alignment for 3D Semantic Scene Completion.

[BibT_eX]

[DOI]

CoRR, October, 2025

MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook.

[BibT_eX]

[DOI]

CoRR, September, 2025

PDFT: parameter-diminish fine-tuning for transformer-based models.

[BibT_eX]

[DOI]

Vis. Comput., July, 2025

EventVAD: Training-Free Event-Aware Video Anomaly Detection.

[BibT_eX]

[DOI]

CoRR, April, 2025

WonderVerse: Extendable 3D Scene Generation with Video Generative Models.

[BibT_eX]

[DOI]

CoRR, March, 2025

GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts.

[BibT_eX]

[DOI]

CoRR, March, 2025

TR-DQ: Time-Rotation Diffusion Quantization.

[BibT_eX]

[DOI]

CoRR, March, 2025

EventVAD: Training-Free Event-Aware Video Anomaly Detection.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

AccidentBlip: Agent of Accident Warning Based on MA-Former.

[BibT_eX]

[DOI]

Proceedings of the IEEE Intelligent Vehicles Symposium, 2025

In-Context Meta LoRA Generation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Renderworld: World Model with Self-Supervised 3D Label.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

AdsQA: Towards Advertisement Video Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MambaIC: State Space Models for High-Performance Learned Image Compression.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

TDADL-IE: A Deep Learning-Driven Cryptographic Architecture for Medical Image Security.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2025

HAFT: Hierarchical Attentional Fusion Transformer for Adaptive Feature Fusion in Medical Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2025

2024

3DSceneEditor: Controllable 3D Scene Editing with Gaussian Splatting.

[BibT_eX]

[DOI]

CoRR, 2024

GWQ: Gradient-Aware Weight Quantization for Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

AccidentBlip2: Accident Detection With Multi-View MotionBlip2.

[BibT_eX]

[DOI]

CoRR, 2024

Yihua Shao

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...