Yihao Ding

Orcid: 0000-0001-5065-6911

According to our database, Yihao Ding authored at least 42 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

NarrativeSense: Predicting Affective States in University Students through Smartphone Sensing and Contextual Narratives.

ACM Trans. Comput. Heal., April, 2026

MARCH: Multi-Agent Radiology Clinical Hierarchy for CT Report Generation.

CoRR, April, 2026

Deep learning based visually rich document content understanding: a survey.

Artif. Intell. Rev., April, 2026

LPL3D: LVLM-Driven Pseudo-Labeling for 3D Object Detection.

IEEE Trans. Circuits Syst. Video Technol., March, 2026

GeoChemAD: Benchmarking Unsupervised Geochemical Anomaly Detection for Mineral Exploration.

CoRR, March, 2026

ToolTree: Efficient LLM Agent Tool Planning via Dual-Feedback Monte Carlo Tree Search and Bidirectional Pruning.

CoRR, March, 2026

BRIDGE: Benchmark for multi-hop Reasoning In long multimodal Documents with Grounded Evidence.

Biao Xiang

Soyeon Caren Han

Yihao Ding

CoRR, March, 2026

Diagnosing Causal Reasoning in Vision-Language Models via Structured Relevance Graphs.

Dhita Putri Pratama

Soyeon Caren Han

Yihao Ding

CoRR, February, 2026

Statistical Verification of Medium-Access Parameterization for Power-Grid Edge Ad Hoc Sensor Networks.

CoRR, February, 2026

Docs2Synth: A Synthetic Data Trained Retriever Framework for Scanned Visually Rich Documents Understanding.

CoRR, January, 2026

Embodied intelligence for 3D understanding: A survey on 3D Scene question answering.

Inf. Fusion, 2026

SynJAC: synthetic-data-driven joint-granular adaptation and calibration for domain specific scanned document key information extraction.

Inf. Fusion, 2026

STIndex: A Context-Aware Multi-Dimensional Spatiotemporal Information Extraction System.

Proceedings of the Companion Proceedings of the ACM Web Conference 2026, 2026

Unstructured to Structured: Building Knowledge Graphs from Documents for Web Applications.

Proceedings of the Companion Proceedings of the ACM Web Conference 2026, 2026

Docs2Synth: A Synthetic Data Tuned Retriever Framework for Documents Understanding.

Proceedings of the Companion Proceedings of the ACM Web Conference 2026, 2026

A Disease-Aware Dual-Stage Framework for Chest X-ray Report Generation.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

PROPA: Toward Process-level Optimization in Visual Reasoning via Reinforcement Learning.

CoRR, November, 2025

SynDoc: A Hybrid Discriminative-Generative Framework for Enhancing Synthetic Domain-Adaptive Document Key Information Extraction.

CoRR, September, 2025

DocHop-QA: Towards Multi-Hop Reasoning over Multimodal Document Collections.

CoRR, August, 2025

A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends.

CoRR, July, 2025

Pseudo-labeling and knowledge-guided contrastive learning for radiology report generation.

J. Biomed. Informatics, 2025

VRD-IU: Lessons from Visually Rich Document Intelligence and Understanding.

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

KIEPrompter: Leveraging Lightweight Models' Predictions for Cost-Effective Key Information Extraction using Vision LLMs.

Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

Natural Language Processing in Support of Evidence-based Medicine: A Scoping Review.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Beyond Perception: Evaluating Abstract Visual Reasoning through Multi-Stage Task.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Graph neural networks for text classification: a survey.

Kunze Wang

Yihao Ding

Soyeon Caren Han

Artif. Intell. Rev., August, 2024

DAViD: Domain Adaptive Visually-Rich Document Understanding with Synthetic Insights.

CoRR, 2024

Deep Learning based Visually Rich Document Content Understanding: A Survey.

Yihao Ding

Jean Lee

Soyeon Caren Han

CoRR, 2024

PDF-MVQA: A Dataset for Multimodal Information Retrieval in PDF-based Visual Question Answering.

CoRR, 2024

M3-VRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding.

CoRR, 2024

MMVQA: A Comprehensive Dataset for Investigating Multipage Multimodal Information Retrieval in PDF-based Visual Question Answering.

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

The Language Model Can Have the Personality: Joint Learning for Personality Enhanced Language Model (Student Abstract).

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

PDFVQA: A New Dataset for Real-World VQA on PDF Documents.

CoRR, 2023

Form-NLU: Dataset for the Form Language Understanding.

CoRR, 2023

Form-NLU: Dataset for the Form Natural Language Understanding.

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

PDF-VQA: A New Dataset for Real-World VQA on PDF Documents.

Proceedings of the Machine Learning and Knowledge Discovery in Databases: Applied Data Science and Demo Track, 2023

Workshop on Document Intelligence Understanding.

Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022

V-Doc: Visual questions answers with Documents.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis.

Proceedings of the 29th International Conference on Computational Linguistics, 2022

DDI-MuG: Multi-aspect Graphs for Drug-Drug Interaction Extraction.

Proceedings of the 13th International Workshop on Health Text Mining and Information Analysis, 2022
