Diandian Guo

Orcid: 0009-0002-8468-3285

According to our database1, Diandian Guo authored at least 22 papers between 2023 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
CRAFT: Calibrated Reasoning with Answer-Faithful Traces via Reinforcement Learning for Multi-Hop Question Answering.
CoRR, February, 2026

MuVaC: AVariational Causal Framework for Multimodal Sarcasm Understanding in Dialogues.
CoRR, January, 2026

PRISMA: Reinforcement Learning Guided Two-Stage Policy Optimization in Multi-Agent Architecture for Open-Domain Multi-Hop Question Answering.
CoRR, January, 2026

MuVaC: A Variational Causal Framework for Multimodal Sarcasm Understanding in Dialogues.
Proceedings of the ACM Web Conference 2026, 2026

2025
Hot-Swap MarkBoard: An Efficient Black-box Watermarking Approach for Large-scale Model Distribution.
CoRR, July, 2025

Benchmarking Laparoscopic Surgical Image Restoration and Beyond.
CoRR, May, 2025

Synergistic Bleeding Region and Point Detection in Surgical Videos.
CoRR, March, 2025

S²Former-OR: Single-Stage Bi-Modal Transformer for Scene Graph Generation in OR.
IEEE Trans. Medical Imaging, January, 2025

Efficient frequency-decomposed transformer via large vision model guidance for surgical image desmoking.
Comput. Medical Imaging Graph., 2025

See Better, Say Better: Vision-Augmented Decoding for Mitigating Hallucinations in Large Vision-Language Models.
Proceedings of the Natural Language Processing and Chinese Computing, 2025

Hot-Swap MarkBoard: An Efficient Black-box Watermarking Approach for Large-scale Model Distribution.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

CASD: Counterfactual Augmentation for Social Bot Detection on Twitter.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Multi-View Incongruity Learning for Multimodal Sarcasm Detection.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Resection with Pringle Maneuver.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Can Multimodal Large Language Model Think Analogically?
CoRR, 2024

Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Resections with Pringle Maneuver.
CoRR, 2024

S^2Former-OR: Single-Stage Bimodal Transformer for Scene Graph Generation in OR.
CoRR, 2024

Tri-Modal Confluence with Temporal Dynamics for Scene Graph Generation in Operating Rooms.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Towards Pragmatic Semantic Image Synthesis for Urban Scenes.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2023

A Semi-Paired Approach for Label-to-Image Translation.
Proceedings of the IEEE International Conference on Image Processing, 2023

Curvature-Driven Knowledge Graph Embedding for Link Prediction.
Proceedings of the 26th International Conference on Computer Supported Cooperative Work in Design, 2023


  Loading...