Yanbo Wang

Affiliations:
  • Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, United Arab Emirates


According to our database1, Yanbo Wang authored at least 17 papers between 2024 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Guardian-as-an-Advisor: Advancing Next-Generation Guardian Models for Trustworthy LLMs.
CoRR, April, 2026

2025
DyFlow: Dynamic Workflow Framework for Agentic Reasoning.
CoRR, September, 2025

ChemOrch: Empowering LLMs with Chemical Intelligence via Synthetic Instructions.
CoRR, September, 2025

SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models.
CoRR, May, 2025

Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs.
CoRR, May, 2025

Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models.
CoRR, May, 2025

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective.
CoRR, February, 2025

Breaking Focus: Contextual Distraction Curse in Large Language Models.
CoRR, February, 2025

DyFlow: Dynamic Workflow Framework for Agentic Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Adaptive Distraction: Probing LLM Contextual Robustness with Automated Tree Search.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

AdaReasoner: Adaptive Reasoning Enables More Flexible Thinking.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

TRUSTEVAL: A Dynamic Evaluation Toolkit on Trustworthiness of Generative Foundation Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Under the Shadow of Babel: How Language Shapes Reasoning in LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Cross-Lingual Pitfalls: Automatic Probing Cross-Lingual Weakness of Multilingual Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
AutoBench-V: Can Large Vision-Language Models Benchmark Themselves?
CoRR, 2024


  Loading...