Huichi Zhou

Orcid: 0009-0005-5312-6699

According to our database1, Huichi Zhou authored at least 40 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Web2BigTable: A Bi-Level Multi-Agent LLM System for Internet-Scale Information Search and Extraction.
CoRR, April, 2026

ProMMSearchAgent: A Generalizable Multimodal Search Agent Trained with Process-Oriented Rewards.
CoRR, April, 2026

DR-MMSearchAgent: Deepening Reasoning in Multimodal Search Agents.
CoRR, April, 2026

How Adversarial Environments Mislead Agentic AI?
CoRR, April, 2026

InfoSeeker: A Scalable Hierarchical Parallel Agent Framework for Web Information Seeking.
CoRR, April, 2026

Memento-Skills: Let Agents Design Agents.
CoRR, March, 2026

Unleashing Video Language Models for Fine-grained HRCT Report Generation.
CoRR, March, 2026

UIS-Digger: Towards Comprehensive Research Agent Systems for Real-world Unindexed Information Seeking.
CoRR, March, 2026

Reliable and Responsible Foundation Models: A Comprehensive Survey.
CoRR, February, 2026

TAME: A Trustworthy Test-Time Evolution of Agent Memory with Systematic Benchmarking.
CoRR, February, 2026

Reason like a radiologist: Chain-of-thought and reinforcement learning for verifiable report generation.
Medical Image Anal., 2026

Musical Score Understanding Benchmark: Evaluating Large Language Models' Comprehension of Complete Musical Scores.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

GEMA-Score: Granular Explainable Multi-Agent Scoring Framework for Radiology Report Evaluation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Training LLMs with LogicReward for Faithful and Rigorous Reasoning.
CoRR, December, 2025

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist.
CoRR, November, 2025

Beyond the Hype: A Dispassionate Look at Vision-Language Models in Medical Scenario.
IEEE Trans. Neural Networks Learn. Syst., October, 2025

SemanticShield: LLM-Powered Audits Expose Shilling Attacks in Recommender Systems.
CoRR, September, 2025

Memento: Fine-tuning LLM Agents without Fine-tuning LLMs.
CoRR, August, 2025

VeriGUI: Verifiable Long-Chain GUI Dataset.
CoRR, August, 2025

REAL-IoT: Characterizing GNN Intrusion Detection Robustness under Practical Adversarial Attack.
CoRR, July, 2025

Poster: Enhancing GNN Robustness for Network Intrusion Detection via Agent-based Analysis.
CoRR, June, 2025

Dynamic Dual Buffer with Divide-and-Conquer Strategy for Online Continual Learning.
CoRR, May, 2025

EfficientLLM: Efficiency in Large Language Models.
CoRR, May, 2025

Moral Reasoning Across Languages: The Critical Role of Low-Resource Languages in LLMs.
CoRR, April, 2025

Evaluate-and-Purify: Fortifying Code Language Models Against Adversarial Attacks Using LLM-as-a-Judge.
CoRR, April, 2025

GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation.
CoRR, March, 2025

DOTA-ME-CS: Daily Oriented Text Audio-Mandarin English-Code Switching Dataset.
CoRR, January, 2025

TrustRAG: Enhancing Robustness and Trustworthiness in RAG.
CoRR, January, 2025

Reliable and Responsible Foundation Models.
Trans. Mach. Learn. Res., 2025

Revisiting medical image retrieval via knowledge consolidation.
Medical Image Anal., 2025

Can Large Language Models Handle Numeric Constraints? A Comprehensive Study and Solutions.
Proceedings of the PRICAI 2025: Trends in Artificial Intelligence, 2025

Verifiable Format Control for Large Language Model Generations.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Can Large Language Models Improve the Adversarial Robustness of Graph Neural Networks?
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025

GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

DiffuseDef: Improved Robustness to Adversarial Attacks via Iterative Denoising.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Beyond the Hype: A dispassionate look at vision-language models in medical scenario.
CoRR, 2024

GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents.
CoRR, 2024

MPAT: Building Robust Deep Neural Networks against Textual Adversarial Attacks.
CoRR, 2024

MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Evaluating the Validity of Word-level Adversarial Attacks with Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024


  Loading...