Hao Zhang

Orcid: 0000-0002-3572-7053

Affiliations:
  • Xi'an Jiaotong University, Institute of Artificial Intelligence and Robotics, China
  • Shanghai AI Laboratory, Xuhui, China


According to our database, Hao Zhang authored at least 14 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
B-AVIBench: Toward Evaluating the Robustness of Large Vision-Language Model on Black-Box Adversarial Visual-Instructions.
IEEE Trans. Inf. Forensics Secur., 2025

OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Open-Vocabulary Animal Keypoint Detection with Semantic-Feature Matching.
Int. J. Comput. Vis., December, 2024

HF-HRNet: A Simple Hardware Friendly High-Resolution Network.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

FMGNet: An efficient feature-multiplex group network for real-time vision task.
Pattern Recognit., 2024

GATE OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation.
CoRR, 2024

HRVMamba: High-Resolution Visual State Space Model for Dense Prediction.
CoRR, 2024

ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Capability for Large Vision-Language Models.
CoRR, 2024

AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Adversarial Visual-Instructions.
CoRR, 2024

ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
SCGNet: Shifting and Cascaded Group Network.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face.
CoRR, 2023

