Fan Zhang

Orcid: 0000-0001-5250-7258

Affiliations:
  • Georgia Institute of Technology, Department of Electrical and Computer Engineering, Shenzhen, China


According to our database1, Fan Zhang authored at least 28 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
RoboAlign-R1: Distilled Multimodal Reward Alignment for Robot Video World Models.
CoRR, May, 2026

DrugPlayGround: Benchmarking Large Language Models and Embeddings for Drug Discovery.
CoRR, April, 2026

OMNIFLOW: A Physics-Grounded Multimodal Agent for Generalized Scientific Reasoning.
CoRR, March, 2026

3D landmark detection on human point clouds: A benchmark and a dual cascade point transformer framework.
Expert Syst. Appl., 2026

CMID: Towards Medical Visual Question Answering via Contrastive Mutual Information Decoding.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

S³-MSD: Large Vision-Language Model for Explainable and Generalizable Multi-modal Sarcasm Detection.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
NeuralOGCM: Differentiable Ocean Modeling with Learnable Physics.
CoRR, December, 2025

Rethinking Facial Expression Recognition in the Era of Multimodal Large Language Models: Benchmark, Datasets, and Beyond.
CoRR, November, 2025

Spatiotemporal Forecasting as Planning: A Model-Based Reinforcement Learning Approach with Generative World Models.
CoRR, October, 2025

Differential-Integral Neural Operator for Long-Term Turbulence Forecasting.
CoRR, September, 2025

MME-Emotion: A Holistic Evaluation Benchmark for Emotional Intelligence in Multimodal Large Language Models.
CoRR, August, 2025

EMER-Ranker: Learning to Rank Emotion Descriptions in the Absence of Ground Truth.
CoRR, July, 2025

FourierFlow: Frequency-aware Flow Matching for Generative Turbulence Modeling.
CoRR, June, 2025

Advanced long-term earth system forecasting by learning the small-scale nature.
CoRR, May, 2025

CellVerse: Do Large Language Models Really Understand Cell Biology?
CoRR, May, 2025

Facial Action Units as a Joint Dataset Training Bridge for Facial Expression Recognition.
IEEE Trans. Multim., 2025

LEAF: Unveiling two sides of the same coin in semi-supervised facial expression recognition.
Comput. Vis. Image Underst., 2025

A Survey on Multi-modal Intent Recognition: Recent Advances and New Frontiers.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Can We Trust AI Doctors? A Survey of Medical Hallucination in Large Language and Large Vision-Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

A Survey on Foundation Language Models for Single-cell Biology.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

DREAM: Decoupled Discriminative Learning with Bigraph-aware Alignment for Semi-supervised 2D-3D Cross-modal Retrieval.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
HOPE: A Hierarchical Perspective for Semi-Supervised 2D-3D Cross-Modal Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

FATE: Learning Effective Binary Descriptors With Group Fairness.
IEEE Trans. Image Process., 2024

MIMIC: Mask Image Pre-training with Mix Contrastive Fine-tuning for Facial Expression Recognition.
CoRR, 2024

Semi-supervised Knowledge Transfer Across Multi-omic Single-cell Data.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

DEMO: A Statistical Perspective for Efficient Image-Text Matching.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Fine-grained Prototypical Voting with Heterogeneous Mixup for Semi-supervised 2D-3D Cross-modal Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Semi-Supervised Multimodal Emotion Recognition with Expression MAE.
Proceedings of the 31st ACM International Conference on Multimedia, 2023


  Loading...