We stand with Ukraine

We stand with Ukraine

Fan Zhang

Orcid: 0000-0001-5250-7258

Affiliations:

Georgia Institute of Technology, Department of Electrical and Computer Engineering, Shenzhen, China

According to our database¹, Fan Zhang authored at least 31 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

Online presence:

on orcid.org
on ieeexplore.ieee.org

On csauthors.net:

Bibliography

2026

Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, May, 2026

PnP-Corrector: A Universal Correction Framework for Coupled Spatiotemporal Forecasting.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2026

AffectGPT-RL: Revealing Roles of Reinforcement Learning in Open-Vocabulary Emotion Recognition.

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2026

RoboAlign-R1: Distilled Multimodal Reward Alignment for Robot Video World Models.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2026

DrugPlayGround: Benchmarking Large Language Models and Embeddings for Drug Discovery.

[DOI]

,

,

,

,

Teresa Head-Gordon

,

CoRR, April, 2026

OMNIFLOW: A Physics-Grounded Multimodal Agent for Generalized Scientific Reasoning.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, March, 2026

3D landmark detection on human point clouds: A benchmark and a dual cascade point transformer framework.

[DOI]

,

,

,

Expert Syst. Appl., 2026

CMID: Towards Medical Visual Question Answering via Contrastive Mutual Information Decoding.

[DOI]

,

,

,

,

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

S³-MSD: Large Vision-Language Model for Explainable and Generalizable Multi-modal Sarcasm Detection.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

NeuralOGCM: Differentiable Ocean Modeling with Learnable Physics.

[DOI]

,

,

,

,

,

,

CoRR, December, 2025

Rethinking Facial Expression Recognition in the Era of Multimodal Large Language Models: Benchmark, Datasets, and Beyond.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, November, 2025

Spatiotemporal Forecasting as Planning: A Model-Based Reinforcement Learning Approach with Generative World Models.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

Differential-Integral Neural Operator for Long-Term Turbulence Forecasting.

[DOI]

,

,

,

,

,

,

,

CoRR, September, 2025

MME-Emotion: A Holistic Evaluation Benchmark for Emotional Intelligence in Multimodal Large Language Models.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, August, 2025

EMER-Ranker: Learning to Rank Emotion Descriptions in the Absence of Ground Truth.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, July, 2025

FourierFlow: Frequency-aware Flow Matching for Generative Turbulence Modeling.

[DOI]

,

,

,

,

CoRR, June, 2025

Advanced long-term earth system forecasting by learning the small-scale nature.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2025

CellVerse: Do Large Language Models Really Understand Cell Biology?

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, May, 2025

Facial Action Units as a Joint Dataset Training Bridge for Facial Expression Recognition.

[DOI]

,

,

,

,

IEEE Trans. Multim., 2025

LEAF: Unveiling two sides of the same coin in semi-supervised facial expression recognition.

[DOI]

,

,

,

,

Comput. Vis. Image Underst., 2025

A Survey on Multi-modal Intent Recognition: Recent Advances and New Frontiers.

[DOI]

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Can We Trust AI Doctors? A Survey of Medical Hallucination in Large Language and Large Vision-Language Models.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2025

A Survey on Foundation Language Models for Single-cell Biology.

[DOI]

,

,

,

,

,

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

DREAM: Decoupled Discriminative Learning with Bigraph-aware Alignment for Semi-supervised 2D-3D Cross-modal Retrieval.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

HOPE: A Hierarchical Perspective for Semi-Supervised 2D-3D Cross-Modal Retrieval.

[DOI]

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

FATE: Learning Effective Binary Descriptors With Group Fairness.

[DOI]

,

,

,

IEEE Trans. Image Process., 2024

MIMIC: Mask Image Pre-training with Mix Contrastive Fine-tuning for Facial Expression Recognition.

[DOI]

,

,

,

CoRR, 2024

Semi-supervised Knowledge Transfer Across Multi-omic Single-cell Data.

[DOI]

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

DEMO: A Statistical Perspective for Efficient Image-Text Matching.

[DOI]

,

,

,

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Fine-grained Prototypical Voting with Heterogeneous Mixup for Semi-supervised 2D-3D Cross-modal Retrieval.

[DOI]

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Semi-Supervised Multimodal Emotion Recognition with Expression MAE.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Loading...