Daoan Zhang

Orcid: 0000-0002-6959-165X

According to our database1, Daoan Zhang authored at least 32 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge.
CoRR, July, 2025

GPT-4V(ision) as A Social Media Analysis Engine.
ACM Trans. Intell. Syst. Technol., June, 2025

On Path to Multimodal Generalist: General-Level and General-Bench.
CoRR, May, 2025

WorldGenBench: A World-Knowledge-Integrated Benchmark for Reasoning-Driven Text-to-Image Generation.
CoRR, May, 2025

Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1).
CoRR, April, 2025

Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs.
CoRR, February, 2025

How LLMs React to Industrial Spatio-Temporal Data? Assessing Hallucination with a Novel Traffic Incident Benchmark Dataset.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

DeFine: Decision-Making with Analogical Reasoning over Factor Profiles.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

GaussianStyle: Gaussian Head Avatar via StyleGAN.
Proceedings of the International Conference on 3D Vision, 2025

2024
EIBC: a deep learning framework for Chinese toponym recognition with multiple layers.
J. Geogr. Syst., July, 2024

SePPO: Semi-Policy Preference Optimization for Diffusion Alignment.
CoRR, 2024

DeFine: Enhancing LLM Decision-Making with Factor Profiles and Analogical Reasoning.
CoRR, 2024

Learning Brain Tumor Representation in 3D High-Resolution MR Images via Interpretable State Space Models.
CoRR, 2024

NTIRE 2024 Challenge on Image Super-Resolution (⨉4): Methods and Results.
CoRR, 2024

CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs.
CoRR, 2024

A Benchmark and Chain-of-Thought Prompting Strategy for Large Multimodal Models with Multiple Image Inputs.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

FineMatch: Aspect-Based Fine-Grained Image and Text Mismatch Detection and Correction.
Proceedings of the Computer Vision - ECCV 2024, 2024



2023
Video Understanding with Large Language Models: A Survey.
CoRR, 2023

Semi-supervised Semantic Segmentation via Boosting Uncertainty on Unlabeled Data.
CoRR, 2023

Cross Contrastive Feature Perturbation for Domain Generalization.
CoRR, 2023

DNAGPT: A Generalized Pretrained Tool for Multiple DNA Sequence Analysis Tasks.
CoRR, 2023

Towards Generalizable Medical Image Segmentation with Pixel-wise Uncertainty Estimation.
CoRR, 2023

Black-box Source-free Domain Adaptation via Two-stage Knowledge Distillation.
CoRR, 2023

Bootstrap The Original Latent: Learning a Private Model from a Black-box Model.
CoRR, 2023

Aggregation of Disentanglement: Reconsidering Domain Variations in Domain Generalization.
CoRR, 2023

Cross Contrasting Feature Perturbation for Domain Generalization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Prototype Knowledge Distillation for Medical Segmentation with Missing Modality.
Proceedings of the IEEE International Conference on Acoustics, 2023

Feature Alignment and Uniformity for Test Time Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Rethinking Alignment and Uniformity in Unsupervised Image Semantic Segmentation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
TransVLAD: Focusing on Locally Aggregated Descriptors for Few-Shot Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022


  Loading...