Yan Zhang

ORCID: 0000-0003-1642-0758

Affiliations:
  • Xiamen University, Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Xiamen, China


According to our database, Yan Zhang authored at least 54 papers between 2015 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
GraphMSR: A graph foundation model-based approach for MRI image super-resolution with multimodal semantic integration.
Pattern Recognit., 2026

2025
Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs.
CoRR, May, 2025

S²Teacher: Step-by-step Teacher for Sparsely Annotated Oriented Object Detection.
CoRR, April, 2025

Baichuan-M1: Pushing the Medical Capability of Large Language Models.
CoRR, February, 2025

Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy.
CoRR, February, 2025

Baichuan-Omni-1.5 Technical Report.
CoRR, January, 2025

NAPG: Neighborhood-Assisted Multiprototype Group Model for Cross-Domain Semantic Segmentation of Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2025

Attention-driven acoustic properties learning for underwater target ranging.
Pattern Recognit., 2025

HGTL: A hypergraph transfer learning framework for survival prediction of ccRCC.
Medical Image Anal., 2025

DPCA: Dynamic multi-prototype cross-attention for change detection unsupervised domain adaptation of remote sensing images.
Knowl. Based Syst., 2025

Multi-scale and contrastive learning for pediatric chest radiograph classification tasks.
Displays, 2025

A mixed-scale dynamic attention transformer for pediatric pneumonia diagnosis.
Displays, 2025

RPF-Net: A multimodal model for the postoperative UISS risk stratification of non-metastatic ccRCC based on CT and whole-slide images.
Comput. Methods Programs Biomed., 2025

SysBench: Can LLMs Follow System Message?
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

UCOD-DPL: Unsupervised Camouflaged Object Detection via Dynamic Pseudo-label Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Distilling Spatially-Heterogeneous Distortion Perception for Blind Image Quality Assessment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Feature Denoising Diffusion Model for Blind Image Quality Assessment.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

BUFF: Bayesian Uncertainty Guided Diffusion Probabilistic Model for Single Image Super-Resolution.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Multicenter evaluation of CT deep radiomics model in predicting Leibovich score risk groups for non-metastatic clear cell renal cell carcinoma.
Displays, 2024

Breaking the Bias: Recalibrating the Attention of Industrial Anomaly Detection.
CoRR, 2024

Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment.
CoRR, 2024

Boosting CLIP Adaptation for Image Quality Assessment via Meta-Prompt Learning and Gradient Regularization.
CoRR, 2024

SysBench: Can Large Language Models Follow System Messages?
CoRR, 2024

MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark.
CoRR, 2024

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs.
CoRR, 2024

PAS: Data-Efficient Plug-and-Play Prompt Augmentation System.
CoRR, 2024

Local Manifold Learning for No-Reference Image Quality Assessment.
CoRR, 2024

Multi-Modal Prompt Learning on Blind Image Quality Assessment.
CoRR, 2024

Feature Denoising Diffusion Model for Blind Image Quality Assessment.
CoRR, 2024

CPE COIN++: Towards Optimized Implicit Neural Representation Compression Via Chebyshev Positional Encoding.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-Identification.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Adaptive Selection based Referring Image Segmentation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Cantor: Inspiring Multimodal Chain-of-Thought of MLLM.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

GreedyAgent: Crafting Efficient Agents for Meta-learning from Learning Curves via Greedy Algorithm Selection.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

Semi-Supervised Blind Image Quality Assessment through Knowledge Distillation and Incremental Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
MMICT: Boosting Multi-Modal Fine-Tuning with In-Context Examples.
CoRR, 2023

Adaptive Feature Selection for No-Reference Image Quality Assessment using Contrastive Mitigating Semantic Noise Sensitivity.
CoRR, 2023

Less is More: Learning Reference Knowledge Using No-Reference Image Quality Assessment.
CoRR, 2023

Prompt Based Lifelong Person Re-identification.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Quality-Aware CLIP for Blind Image Quality Assessment.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Data-Free Low-Bit Quantization via Dynamic Multi-teacher Knowledge Distillation.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Cross-Dataset Distillation with Multi-tokens for Image Quality Assessment.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Classifier Decoupled Training for Black-Box Unsupervised Domain Adaptation.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Learning Occlusion Disentanglement with Fine-grained Localization for Occluded Person Re-identification.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Event-Diffusion: Event-Based Image Reconstruction and Restoration with Diffusion Models.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

A Novel Neighbor Aggregation Function for Medical Point Cloud Analysis.
Proceedings of the Advances in Computer Graphics, 2023

Data-Efficient Image Quality Assessment with Attention-Panel Decoder.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2016
3D object retrieval with multi-feature collaboration and bipartite graph matching.
Neurocomputing, 2016

Search-Based Depth Estimation via Coupled Dictionary Learning with Large-Margin Structure Inference.
Proceedings of the Computer Vision - ECCV 2016, 2016

3D Object Retrieval with Multimodal Views.
Proceedings of the 9th Eurographics Workshop on 3D Object Retrieval, 2016

2015

