Yu Zhang

Orcid: 0009-0008-6938-204X

Affiliations:
  • Tongji University, Shanghai, China
  • Tiangong University, School of Computer Science and Technology, Tianjin, China (former)
  • Tianjin Polytechnic University, School of Computer Science and Technology, China (former)


According to our database1, Yu Zhang authored at least 26 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
HMR-1: Hierarchical Massage Robot with Vision-Language-Model for Embodied Healthcare.
CoRR, March, 2026

Swordsman: Entropy-Driven Adaptive Block Partition for Efficient Diffusion Language Models.
CoRR, February, 2026

Adaptive Visual Autoregressive Acceleration via Dual-Linkage Entropy Analysis.
CoRR, February, 2026

2025
Markovian Scale Prediction: A New Era of Visual Autoregressive Generation.
CoRR, November, 2025

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning.
CoRR, June, 2025

Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning.
CoRR, May, 2025

CAE-DFKD: Bridging the Transferability Gap in Data-Free Knowledge Distillation.
CoRR, April, 2025

Efficient Diffusion Models: A Survey.
Trans. Mach. Learn. Res., 2025

LMM-VQA: Advancing Video Quality Assessment With Large Multimodal Models.
IEEE Trans. Circuits Syst. Video Technol., 2025

Collaboration Wins More: Dual-Modal Collaborative Attention Reinforcement for Mitigating Large Vision Language Models Hallucination.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Text-guided Multimodal Fusion for the Multimodal Emotion and Intent Joint Understanding.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

CAE-DFKD: Bridging the Transferability Gap in Data-Free Knowledge Distillation.
Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models.
CoRR, 2024

NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results.
CoRR, 2024

NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Lite-Mind: Towards Efficient and Robust Brain Representation Learning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024


2023
MG-ViT: A Multi-Granularity Method for Compact and Efficient Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023


2021
See clearly on rainy days: Hybrid multiscale loss guided multi-feature fusion network for single image rain removal.
Comput. Vis. Media, 2021

2020
Lag output synchronization for multiple output coupled complex networks with positive semidefinite or positive definite output matrix.
J. Frankl. Inst., 2020

2019
Finite-time passivity of multiple weighted coupled uncertain neural networks with directed and undirected topologies.
Neurocomputing, 2019

Multi-scale Attentive Residual Network for Single Image Deraining.
Proceedings of the Human Centered Computing - 5th International Conference, 2019


  Loading...