Zeyu Zhang

Orcid: 0009-0006-8819-3741

Affiliations:
  • Australian National University, Australia


According to our database1, Zeyu Zhang authored at least 83 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Group Cognition Learning: Making Everything Better Through Governed Two-Stage Agents Collaboration.
CoRR, May, 2026

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation.
CoRR, April, 2026

Less Detail, Better Answers: Degradation-Driven Prompting for VQA.
CoRR, April, 2026

FlashSign: Pose-Free Guidance for Efficient Sign Language Video Generation.
CoRR, March, 2026

Rethinking Token Pruning for Historical Screenshots in GUI Visual Agents: Semantic, Spatial, and Temporal Perspectives.
CoRR, March, 2026

LiveWorld: Simulating Out-of-Sight Dynamics in Generative Video World Models.
CoRR, March, 2026

GeoWorld: Geometric World Models.
CoRR, February, 2026

OCR-Agent: Agentic OCR with Capability and Memory Reflection.
CoRR, February, 2026

TempoNet: Slack-Quantized Transformer-Guided Reinforcement Scheduler for Adaptive Deadline-Centric Real-Time Dispatchs.
CoRR, February, 2026

MMA: Multimodal Memory Agent.
CoRR, February, 2026

NeuroSymActive: Differentiable Neural-Symbolic Reasoning with Active Exploration for Knowledge Graph Question Answering.
CoRR, February, 2026

GeneralVLA: Generalizable Vision-Language-Action Models with Knowledge-Guided Trajectory Planning.
CoRR, February, 2026

CoV: Chain-of-View Prompting for Spatial Reasoning.
CoRR, January, 2026

WebCryptoAgent: Agentic Crypto Trading with Web Informatics.
CoRR, January, 2026

Exploring dynamic interpretable brain networks via hierarchical graph transformer.
Pattern Recognit., 2026

Federated graph-level clustering network with adaptive knowledge compensation.
Neural Networks, 2026

DOEI: Dual optimization of embedding information for attention-enhanced class activation maps.
Neurocomputing, 2026

SSS: Semi-Supervised SAM-2 with efficient prompting for medical imaging segmentation.
Biomed. Signal Process. Control., 2026

A Unified Graph Clustering Network.
Proceedings of the ACM Web Conference 2026, 2026

VaseVQA: Multimodal Agent and Benchmark for Ancient Greek Pottery.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

V-Pruner: A Fast and Globally-informed Token Pruning Framework for Vision Transformer.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
DragMesh: Interactive 3D Generation Made Easy.
CoRR, December, 2025

EgoLCD: Egocentric Video Generation with Long Context Diffusion.
CoRR, December, 2025

BlockVid: Block Diffusion for High-Quality and Consistent Minute-Long Video Generation.
CoRR, November, 2025

Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation.
CoRR, November, 2025

MobileVLA-R1: Reinforcing Vision-Language-Action for Mobile Robots.
CoRR, November, 2025

EvoVLA: Self-Evolving Vision-Language-Action Model.
CoRR, November, 2025

VaseVQA-3D: Benchmarking 3D VLMs on Ancient Greek Pottery.
CoRR, October, 2025

UniVid: The Open-Source Unified Video Model.
CoRR, September, 2025

VolSplat: Rethinking Feed-Forward 3D Gaussian Splatting with Voxel-Aligned Prediction.
CoRR, September, 2025

StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes.
CoRR, September, 2025

Nav-R1: Reasoning and Navigation in Embodied Scenes.
CoRR, September, 2025

ReMoMask: Retrieval-Augmented Masked Motion Generation.
CoRR, August, 2025

3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding.
CoRR, July, 2025

SSS: Semi-Supervised SAM-2 with Efficient Prompting for Medical Imaging Segmentation.
CoRR, June, 2025

Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting.
CoRR, June, 2025

FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion.
CoRR, June, 2025

Dynamic Domain Adaptation-Driven Physics-Informed Graph Representation Learning for AC-OPF.
CoRR, June, 2025

ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS.
CoRR, May, 2025

Dual-channel Heterophilic Message Passing for Graph Fraud Detection.
CoRR, April, 2025

3D CoCa: Contrastive Learners are 3D Captioners.
CoRR, April, 2025

Multi-Relation Graph-Kernel Strengthen Network for Graph-Level Clustering.
CoRR, April, 2025

PathoHR: Breast Cancer Survival Prediction on High-Resolution Pathological Images.
CoRR, March, 2025

Motion Anything: Any to Motion Generation.
CoRR, March, 2025

DOEI: Dual Optimization of Embedding Information for Attention-Enhanced Class Activation Maps.
CoRR, February, 2025

Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework.
CoRR, February, 2025

MedConv: Convolutions Beat Transformers on Long-Tailed Bone Density Prediction.
CoRR, February, 2025

Medical artificial intelligence for early detection of lung cancer: A survey.
Eng. Appl. Artif. Intell., 2025

Hazards in Daily Life? Enabling Robots to Proactively Detect and Resolve Anomalies.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

You Can Generate It Again: Data-to-Text Generation with Verification and Correction Prompting.
Proceedings of the 7th ACM International Conference on Multimedia in Asia, 2025

MARL-MambaContour: Unleashing Multi-Agent Deep Reinforcement Learning for Active Contour Optimization in Medical Image Segmentation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Unified Medical Image Segmentation with State Space Modeling Snake.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

MediAug: Exploring Visual Augmentation in Medical Imaging.
Proceedings of the Medical Image Understanding and Analysis - 29th Annual Conference, 2025

DHGFormer: Dynamic Hierarchical Graph Transformer for Disorder Brain Disease Diagnosis.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Adaptive Embedding for Long-Range High-Order Dependencies via Time-Varying Transformer on fMRI.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Dual-channel Heterophilic Message Passing for Graph Fraud Detection.
Proceedings of the International Joint Conference on Neural Networks, 2025

RL-Pruner: Retraining-Free Global Exploration Pruning Method Based on Reinforcement Learning.
Proceedings of the International Joint Conference on Neural Networks, 2025

JTFM: Joint Time-Frequency Method For Long-term Time Series Forecasting.
Proceedings of the International Joint Conference on Neural Networks, 2025

Multi-Relation Graph-Kernel Strengthen Network for Graph-Level Clustering.
Proceedings of the International Joint Conference on Neural Networks, 2025

MSDet: Receptive Field Enhanced Multiscale Detection for Tiny Pulmonary Nodule.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Efficient Learning with Sine-Activated Low-Rank Matrices.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PresentAgent: Multimodal Agent for Presentation Video Generation.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

PedDet: Adaptive Spectral Optimization for Multimodal Pedestrian Detection.
Proceedings of the ECAI 2025 - 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy, 2025

ProjectedEx: Enhancing Generation in Explainable AI for Prostate Cancer.
Proceedings of the 38th IEEE International Symposium on Computer-Based Medical Systems, 2025

2024
SegKAN: High-Resolution Medical Image Segmentation with Long-Distance Dependencies.
CoRR, 2024

KMM: Key Frame Mask Mamba for Extended Motion Generation.
CoRR, 2024

Medical AI for Early Detection of Lung Cancer: A Survey.
CoRR, 2024

MSDet: Receptive Field Enhanced Multiscale Detection for Tiny Pulmonary Nodule.
CoRR, 2024

ESA: Annotation-Efficient Active Learning for Semantic Segmentation.
CoRR, 2024

SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation.
CoRR, 2024

XLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training.
CoRR, 2024

InfiniMotion: Mamba Boosts Memory in Transformer for Arbitrary Long Motion Generation.
CoRR, 2024

Sine Activated Low-Rank Matrices for Parameter Efficient Learning.
CoRR, 2024

Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM.
CoRR, 2024

JointViT: Modeling Oxygen Saturation Levels with Joint Supervision on Long-Tailed OCTA.
Proceedings of the Medical Image Understanding and Analysis - 28th Annual Conference, 2024

A Landmark-Based Approach for Instability Prediction in Distal Radius Fractures.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

SegReg: Segmenting OARs by Registering MR Images and CT Annotations.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

Motion Mamba: Efficient and Long Sequence Motion Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion.
Proceedings of the 35th British Machine Vision Conference, 2024

MedDet: Generative Adversarial Distillation for Efficient Cervical Disc Herniation Detection.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024

A Deep Learning Approach to Diabetes Diagnosis.
Proceedings of the Recent Challenges in Intelligent Information and Database Systems, 2024

2023
SegReg: Segmenting OARs by Registering MR Images and CT Annotations.
CoRR, 2023

BHSD: A 3D Multi-class Brain Hemorrhage Segmentation Dataset.
Proceedings of the Machine Learning in Medical Imaging - 14th International Workshop, 2023


  Loading...