Rongyu Zhang

Orcid: 0000-0002-9174-1765

According to our database, Rongyu Zhang authored at least 54 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
A multicenter study: habitat imaging and radiomics to guide precision and individualized surgical treatment in chronic osteomyelitis.
BMC Medical Imaging, December, 2026

Mask World Model: Predicting What Matters for Robust Robot Policy Learning.
CoRR, April, 2026

Key-Embedded Privacy for Decentralized AI in Biomedical Omics.
CoRR, March, 2026

FinToolSyn: A forward synthesis Framework for Financial Tool-Use Dialogue Data with Dynamic Tool Retrieval.
CoRR, March, 2026

Linking Perception, Confidence and Accuracy in MLLMs.
CoRR, March, 2026

Two-stage rolling optimization resilience enhancement strategy for Multi-Microgrid under extreme weather conditions.
Reliab. Eng. Syst. Saf., 2026

PANDA: Empowering Small Language Models for Proactive Dialogue Through Agent-Based Synthesis (Student Abstract).
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Implicit neural image field for biological microscopy image compression.
Nat. Comput. Sci., November, 2025

Unimodal Training-Multimodal Prediction: Cross-Modal Federated Learning With Hierarchical Aggregation.
IEEE Trans. Mob. Comput., October, 2025

Robobench: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models as Embodied Brain.
CoRR, October, 2025

RepCaM++: Exploring Transparent Visual Prompt With Inference-Time Re-Parameterization for Neural Video Delivery.
IEEE Trans. Mob. Comput., September, 2025

Orochi: Versatile Biomedical Image Processor.
CoRR, September, 2025

BEVUDA++: Geometric-Aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection.
IEEE Trans. Circuits Syst. Video Technol., May, 2025

SpikeGen: Generative Framework for Visual Spike Stream Processing.
CoRR, May, 2025

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment.
CoRR, May, 2025

Second FRCSyn-onGoing: Winning solutions and post-challenge analysis to improve face recognition with synthetic data.
Inf. Fusion, 2025

Omni-LLaMA-AD: A Unified Model for Open-Set Visual Anomaly Detection.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

FBQuant: FeedBack Quantization for Large Language Models.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Empowering World Models with Reflection for Embodied Video Prediction.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025


PAT: Pruning-Aware Tuning for Large Language Models.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Multi-Level Personalized Federated Learning on Heterogeneous and Long-Tailed Data.
IEEE Trans. Mob. Comput., December, 2024

EVA: An Embodied World Model for Future Video Anticipation.
CoRR, 2024

FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models.
CoRR, 2024

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions.
CoRR, 2024

Implicit Neural Image Field for Biological Microscopy Image Compression.
CoRR, 2024

Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation.
CoRR, 2024

Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning.
CoRR, 2024

VeCAF: VLM-empowered Collaborative Active Finetuning with Training Objective Awareness.
CoRR, 2024

VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024


Efficient Deweather Mixture-of-Experts with Uncertainty-Aware Feature-Wise Linear Modulation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
FedAB: Truthful Federated Learning With Auction-Based Combinatorial Multi-Armed Bandit.
IEEE Internet Things J., September, 2023

A Spatial Information Extraction Method Based on Multi-Modal Social Media Data: A Case Study on Urban Inundation.
ISPRS Int. J. Geo Inf., September, 2023

Optimizing Efficient Personalized Federated Learning with Hypernetworks at Edge.
IEEE Netw., 2023

Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation.
CoRR, 2023

ChatIllusion: Efficient-Aligning Interleaved Generation ability with Visual Instruction Model.
CoRR, 2023

NTIRE 2023 Quality Assessment of Video Enhancement Challenge.
CoRR, 2023

Unimodal Training-Multimodal Prediction: Cross-modal Federated Learning with Hierarchical Aggregation.
CoRR, 2023

RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery.
Proceedings of the 33rd Workshop on Network and Operating System Support for Digital Audio and Video, 2023

Cluster-driven GNN-based Federated Recommendation with Biased Message Dropout.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

DFGC-VRA: DeepFake Game Competition on Visual Realism Assessment.
Proceedings of the IEEE International Joint Conference on Biometrics, 2023

Cloud-Device Collaborative Adaptation to Continual Changing Environments in the Real-World.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
CSCD2: an integrated interactional database of cancer-specific circular RNAs.
Nucleic Acids Res., 2022

Multi-latent Space Alignments for Unsupervised Domain Adaptation in Multi-view 3D Object Detection.
CoRR, 2022

Multi-Frames Temporal Abnormal Clues Learning Method for Face Anti-Spoofing.
Proceedings of the 34th International Conference on Software Engineering and Knowledge Engineering, 2022


Image Quality Assessment with Gradient Siamese Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2020
A Dense U-Net with Cross-Layer Intersection for Detection and Localization of Image Forgery.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

