Zhaoyu Chen

ORCID: 0000-0002-7112-2596

Affiliations:
  • Fudan University, Shanghai, China


According to our database, Zhaoyu Chen authored at least 59 papers between 2021 and 2025.

Bibliography

2025
Synthesizing Near-Boundary OOD Samples for Out-of-Distribution Detection.
CoRR, July, 2025

LingoLoop Attack: Trapping MLLMs via Linguistic Context and State Entrapment into Endless Loops.
CoRR, June, 2025

Enhancing Diffusion-based Unrestricted Adversarial Attacks via Adversary Preferences Alignment.
CoRR, June, 2025

MMARD: Improving the Min-Max Optimization Process in Adversarial Robustness Distillation.
CoRR, March, 2025

VideoPure: Diffusion-based Adversarial Purification for Video Recognition.
CoRR, January, 2025

Boosting Adversarial Transferability with Spatial Adversarial Alignment.
CoRR, January, 2025

Enhancing the accuracy of Generative Adversarial Networks with Fokker-Planck Equations.
Neurocomputing, 2025

Pruning for Sparse Diffusion Models Based on Gradient Flow.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Debiased Multimodal Understanding for Human Language Sequences.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Towards Context-Aware Emotion Recognition Debiasing From a Causal Demystification Perspective via De-Confounded Training.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Boosting the transferability of adversarial attacks with global momentum initialization.
Expert Syst. Appl., 2024

MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration.
CoRR, 2024

X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation.
CoRR, 2024

General Compression Framework for Efficient Transformer Object Tracking.
CoRR, 2024

Improving Adversarial Transferability with Neighbourhood Gradient Information.
CoRR, 2024

PG-Attack: A Precision-Guided Adversarial Attack Framework Against Vision Foundation Models for Autonomous Driving.
CoRR, 2024

LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation.
CoRR, 2024

Improving Adversarial Transferability of Visual-Language Pre-training Models through Collaborative Multimodal Interaction.
CoRR, 2024

ClickVOS: Click Video Object Segmentation.
CoRR, 2024

Towards Multimodal Human Intention Understanding Debiasing via Subject-Deconfounding.
CoRR, 2024

Delving into Decision-based Black-box Attacks on Semantic Segmentation.
CoRR, 2024

Sampling to Distill: Knowledge Transfer from Open-World Data.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

FAMIM: A Novel Frequency-Domain Augmentation Masked Image Model Framework for Domain Generalizable Face Anti-Spoofing.
Proceedings of the IEEE International Conference on Acoustics, 2024

Towards Multimodal Sentiment Analysis Debiasing via Bias Purification.
Proceedings of the Computer Vision - ECCV 2024, 2024

Self-cooperation Knowledge Distillation for Novel Class Discovery.
Proceedings of the Computer Vision - ECCV 2024, 2024

De-Confounded Data-Free Knowledge Distillation for Handling Distribution Shifts.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Large Vision-Language Models as Emotion Recognizers in Context Awareness.
Proceedings of the Asian Conference on Machine Learning, 2024

Out of Thin Air: Exploring Data-Free Adversarial Robustness Distillation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Memory Network With Pixel-Level Spatio-Temporal Learning for Visual Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., November, 2023

Query-Efficient Decision-Based Black-Box Patch Attack.
IEEE Trans. Inf. Forensics Secur., 2023

Exploring Decision-based Black-box Attacks on Face Forgery Detection.
CoRR, 2023

OpenVIS: Open-vocabulary Video Instance Segmentation.
CoRR, 2023

Efficient Decision-based Black-box Patch Attacks on Video Recognition.
CoRR, 2023

Model Robustness Meets Data Privacy: Adversarial Robustness Distillation without Original Data.
CoRR, 2023

Content-based Unrestricted Adversarial Attack.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Capture to Registration Framework for Realistic Image Super-Resolution in the Industry Environment.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Freq-HD: An Interpretable Frequency-based High-Dynamics Affective Clip Selection Method for in-the-Wild Facial Expression Recognition in Videos.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Exploring the Adversarial Robustness of Video Object Segmentation via One-shot Adversarial Attacks.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Towards Decision-based Sparse Attacks on Video Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

GSFormer: Geometric-Spatial Transformer on Point Cloud Completion.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Improving Generalization in Visual Reinforcement Learning via Conflict-aware Gradient Agreement Augmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Efficient Decision-based Black-box Patch Attacks on Video Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LVOS: A Benchmark for Long-term Video Object Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Explicit and Implicit Knowledge Distillation via Unlabeled Data.
Proceedings of the IEEE International Conference on Acoustics, 2023

Adversarial Contrastive Distillation with Adaptive Denoising.
Proceedings of the IEEE International Conference on Acoustics, 2023

Context De-Confounded Emotion Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Boosting the Transferability of Adversarial Attacks with Global Momentum Initialization.
CoRR, 2022

DPCNet: Dual Path Multi-Excitation Collaborative Network for Facial Expression Representation Learning in Videos.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Efficient Universal Shuffle Attack for Visual Object Tracking.
Proceedings of the IEEE International Conference on Acoustics, 2022

Shape Matters: Deformable Patch Attack.
Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Practical Certifiable Patch Defense with Vision Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes.
CoRR, 2021

Rpattack: Refined Patch Attack on General Object Detectors.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

