Yifeng Gao

Orcid: 0000-0001-5034-1928

Affiliations:
  • Fudan University, Shanghai, China


According to our database1, Yifeng Gao authored at least 20 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Towards Context-Invariant Safety Alignment for Large Language Models.
CoRR, May, 2026

DarkLLM: Learning Language-Driven Adversarial Attacks with Large Language Models.
CoRR, May, 2026

HazardArena: Evaluating Semantic Safety in Vision-Language-Action Models.
CoRR, April, 2026

AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents.
CoRR, April, 2026

Internal Safety Collapse in Frontier Large Language Models.
CoRR, March, 2026

Mirror: A Multi-Agent System for AI-Assisted Ethics Review.
CoRR, February, 2026

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5.
CoRR, January, 2026

2025
ADMIT: Few-shot Knowledge Poisoning Attacks on RAG-based Fact Checking.
CoRR, October, 2025

FreezeVLA: Action-Freezing Attacks against Vision-Language-Action Models.
CoRR, September, 2025

DAVID-XR1: Detecting AI-Generated Videos with Explainable Reasoning.
CoRR, June, 2025

Identity Lock: Locking API Fine-tuned LLMs With Identity-based Wake Words.
CoRR, March, 2025

Safety at Scale: A Comprehensive Survey of Large Model Safety.
CoRR, February, 2025

Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety.
Found. Trends Priv. Secur., 2025

SafeVid: Toward Safety Aligned Video Large Multimodal Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

T2UE: Generating Unlearnable Examples from Text Descriptions.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

BrokenVideos: A Benchmark Dataset for Fine-Grained Artifact Localization in AI-Generated Videos.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model.
CoRR, 2024

The Dog Walking Theory: Rethinking Convergence in Federated Learning.
CoRR, 2024

ModelLock: Locking Your Model With a Spell.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024


  Loading...