Ranjie Duan

Orcid: 0009-0002-2261-4268

According to our database, Ranjie Duan authored at least 41 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Knowledge-Guided Adversarial Training for Infrared Object Detection via Thermal Radiation Modeling.
Int. J. Comput. Vis., May, 2026

Adversarial Orthogonal Disentanglement for LVLM Hallucination Mitigation.
CoRR, May, 2026

Improving Safety Alignment via Balanced Direct Preference Optimization.
CoRR, March, 2026

Obscure but Effective: Classical Chinese Jailbreak Prompt Optimization via Bio-Inspired Search.
CoRR, February, 2026

Pruning as a Cooperative Game: Surrogate-Assisted Layer Contribution Estimation for Large Language Models.
CoRR, February, 2026

YuFeng-XGuard: A Reasoning-Centric, Interpretable, and Flexible Guardrail Model for Large Language Models.
CoRR, January, 2026

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5.
CoRR, January, 2026

2025
OmniSafeBench-MM: A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack-Defense Evaluation.
CoRR, December, 2025

VRSA: Jailbreaking Multimodal Large Language Models through Visual Reasoning Sequential Attack.
CoRR, December, 2025

SeCon-RAG: A Two-Stage Semantic Filtering and Conflict-Free Framework for Trustworthy RAG.
CoRR, October, 2025

Energy-Driven Steering: Reducing False Refusals in Large Language Models.
CoRR, October, 2025

Towards Safe Reasoning in Large Reasoning Models via Corrective Intervention.
CoRR, September, 2025

Oyster-I: Beyond Refusal - Constructive Safety Alignment for Responsible Language Models.
CoRR, September, 2025

Strata-Sword: A Hierarchical Safety Evaluation towards LLMs based on Reasoning Complexity of Jailbreak Instructions.
CoRR, September, 2025

Towards Class-wise Fair Adversarial Training via Anti-Bias Soft Label Distillation.
CoRR, June, 2025

Enhancing Adversarial Robustness of Vision Language Models via Adversarial Mixture Prompt Tuning.
CoRR, May, 2025

DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models.
CoRR, April, 2025

Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency.
CoRR, January, 2025

RobustPrompt: Learning to defend against adversarial attacks with adaptive visual prompts.
Pattern Recognit. Lett., 2025

Mirage in the Eyes: Hallucination Attack on Multi-modal Large Language Models with Only Attention Sink.
Proceedings of the 34th USENIX Security Symposium, 2025

DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

NDM: A Noise-driven Detection and Mitigation Framework against Implicit Sexual Intentions in Text-to-Image Generation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

STAIR: Improving Safety Alignment with Introspective Reasoning.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Heuristic-Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

PBI-Attack: Prior-Guided Bimodal Interactive Black-Box Jailbreak Attack for Toxicity Maximization.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
Revisiting and Exploring Efficient Fast Adversarial Training via LAW: Lipschitz Regularization and Auto Weight Averaging.
IEEE Trans. Inf. Forensics Secur., 2024

Heuristic-Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models.
CoRR, 2024

PBI-Attack: Prior-Guided Bimodal Interactive Black-Box Jailbreak Attack for Toxicity Maximization.
CoRR, 2024

MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue.
CoRR, 2024

RT-Attack: Jailbreaking Text-to-Image Models via Random Token.
CoRR, 2024

Improving Adversarial Robust Fairness via Anti-Bias Soft Label Distillation.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
Revisiting and Exploring Efficient Fast Adversarial Training via LAW: Lipschitz Regularization and Auto Weight Averaging.
CoRR, 2023

Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Inequality phenomenon in l∞-adversarial training, and its unrealized threats.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Enhance the Visual Representation via Discrete Adversarial Training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Robust Vision Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
AdvDrop: Adversarial Attack to DNNs by Dropping Information.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Adversarial Laser Beam: Effective Physical-World Attack to DNNs in a Blink.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Adversarial Camouflage: Hiding Physical-World Attacks With Natural Styles.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

