Weidi Luo

ORCID: 0000-0001-9244-4677

According to our database, Weidi Luo authored at least 11 papers between 2024 and 2025.

Bibliography

2025
Alignment and Safety in Large Language Models: Safety Mechanisms, Training Paradigms, and Emerging Challenges.
CoRR, July, 2025

Doxing via the Lens: Revealing Privacy Leakage in Image Geolocation for Agentic Multi-Modal Large Reasoning Model.
CoRR, April, 2025

Dynamic Guided and Domain Applicable Safeguards for Enhanced Security in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Disentangling Memory and Reasoning Ability in Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Robustness-aware Automatic Prompt Optimization.
CoRR, 2024

Guide for Defense (G4D): Dynamic Guidance for Robust and Balanced Defense in Large Language Models.
CoRR, 2024

Visual-RolePlay: Universal Jailbreak Attack on MultiModal Large Language Models via Role-playing Image Character.
CoRR, 2024

JailBreakV-28K: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against Jailbreak Attacks.
CoRR, 2024

Bringing Back the Context: Camera Trap Species Identification as Link Prediction on Multimodal Knowledge Graphs.
CoRR, 2024

Reviving the Context: Camera Trap Species Classification as Link Prediction on Multimodal Knowledge Graphs.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

