Yida Lu

ORCID: 0009-0000-4492-9047

According to our database, Yida Lu authored at least 14 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Survive at All Costs: Exploring LLM's Risky Behaviors under Survival Pressure.

CoRR, March, 2026

The Missing Half: Unveiling Training-time Implicit Safety Risks Beyond Deployment.

CoRR, February, 2026

The Side Effects of Being Smart: Safety Risks in MLLMs' Multi-Image Reasoning.

Victor Shea-Jay Huang

CoRR, January, 2026

2025

ShieldVLM: Safeguarding the Multimodal Implicit Toxicity via Deliberative Reasoning with LVLMs.

CoRR, May, 2025

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement.

CoRR, February, 2025

ShieldVLM: Safeguarding the Multimodal Implicit Toxicity via Deliberative Reasoning with LVLMs.

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

VPO: Aligning Text-to-Video Generation Models with Prompt Optimization.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

LongSafety: Evaluating Long-Context Safety of Large Language Models.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Agent-SafetyBench: Evaluating the Safety of LLM Agents.

CoRR, 2024

Global Challenge for Safe and Secure LLMs Track 1.

CoRR, 2024

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023

Rethinking Dense Retrieval's Few-Shot Ability.

CoRR, 2023
