Yida Lu

According to our database, Yida Lu authored at least 10 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
ShieldVLM: Safeguarding the Multimodal Implicit Toxicity via Deliberative Reasoning with LVLMs.
CoRR, May, 2025

VPO: Aligning Text-to-Video Generation Models with Prompt Optimization.
CoRR, March, 2025

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement.
CoRR, February, 2025

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LongSafety: Evaluating Long-Context Safety of Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Agent-SafetyBench: Evaluating the Safety of LLM Agents.
CoRR, 2024

Global Challenge for Safe and Secure LLMs Track 1.
CoRR, 2024

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Rethinking Dense Retrieval's Few-Shot Ability.
CoRR, 2023

