Jialing Tao
According to our database1,
Jialing Tao
authored at least 9 papers
between 2018 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
CoRR, September, 2025
Safe-SAIL: Towards a Fine-grained Safety Landscape of Large Language Models via Sparse Autoencoder Interpretation Framework.
CoRR, September, 2025
Oyster-I: Beyond Refusal - Constructive Safety Alignment for Responsible Language Models.
CoRR, September, 2025
Strata-Sword: A Hierarchical Safety Evaluation towards LLMs based on Reasoning Complexity of Jailbreak Instructions.
CoRR, September, 2025
Learning to Detect Unknown Jailbreak Attacks in Large Vision-Language Models: A Unified and Accurate Approach.
CoRR, August, 2025
CoRR, January, 2025
Mirage in the Eyes: Hallucination Attack on Multi-modal Large Language Models with Only Attention Sink.
Proceedings of the 34th USENIX Security Symposium, 2025
2024
2018
Proceedings of the 2018 IEEE International Conference on Data Mining Workshops, 2018