Aaron J. Li

According to our database, Aaron J. Li authored at least 3 papers between 2025 and 2026.

2026

Green Shielding: A User-Centric Approach Towards Trustworthy AI.

CoRR, April, 2026

Evaluating Adversarial Robustness of Concept Representations in Sparse Autoencoders.

Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025

Interpretability Illusions with Sparse Autoencoders: Evaluating Robustness of Concept Representations.

CoRR, May, 2025