Aaron J. Li

According to our database1, Aaron J. Li authored at least 3 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Green Shielding: A User-Centric Approach Towards Trustworthy AI.
CoRR, April, 2026

Evaluating Adversarial Robustness of Concept Representations in Sparse Autoencoders.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025
Interpretability Illusions with Sparse Autoencoders: Evaluating Robustness of Concept Representations.
CoRR, May, 2025


  Loading...