Will Cai

According to our database¹, Will Cai authored at least 6 papers between 2024 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2025

The Geometry of Harmfulness in LLMs through Subconcept Probing.

McNair Shah, Saleena Angeline, Adhitya Rajendra Kumar

CoRR, July, 2025

PromptArmor: Simple yet Effective Prompt Injection Defenses.

CoRR, July, 2025

Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs.

CoRR, April, 2025

Improving LLM Safety Alignment with Dual-Objective Optimization.

CoRR, March, 2025

Scaling Trends for Data Poisoning in LLMs.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Scaling Laws for Data Poisoning in LLMs.

CoRR, 2024
