Alex Cloud

According to our database1, Alex Cloud authored at least 10 papers between 2015 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2025
Recontextualization Mitigates Specification Gaming without Modifying the Specification.
CoRR, December, 2025

Beyond Data Filtering: Knowledge Localization for Capability Removal in LLMs.
CoRR, December, 2025

Natural Emergent Misalignment from Reward Hacking in Production RL.
CoRR, November, 2025

Output Supervision Can Obfuscate the Chain of Thought.
CoRR, November, 2025

Subliminal Learning: Language models transmit behavioral traits via hidden signals in data.
CoRR, July, 2025

Distillation Robustifies Unlearning.
CoRR, June, 2025

2024
Gradient Routing: Masking Gradients to Localize Computation in Neural Networks.
CoRR, 2024

2023
Anticipatory Fictitious Play.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

2021
Variance Decompositions for Extensive-Form Games.
Proceedings of the 2021 IEEE Conference on Games (CoG), 2021

2015
Fast Perfect Simulation of Vervaat Perpetutities.
CoRR, 2015


  Loading...