David D. Baek

According to our database, David D. Baek authored at least 8 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring.
CoRR, February, 2026

2025
Any-Depth Alignment: Unlocking Innate Safety Alignment of LLMs to Any-Depth.
CoRR, October, 2025

Scaling Laws For Scalable Oversight.
CoRR, April, 2025

Towards Understanding Distilled Reasoning Models: A Representational Approach.
CoRR, March, 2025

Harmonic Loss Trains Interpretable AI Models.
Trans. Mach. Learn. Res., 2025

The Geometry of Concepts: Sparse Autoencoder Feature Structure.
Entropy, 2025

2024
Generalization from Starvation: Hints of Universality in LLM Knowledge Graph Learning.
CoRR, 2024

GenEFT: Understanding Statics and Dynamics of Model Generalization via Effective Theory.
CoRR, 2024
