Alexander Hägele

According to our database1, Alexander Hägele authored at least 7 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Apertus: Democratizing Open and Compliant LLMs for Global Language Environments.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, September, 2025

Inverse Scaling in Test-Time Compute.
CoRR, July, 2025

The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model Training.
CoRR, January, 2025

Training Dynamics of the Cooldown Stage in Warmup-Stable-Decay Learning Rate Scheduler.
Trans. Mach. Learn. Res., 2025

2024
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
BaCaDI: Bayesian Causal Discovery with Unknown Interventions.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2021
Robustness certification with generative models.
Proceedings of the PLDI '21: 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation, 2021


  Loading...