Leon Lang

Orcid: 0000-0002-1950-2831

According to our database1, Leon Lang authored at least 12 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Modeling Human Beliefs about AI Behavior for Scalable Oversight.
CoRR, February, 2025

2024
Factored space models: Towards causality between levels of abstraction.
CoRR, 2024

Abstract Markov Random Fields.
CoRR, 2024

The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret.
CoRR, 2024

When Your AIs Deceive You: Challenges with Partial Observability of Human Evaluators in Reward Learning.
CoRR, 2024

When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
Evaluating Shutdown Avoidance of Language Models in Textual Scenarios.
CoRR, 2023

2022
Information Decomposition Diagrams Applied beyond Shannon Entropy: A Generalization of Hu's Theorem.
CoRR, 2022

A Program to Build E(N)-Equivariant Steerable CNNs.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
A Wigner-Eckart Theorem for Group Equivariant Convolution Kernels.
Proceedings of the 9th International Conference on Learning Representations, 2021

2019
Learning to Request Guidance in Emergent Communication.
CoRR, 2019

Learning to request guidance in emergent language.
Proceedings of the Beyond Vision and LANguage: inTEgrating Real-world kNowledge, 2019


  Loading...