Jacob Hilton

According to our database1, Jacob Hilton authored at least 12 papers between 2016 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Scaling laws for single-agent reinforcement learning.
CoRR, 2023

Scaling Laws for Reward Model Overoptimization.
Proceedings of the International Conference on Machine Learning, 2023

2022
Teaching Models to Express Their Uncertainty in Words.
Trans. Mach. Learn. Res., 2022

Training language models to follow instructions with human feedback.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Batch size-invariance for policy optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

TruthfulQA: Measuring How Models Mimic Human Falsehoods.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
WebGPT: Browser-assisted question-answering with human feedback.
CoRR, 2021

Training Verifiers to Solve Math Word Problems.
CoRR, 2021

Phasic Policy Gradient.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark.
Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 2020

Leveraging Procedural Generation to Benchmark Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

2016
The Topological Pigeonhole Principle for Ordinals.
J. Symb. Log., 2016


  Loading...