Erik Jones

According to our database1, Erik Jones authored at least 10 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Feedback Loops With Language Models Drive In-Context Reward Hacking.
CoRR, 2024

2023
Orca 2: Teaching Small Language Models How to Reason.
CoRR, 2023

Teaching Language Models to Hallucinate Less with Synthetic Tasks.
CoRR, 2023

Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models.
CoRR, 2023

Mass-Producing Failures of Multimodal Systems with Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Automatically Auditing Large Language Models via Discrete Optimization.
Proceedings of the International Conference on Machine Learning, 2023

2022
Capturing Failures of Large Language Models via Human Cognitive Biases.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Selective Classification Can Magnify Disparities Across Groups.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Impact of a deep learning assistant on the histopathologic classification of liver cancer.
npj Digit. Medicine, 2020

Robust Encodings: A Framework for Combating Adversarial Typos.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020


  Loading...