Jonathan Nöther

According to our database1, Jonathan Nöther authored at least 4 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Policy Teaching via Data Poisoning in Learning from Human Preferences.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025

Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Defending Against Unknown Corrupted Agents: Reinforcement Learning of Adversarially Robust Nash Equilibria.
Trans. Mach. Learn. Res., 2024

2023
Implicit Poisoning Attacks in Two-Agent Reinforcement Learning: Adversarial Policies for Training-Time Attacks.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023


  Loading...