Jannik Brinkmann

According to our database1, Jannik Brinkmann authored at least 18 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Mitigating Adaptive Attacks against Reasoning Models with Activation Consistency Training.
CoRR, May, 2026

Agents of Chaos.
CoRR, February, 2026

Mechanisms of AI Protein Folding in ESMFold.
CoRR, February, 2026

NSA: Neuro-symbolic ARC Challenge.
Proceedings of the 34th European Symposium on Artificial Neural Networks, 2026

2025
In-Context Algebra.
CoRR, December, 2025

In-Context Learning Without Copying.
CoRR, November, 2025

Jailbreak Strength and Model Similarity Predict Transferability.
CoRR, June, 2025

Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Steering Language Models in Multi-Token Generation: A Case Study on Tense and Aspect.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability.
CoRR, 2024

NNsight and NDIF: Democratizing Access to Foundation Model Internals.
CoRR, 2024

Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Unsupervised Extraction of Test Scenarios from Time-Series Sensor Data using Trace Graphs.
Proceedings of the 57th Hawaii International Conference on System Sciences, 2024

GOV-REK: Governed Reward Engineering Kernels for Designing Robust Multi-Agent Reinforcement Learning Systems.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
A Multidimensional Analysis of Social Biases in Vision Transformers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Bias Mitigation for Large Language Models using Adversarial Learning.
Proceedings of the 1st Workshop on Fairness and Bias in AI co-located with 26th European Conference on Artificial Intelligence (ECAI 2023), 2023


  Loading...