Filip Sondej

According to our database¹, Filip Sondej authored at least 10 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Implementing surrogate goals for safer bargaining in LLM-based agents.

[BibT_eX]

[DOI]

CoRR, April, 2026

2025

Collapse of Irrelevant Representations (CIR) Ensures Robust and Non-Disruptive LLM Unlearning.

[BibT_eX]

[DOI]

Filip Sondej

Yushi Yang

CoRR, September, 2025

Robust LLM Unlearning with MUDMAN: Meta-Unlearning with Disruption Masking And Normalization.

[BibT_eX]

[DOI]

CoRR, June, 2025

Individual differences in neurophysiological correlates of post-response adaptation: A model-based approach.

[BibT_eX]

[DOI]

NeuroImage, 2025

How Does DPO Reduce Toxicity? A Mechanistic Neuron-Level Analysis.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Are there different types of error monitoring? A microstates analysis of error-related brain activity across three tasks.

[BibT_eX]

[DOI]

Anna Grabowska

Filip Sondej

Magdalena Senderecka

Proceedings of the 47th Annual Meeting of the Cognitive Science Society, 2025

Multi-Agent Security Tax: Trading Off Security and Collaboration Capabilities in Multi-Agent Systems.

[BibT_eX]

[DOI]

Jason Hoelscher-Obermaier

Christian Schröder de Witt

Esben Kran

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

A Machine Learning Study of Anxiety-related Symptoms and Error-related Brain Activity.

[BibT_eX]

[DOI]

Anna Grabowska

Filip Sondej

Magdalena Senderecka

J. Cogn. Neurosci., May, 2024

Ablation is Not Enough to Emulate DPO: How Neuron Dynamics Drive Toxicity Reduction.

[BibT_eX]

[DOI]

CoRR, 2024

2019

On the Role of Trust in Child-Robot Interaction.

[BibT_eX]

[DOI]

Paulina Zguda

Bartlomiej Sniezynski

Proceedings of the 28th IEEE International Conference on Robot and Human Interactive Communication, 2019

Filip Sondej

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...