Tomasz Korbak

ORCID: 0000-0002-6258-2013

According to our database, Tomasz Korbak authored at least 24 papers between 2017 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Self-organisation, (M, R)-systems and enactive cognitive science.
Adapt. Behav., February, 2023

Towards Understanding Sycophancy in Language Models.
CoRR, 2023

Compositional preference models for aligning LMs.
CoRR, 2023

The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A".
CoRR, 2023

Taken out of context: On measuring situational awareness in LLMs.
CoRR, 2023

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback.
CoRR, 2023

Inverse Scaling: When Bigger Isn't Better.
CoRR, 2023

Training Language Models with Language Feedback at Scale.
CoRR, 2023

Improving Code Generation by Training with Natural Language Feedback.
CoRR, 2023

Models of symbol emergence in communication: a conceptual review and a guide for avoiding local minima.
CoRR, 2023

Pretraining Language Models with Human Preferences.
Proceedings of the International Conference on Machine Learning, 2023

Aligning Language Models with Preferences through f-divergence Minimization.
Proceedings of the International Conference on Machine Learning, 2023

2022
On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Controlling Conditional Language Models without Catastrophic Forgetting.
Proceedings of the International Conference on Machine Learning, 2022

RL with KL penalties is better viewed as Bayesian inference.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Controlling Conditional Language Models with Distributional Policy Gradients.
CoRR, 2021

Energy-Based Models for Code Generation under Compilability Constraints.
CoRR, 2021

Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Measuring non-trivial compositionality in emergent communication.
CoRR, 2020

The Emergence of Action-grounded Compositional Communication.
Proceedings of the 42nd Annual Meeting of the Cognitive Science Society, 2020

2019
Developmentally motivated emergence of compositional communication via template transfer.
CoRR, 2019

Exploiting Unsupervised Pre-training and Automated Feature Engineering for Low-resource Hate Speech Detection in Polish.
CoRR, 2019

2017
Fine-tuning Tree-LSTM for phrase-level sentiment classification on a Polish dependency treebank. Submission to PolEval task 2.
CoRR, 2017

Fine-Tuning Tree-LSTM for Phrase-Level Sentiment Classification on a Polish Dependency Treebank.
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2017
