Piotr Stanczyk

According to our database¹, Piotr Stanczyk authored at least 12 papers between 2010 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

BOND: Aligning LLMs with Best-of-N Distillation.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

BOND: Aligning LLMs with Best-of-N Distillation.

[BibT_eX]

[DOI]

CoRR, 2024

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models.

[BibT_eX]

[DOI]

CoRR, 2023

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2021

RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Launchpad: A Programming Model for Distributed Machine Learning Research.

[BibT_eX]

[DOI]

CoRR, 2021

What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

2020

What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study.

[BibT_eX]

[DOI]

CoRR, 2020

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Google Research Football: A Novel Reinforcement Learning Environment.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2010

Perfect Matching for Biconnected Cubic Graphs in O(n log2n) Time.

[BibT_eX]

[DOI]

Krzysztof Diks

Piotr Stanczyk

Proceedings of the SOFSEM 2010: Theory and Practice of Computer Science, 2010

Piotr Stanczyk

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...