Ariel Kwiatkowski

Orcid: 0000-0002-9391-9993

According to our database1, Ariel Kwiatkowski authored at least 13 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
torchtune: PyTorch native post-training library.
CoRR, May, 2026

Likelihood-Based Reward Designs for General LLM Reasoning.
CoRR, February, 2026

Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability.
CoRR, January, 2026

2025
Soft Tokens, Hard Truths.
CoRR, September, 2025

Gymnasium: A Standard Interface for Reinforcement Learning Environments.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

PILAF: Optimal Human Preference Sampling for Reward Modeling.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2023
Understanding reinforcement learned crowds.
Comput. Graph., February, 2023

Simulating crowds with reinforcement learning. (Simulation de foules avec l'apprentissage par renforcement).
PhD thesis, 2023

UGAE: A Novel Approach to Non-exponential Discounting.
CoRR, 2023

Reward Function Design for Crowd Simulation via Reinforcement Learning.
Proceedings of the 16th ACM SIGGRAPH Conference on Motion, Interaction and Games, 2023

2022
A Survey on Reinforcement Learning Methods in Character Animation.
Comput. Graph. Forum, 2022

Creating Interactive Crowds with Reinforcement Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Behaviour-Conditioned Policies for Cooperative Reinforcement Learning Tasks.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021


  Loading...