Barna Pásztor

According to our database1, Barna Pásztor authored at least 14 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Aligning Language Models from User Interactions.
CoRR, March, 2026

ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning.
CoRR, March, 2026

RewardUQ: A Unified Framework for Uncertainty-Aware Reward Models.
CoRR, February, 2026

2025
Stackelberg Learning from Human Feedback: Preference Optimization as a Sequential Game.
CoRR, December, 2025

Ride-Sourcing Vehicle Rebalancing with Service Accessibility Guarantees via Constrained Mean-Field Reinforcement Learning.
CoRR, March, 2025

Learning Collusion in Episodic, Inventory-Constrained Markets.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

2024
Stochastic Bilevel Optimization with Lower-Level Contextual Markov Decision Processes.
CoRR, 2024

Melting Pot Contest: Charting the Future of Generalized Cooperative Intelligence.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Contextual Bilevel Reinforcement Learning for Incentive Alignment.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Bandits with Preference Feedback: A Stackelberg Game Perspective.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Safe Model-Based Multi-Agent Mean-Field Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2023
Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning.
Trans. Mach. Learn. Res., 2023

2020
On the impact of publicly available news and information transfer to financial markets.
CoRR, 2020

Stochastic Gradient Descent Works Really Well for Stress Minimization.
Proceedings of the Graph Drawing and Network Visualization - 28th International Symposium, 2020


  Loading...