Marco Mussi

Orcid: 0000-0001-8356-6744

According to our database1, Marco Mussi authored at least 20 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Generalized Kernelized Bandits: Self-Normalized Bernstein-Like Dimension-Free Inequality and Regret Bounds.
CoRR, August, 2025

Gym4ReaL: A Suite for Benchmarking Real-World Reinforcement Learning.
CoRR, July, 2025

Reusing Trajectories in Policy Gradients Enables Fast Convergence.
CoRR, June, 2025

Learning Deterministic Policies with Policy Gradients in Constrained Markov Decision Processes.
CoRR, June, 2025

A Refined Analysis of UCBVI.
CoRR, February, 2025

Generalizing the Regret: an Analysis of Lower and Upper Bounds.
J. Artif. Intell. Res., 2025

Factored-reward bandits with intermediate observations: Regret minimization and best arm identification.
Artif. Intell., 2025

2024
Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting.
CoRR, 2024

State and Action Factorization in Power Grids.
CoRR, 2024

Open Problem: Tight Bounds for Kernelized Multi-Armed Bandits with Bernoulli Rewards.
CoRR, 2024

Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Best Arm Identification for Stochastic Rising Bandits.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Factored-Reward Bandits with Intermediate Observations.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Learning Optimal Deterministic Policies with Stochastic Policy Gradients.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Graph-Triggered Rising Bandits.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Autoregressive Bandits.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023
ARLO: A framework for Automated Reinforcement Learning.
Expert Syst. Appl., August, 2023

Dynamical Linear Bandits.
Proceedings of the International Conference on Machine Learning, 2023

Dynamic Pricing with Volume Discounts in Online Settings.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Pricing the Long Tail by Explainable Product Aggregation and Monotonic Bandits.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022


  Loading...