Gheorghe Comanici

According to our database1, Gheorghe Comanici authored at least 15 papers between 2010 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Affordances Enable Partial World Modeling with LLMs.
CoRR, February, 2026

2025
An AI system to help scientists write expert-level empirical software.
CoRR, September, 2025

2024
Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

2023
Vision-Language Models as a Source of Rewards.
CoRR, 2023

2022
Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning.
CoRR, 2022

2021
AndroidEnv: A Reinforcement Learning Platform for Android.
CoRR, 2021

Temporally Abstract Partial Models.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
What can I do here? A Theory of Affordances in Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
The Option Keyboard: Combining Skills in Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2015
Basis refinement strategies for linear value function approximation in MDPs.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Representation Discovery for MDPs Using Bisimulation Metrics.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2012
On-the-Fly Algorithms for Bisimulation Metrics.
Proceedings of the Ninth International Conference on Quantitative Evaluation of Systems, 2012

An Empirical Analysis of Off-policy Learning in Discrete MDPs.
Proceedings of the Tenth European Workshop on Reinforcement Learning, 2012

2011
Basis Function Discovery Using Spectral Clustering and Bisimulation Metrics.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Optimal policy switching algorithms for reinforcement learning.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010


  Loading...