Luckeciano Carvalho Melo

Orcid: 0000-0003-2599-6265

According to our database1, Luckeciano Carvalho Melo authored at least 13 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
InfoQuest: Evaluating Multi-Turn Dialogue Agents for Open-Ended Conversations with Hidden Context.
CoRR, February, 2025

Uncertainty-Aware Step-wise Verification with Generative Reward Models.
CoRR, February, 2025

2024
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning.
CoRR, 2024

Temporal-Difference Variational Continual Learning.
CoRR, 2024

Deep Bayesian Active Learning for Preference Modeling in Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2022
Multiagent Reinforcement Learning for Strategic Decision Making and Control in Robotic Soccer Through Self-Play.
IEEE Access, 2022

Transformers are Meta-Reinforcement Learners.
Proceedings of the International Conference on Machine Learning, 2022

2021
Learning Humanoid Robot Running Motions with Symmetry Incentive through Proximal Policy Optimization.
J. Intell. Robotic Syst., 2021

2020
Contextual Meta-Bandit for Recommender Systems Selection.
Proceedings of the RecSys 2020: Fourteenth ACM Conference on Recommender Systems, 2020

MARS-Gym: A Gym framework to model, train, and evaluate Recommender Systems for Marketplaces.
Proceedings of the 20th International Conference on Data Mining Workshops, 2020

2019
Bottom-Up Meta-Policy Search.
CoRR, 2019

Learning Humanoid Robot Motions Through Deep Neural Networks.
CoRR, 2019

Learning Humanoid Robot Running Skills through Proximal Policy Optimization.
Proceedings of the Latin American Robotics Symposium, 2019


  Loading...