Bruno C. da Silva

Orcid: 0000-0002-3708-5728

Affiliations:
  • University of Massachusetts, Amherst, MA, USA
  • Federal University of Rio Grande do Sul (UFRGS), Institute of Informatics, Porto Alegre, Brazil (former)


According to our database1, Bruno C. da Silva authored at least 54 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
From Past to Future: Rethinking Eligibility Traces.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Coagent Networks: Generalized and Scaled.
CoRR, 2023

Behavior Alignment via Reward Function Optimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Multi-Step Generalized Policy Improvement by Leveraging Approximate Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Seldonian Toolkit: Building Software with Safe and Fair Machine Learning.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: ICSE 2023 Companion Proceedings, 2023

Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
Model-Based Reinforcement Learning with SINDy.
CoRR, 2022

Enforcing Delayed-Impact Fairness Guarantees.
CoRR, 2022

Off-Policy Evaluation for Action-Dependent Non-stationary Environments.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Look-Ahead Reinforcement Learning for Load Balancing Network Traffic.
Proceedings of the IEEE Symposium on Computers and Communications, 2022

Constrained Offline Policy Optimization.
Proceedings of the International Conference on Machine Learning, 2022

Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer.
Proceedings of the International Conference on Machine Learning, 2022

Fairness Guarantees under Demographic Shift.
Proceedings of the Tenth International Conference on Learning Representations, 2022

RADAR: Reactive and Deliberative Adaptive Reasoning - Learning When to Think Fast and When to Think Slow.
Proceedings of the IEEE International Conference on Development and Learning, 2022

2021
Quantifying the impact of non-stationarity in reinforcement learning-based traffic signal control.
PeerJ Comput. Sci., 2021

Patterns of high-risk drinking among medical students: A web-based survey with machine learning.
Comput. Biol. Medicine, 2021

Universal Off-Policy Evaluation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Posterior Value Functions: Hindsight Baselines for Policy Gradient Methods.
Proceedings of the 38th International Conference on Machine Learning, 2021

Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020
Toll-based reinforcement learning for efficient equilibria in route choice.
Knowl. Eng. Rev., 2020

Data clustering for efficient approximate computing.
Des. Autom. Embed. Syst., 2020

Autonomous learning of multiple, context-dependent tasks.
CoRR, 2020

Optimal Options for Multi-Task Reinforcement Learning Under Time Constraints.
CoRR, 2020

2019
Autonomous Open-Ended Learning of Interdependent Tasks.
CoRR, 2019

Parameterized Melody Generation with Autoencoders and Temporally-Consistent Noise.
Proceedings of the 19th International Conference on New Interfaces for Musical Expression, 2019

A Methodology for Neural Network Architectural Tuning Using Activation Occurrence Maps.
Proceedings of the International Joint Conference on Neural Networks, 2019

Identifying Reusable Early-Life Options.
Proceedings of the Joint IEEE 9th International Conference on Development and Learning and Epigenetic Robotics, 2019

A Compression-Inspired Framework for Macro Discovery.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018
A task-and-technique centered survey on visual analytics for deep learning model engineering.
Comput. Graph., 2018

Efficient Local Memory Support for Approximate Computing.
Proceedings of the VIII Brazilian Symposium on Computing Systems Engineering, 2018

Comparing Multi-Armed Bandit Algorithms and Q-learning for Multiagent Action Selection: a Case Study in Route Choice.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Towards Designing Optimal Reward Functions in Multi-Agent Reinforcement Learning Problems.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

2017
Identifying Reusable Macros for Efficient Exploration via Policy Compression.
CoRR, 2017

On Ensuring that Intelligent Machines Are Well-Behaved.
CoRR, 2017

Task-based behavior generalization via manifold clustering.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Learning to Minimise Regret in Route Choice.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

A Flexible Approach for Designing Optimal Reward Functions.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Context-Based Concurrent Experience Sharing in Multiagent Systems.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

2016
Using Topological Statistics to Bias and Accelerate Route Choice: Preliminary Findings in Synthetic and Real-World Road Networks.
Proceedings of the Ninth International Workshop on Agents in Traffic and Transportation (ATT 2016) co-located with the 25th International Joint Conference On Artificial Intelligence (IJCAI 2016), 2016

Energetic Natural Gradient Descent.
Proceedings of the 33nd International Conference on Machine Learning, 2016

2014
Learning parameterized motor skills on a humanoid robot.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Active Learning of Parameterized Skills.
Proceedings of the 31th International Conference on Machine Learning, 2014

2013
Biasing the behavior of organizationally adept agents: (extended abstract).
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

2012
Learning Parameterized Skills.
Proceedings of the 29th International Conference on Machine Learning, 2012

TD-DeltaPi: A Model-Free Algorithm for Efficient Exploration.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2010
Learning in groups of traffic signals.
Eng. Appl. Artif. Intell., 2010

2007
Distributed constraint propagation for diagnosis of faults in physical processes.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

2006
Dealing with non-stationary environments using context detection.
Proceedings of the Machine Learning, 2006

Reinforcement Learning based Control of Traffic Lights in Non-stationary Environments: A Case Study in a Microscopic Simulator.
Proceedings of the 4th European Workshop on Multi-Agent Systems EUMAS'06, 2006

ITSUMO: an Intelligent Transportation System for Urban Mobility.
Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2006), 2006

Improving reinforcement learning with context detection.
Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2006), 2006

RL-CD: Dealing with Non-Stationarity in Reinforcement Learning.
Proceedings of the Proceedings, 2006

2004
ITSUMO: An Intelligent Transportation System for Urban Mobility.
Proceedings of the Innovative Internet Community Systems, 4th InternationalWorkshop, 2004


  Loading...