Pierre-Luc Bacon

According to our database1, Pierre-Luc Bacon authored at least 43 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons.
CoRR, 2024

Do Transformer World Models Give Better Policy Gradients?
CoRR, 2024

Bridging State and History Representations: Understanding Self-Predictive RL.
CoRR, 2024

2023
Maximum entropy GFlowNets with soft Q-learning.
CoRR, 2023

Course Correcting Koopman Representations.
CoRR, 2023

Motif: Intrinsic Motivation from Artificial Intelligence Feedback.
CoRR, 2023

Block-State Transformer.
CoRR, 2023

Goal-conditioned GFlowNets for Controllable Multi-Objective Molecular Design.
CoRR, 2023

Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Block-State Transformers.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Double Gumbel Q-Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Designing Biological Sequences via Meta-Reinforcement Learning and Bayesian Optimization.
CoRR, 2022

Myriad: a real-world testbed to bridge trajectory optimization and deep learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Direct Behavior Specification via Constrained Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

The Primacy Bias in Deep Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

Continuous-Time Meta-Learning with Forward Mode Differentiation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning.
CoRR, 2021

Neural Algorithmic Reasoners are Implicit Planners.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning?
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020
XLVIN: eXecuted Latent Value Iteration Nets.
CoRR, 2020

Graph neural induction of value iteration.
CoRR, 2020

TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
CoRR, 2020

Policy Evaluation Networks.
CoRR, 2020

Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling.
Proceedings of the 37th International Conference on Machine Learning, 2020

Options of Interest: Temporal Abstraction with Interest Functions.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods.
CoRR, 2019

All-Action Policy Gradient Methods: A Numerical Integration Approach.
CoRR, 2019

2018
The Barbados 2018 List of Open Issues in Continual Learning.
CoRR, 2018

Constructing Temporal Abstractions Autonomously in Reinforcement Learning.
AI Mag., 2018

Convergent TREE BACKUP and RETRACE with Function Approximation.
Proceedings of the 35th International Conference on Machine Learning, 2018

Learning Robust Options.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Learning With Options That Terminate Off-Policy.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

When Waiting Is Not an Option: Learning Options With a Deliberation Cost.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Learnings Options End-to-End for Continuous Action Tasks.
CoRR, 2017

The Option-Critic Architecture.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
A Matrix Splitting Perspective on Planning with Options.
CoRR, 2016

2015
Conditional Computation in Neural Networks for faster models.
CoRR, 2015

Learning and Planning with Timing Information in Markov Decision Processes.
Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015

Analyzing Open Data from the City of Montreal.
Proceedings of the 2nd International Workshop on Mining Urban Data co-located with 32nd International Conference on Machine Learning (ICML 2015), 2015


  Loading...