Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Data Governance in the Age of Large-Scale Data-Driven Language Technology.

Proceedings of the FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, June 21, 2022

Beyond Ads: Sequential Decision-Making Algorithms in Law and Public Policy.

Proceedings of the 2022 Symposium on Computer Science and Law, 2022

2021

Beyond Ads: Sequential Decision-Making Algorithms in Public Policy.

CoRR, 2021

On the Opportunities and Risks of Foundation Models.

CoRR, 2021

When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset.

CoRR, 2021

An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning.

Dilip Arumugam

Peter Henderson

Pierre-Luc Bacon

CoRR, 2021

When does pretraining help?: assessing self-supervised learning for law and the CaseHOLD dataset of 53,000+ legal holdings.

Proceedings of the ICAIL '21: Eighteenth International Conference for Artificial Intelligence and Law, São Paulo, Brazil, June 21, 2021

TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning?

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020

Ideas for Improving the Field of Machine Learning: Summarizing Discussion from the NeurIPS 2019 Retrospectives Workshop.

CoRR, 2020

TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

CoRR, 2020

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims.

Thomas Krendl Gilbert

CoRR, 2020

Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning.

CoRR, 2020

With Little Power Comes Great Responsibility.

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019

Separating value functions across time-scales.

CoRR, 2019

Separable value functions across time-scales.

Proceedings of the 36th International Conference on Machine Learning, 2019

2018

An Introduction to Deep Reinforcement Learning.

Vincent François-Lavet

Found. Trends Mach. Learn., 2018

A Survey of Available Corpora For Building Data-Driven Dialogue Systems: The Journal Version.

Dialogue & Discourse, 2018

Distilling Information from a Flood: A Possibility for the Use of Meta-Analysis and Systematic Review in Machine Learning Research.

Peter Henderson

Emma Brunskill

CoRR, 2018

The RLLChatbot: a solution to the ConvAI challenge.

Michael D. Noseworthy

Prasanna Parthasarathi

Joelle Pineau

CoRR, 2018

Adversarial Gain.

CoRR, 2018

Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods.

Peter Henderson

Joshua Romoff

Joelle Pineau

CoRR, 2018

Cost Adaptation for Robust Decentralized Swarm Behaviour.

Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Reward Estimation for Variance Reduction in Deep Reinforcement Learning.

Joshua Romoff

Peter Henderson

Alexandre Piché

Vincent François-Lavet

Joelle Pineau

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Ethical Challenges in Data-Driven Dialogue Systems.

Peter Henderson

Koustuv Sinha

Nicolas Angelard-Gontier

Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, 2018

Deep Reinforcement Learning That Matters.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Learning Robust Dialog Policies in Noisy Environments.

CoRR, 2017

Bayesian Policy Gradients via Alpha Divergence Dropout Inference.

CoRR, 2017

Benchmark Environments for Multitask Learning in Continuous Domains.

CoRR, 2017

Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control.

CoRR, 2017

An Analysis of Parallelized Motion Masking Using Dual-Mode Single Gaussian Models.

Peter Henderson

Matthew Vertescher

CoRR, 2017

Underwater multi-robot convoying using visual tracking by detection.

Juan Camilo Gamboa Higuera

Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

2016

Chaotic Memory Randomization for Securing Embedded Systems.

Peter Henderson

Muthucumaru Maheswaran

CoRR, 2016

2015

A Survey of Available Corpora for Building Data-Driven Dialogue Systems.

CoRR, 2015

Peter Henderson
