Joelle Pineau

According to our database1, Joelle Pineau authored at least 167 papers between 2000 and 2018.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2018
Streaming kernel regression with provably adaptive mean, variance, and regularization.
Journal of Machine Learning Research, 2018

A Decision-Theoretic Approach for the Collaborative Control of a Smart Wheelchair.
I. J. Social Robotics, 2018

A Survey of Available Corpora For Building Data-Driven Dialogue Systems: The Journal Version.
D&D, 2018

TarMAC: Targeted Multi-Agent Communication.
CoRR, 2018

Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods.
CoRR, 2018

Extending Neural Generative Conversational Model using External Knowledge Sources.
CoRR, 2018

Sequential Coordination of Deep Models for Learning Visual Arithmetic.
CoRR, 2018

Combined Reinforcement Learning via Abstract Representations.
CoRR, 2018

Online Adaptative Curriculum Learning for GANs.
CoRR, 2018

The Bottleneck Simulator: A Model-based Deep Reinforcement Learning Approach.
CoRR, 2018

A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning.
CoRR, 2018

Focused Hierarchical RNNs for Conditional Sequence Processing.
CoRR, 2018

Randomized Value Functions via Multiplicative Normalizing Flows.
CoRR, 2018

Reward Estimation for Variance Reduction in Deep Reinforcement Learning.
CoRR, 2018

Decoupling Dynamics and Reward for Transfer Learning.
CoRR, 2018

Disentangling the independently controllable factors of variation by interacting with the world.
CoRR, 2018

A Deep Reinforcement Learning Chatbot (Short Version).
CoRR, 2018

An Inference-Based Policy Gradient Method for Learning Options.
Proceedings of the 35th International Conference on Machine Learning, 2018

Focused Hierarchical RNNs for Conditional Sequence Processing.
Proceedings of the 35th International Conference on Machine Learning, 2018

Extending Neural Generative Conversational Model using External Knowledge Sources.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Reward Estimation for Variance Reduction in Deep Reinforcement Learning.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Deep Reinforcement Learning That Matters.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

OptionGAN: Learning Joint Reward-Policy Options Using Generative Adversarial Inverse Reinforcement Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Modeling Glucagon Action in Patients With Type 1 Diabetes.
IEEE J. Biomedical and Health Informatics, 2017

Training End-to-End Dialogue Systems with the Ubuntu Dialogue Corpus.
D&D, 2017

Tensor Regression Networks with various Low-Rank Tensor Approximations.
CoRR, 2017

Ethical Challenges in Data-Driven Dialogue Systems.
CoRR, 2017

ACtuAL: Actor-Critic Under Adversarial Learning.
CoRR, 2017

OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning.
CoRR, 2017

Deep Reinforcement Learning that Matters.
CoRR, 2017

A Deep Reinforcement Learning Chatbot.
CoRR, 2017

Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses.
CoRR, 2017

Independently Controllable Factors.
CoRR, 2017

Streaming kernel regression with provably adaptive mean, variance, and regularization.
CoRR, 2017

Independently Controllable Features.
CoRR, 2017

MACA: A Modular Architecture for Conversational Agents.
Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue, 2017

Predicting Success in Goal-Driven Human-Human Dialogues.
Proceedings of the 18th Annual SIGdial Meeting on Discourse and Dialogue, 2017

Multitask Spectral Learning of Weighted Automata.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Piecewise Latent Variables for Neural Variational Text Processing.
Proceedings of the 2nd Workshop on Structured Prediction for Natural Language Processing, 2017

Piecewise Latent Variables for Neural Variational Text Processing.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Sparse Probabilistic Model of User Preference Data.
Proceedings of the Advances in Artificial Intelligence, 2017

Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Online Bagging and Boosting for Imbalanced Data Streams.
IEEE Trans. Knowl. Data Eng., 2016

Practical Kernel-Based Reinforcement Learning.
Journal of Machine Learning Research, 2016

Socially Adaptive Path Planning in Human Environments Using Inverse Reinforcement Learning.
I. J. Social Robotics, 2016

Learning Robust Features using Deep Learning for Automatic Seizure Detection.
CoRR, 2016

A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues.
CoRR, 2016

Multi-modal Variational Encoder-Decoders.
CoRR, 2016

Generative Deep Neural Networks for Dialogue: A Short Review.
CoRR, 2016

On the Evaluation of Dialogue Systems with Next Utterance Classification.
CoRR, 2016

How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation.
CoRR, 2016

Bayesian Reinforcement Learning: A Survey.
CoRR, 2016

An Actor-Critic Algorithm for Sequence Prediction.
CoRR, 2016

On the Evaluation of Dialogue Systems with Next Utterance Classification.
Proceedings of the SIGDIAL 2016 Conference, 2016

Learning Robust Features using Deep Learning for Automatic Seizure Detection.
Proceedings of the 1st Machine Learning in Health Care, 2016

Generalized Dictionary for Multitask Learning with Boosting.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Learning time series models for pedestrian motion prediction.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

On the Use of Modular Software and Hardware for Designing Wheelchair Robots.
Proceedings of the 2016 AAAI Spring Symposia, 2016

Multitask Generalized Eigenvalue Program.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Incremental Stochastic Factorization for Online Reinforcement Learning.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Bayesian Reinforcement Learning: A Survey.
Foundations and Trends in Machine Learning, 2015

Hierarchical Neural Network Generative Models for Movie Dialogues.
CoRR, 2015

A Survey of Available Corpora for Building Data-Driven Dialogue Systems.
CoRR, 2015

The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems.
CoRR, 2015

Conditional Computation in Neural Networks for faster models.
CoRR, 2015

The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems.
Proceedings of the SIGDIAL 2015 Conference, 2015

Automatically characterizing driving activities onboard smart wheelchairs from accelerometer data.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

An Expectation-Maximization Algorithm to Compute a Stochastic Factorization From Data.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Person tracking and following with 2D laser scanners.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Analyzing Open Data from the City of Montreal.
Proceedings of the 2nd International Workshop on Mining Urban Data co-located with 32nd International Conference on Machine Learning (ICML 2015), 2015

Improving the Design and Discovery of Dynamic Treatment Strategies Using Recent Results in Sequential Decision-Making.
Proceedings of the Twenty-Fifth International Conference on Automated Planning and Scheduling, 2015

Missteps in Robot Social Navigation.
Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

Adaptive Treatment Allocation Using Sub-Sampled Gaussian Processes.
Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

Online Boosting Algorithms for Anytime Transfer and Multitask Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Information Gathering and Reward Exploitation of Subgoals for POMDPs.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Efficient learning and planning with compressed predictive states.
Journal of Machine Learning Research, 2014

Policy Iteration Based on Stochastic Factorization.
J. Artif. Intell. Res., 2014

Online Planning Algorithms for POMDPs.
CoRR, 2014

Non-Deterministic Policies in Markovian Decision Processes.
CoRR, 2014

Practical Kernel-Based Reinforcement Learning.
CoRR, 2014

Lifelong Learning of Discriminative Representations.
CoRR, 2014

Methods of Moments for Learning Stochastic Languages: Unified Presentation and Empirical Comparison.
Proceedings of the 31th International Conference on Machine Learning, 2014

Estimating People's Subjective Experiences of Robot Behavior.
Proceedings of the 2014 AAAI Fall Symposia, Arlington, Virginia, USA, November 13-15, 2014, 2014

2013
Time Series Analysis Using Geometric Template Matching.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Online Ensemble Learning for Imbalanced Data Streams.
CoRR, 2013

Efficient Learning and Planning with Compressed Predictive States.
CoRR, 2013

End-to-End Text Recognition with Hybrid HMM Maxout Models.
CoRR, 2013

A survey of point-based POMDP solvers.
Autonomous Agents and Multi-Agent Systems, 2013

Maximum Mean Discrepancy Imitation Learning.
Proceedings of the Robotics: Science and Systems IX, Technische Universität Berlin, Berlin, Germany, June 24, 2013

Learning from Limited Demonstrations.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Bellman Error Based Feature Generation using Random Projections on Sparse Spaces.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Modelling Sparse Dynamical Systems with Compressed Predictive State Representations.
Proceedings of the 30th International Conference on Machine Learning, 2013

Designing Intelligent Wheelchairs: Reintegrating AI.
Proceedings of the Designing Intelligent Robots: Reintegrating AI II, 2013

Mixed Observability Predictive State Representations.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Building Adaptive Dialogue Systems Via Bayes-Adaptive POMDPs.
J. Sel. Topics Signal Processing, 2012

Policy-contingent abstraction for robust robot control
CoRR, 2012

Bellman Error Based Feature Generation using Random Projections on Sparse Spaces
CoRR, 2012

Proceedings of the 29th International Conference on Machine Learning (ICML-12)
CoRR, 2012

Model-Based Bayesian Reinforcement Learning in Large Structured Domains
CoRR, 2012

PAC-Bayesian Policy Evaluation for Reinforcement Learning
CoRR, 2012

Active Learning for Developing Personalized Treatment
CoRR, 2012

Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs.
Artif. Intell., 2012

On-line Reinforcement Learning Using Incremental Kernel-Based Stochastic Factorization.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

An Empirical Analysis of Off-policy Learning in Discrete MDPs.
Proceedings of the Tenth European Workshop on Reinforcement Learning, 2012

Design and Evaluation of a Flexible Interface for Spatial Navigation.
Proceedings of the Ninth Conference on Computer and Robot Vision, 2012

Compressed Least-Squares Regression on Sparse Spaces.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
A bistable computational model of recurring epileptiform activity as observed in rodent slice preparations.
Neural Networks, 2011

Informing sequential clinical decision-making through reinforcement learning: an empirical study.
Machine Learning, 2011

A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes.
Journal of Machine Learning Research, 2011

Non-Deterministic Policies in Markovian Decision Processes.
J. Artif. Intell. Res., 2011

Anytime Point-Based Approximations for Large POMDPs
CoRR, 2011

PAC-Bayesian Policy Evaluation for Reinforcement Learning.
Proceedings of the UAI 2011, 2011

Active Learning for Developing Personalized Treatment.
Proceedings of the UAI 2011, 2011

The Duality of State and Observation in Probabilistic Transition Systems.
Proceedings of the Logic, Language, and Computation, 2011

Reinforcement Learning using Kernel-Based Stochastic Factorization.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Bayesian reinforcement learning for POMDP-based dialogue systems.
Proceedings of the IEEE International Conference on Acoustics, 2011

A Framework for Computing Bounds for the Return of a Policy.
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

Goal-Directed Online Learning of Predictive Models.
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

Mobility profile and wheelchair driving skills of powered wheelchair users: Sensor-based event recognition using a support vector machine classifier.
Proceedings of the 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2011

Active learning for personalizing treatment.
Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

Automatic Seizure Detection in an In-Vivo Model of Epilepsy.
Proceedings of the Computational Physiology, 2011

2010
Towards a standardized test for intelligent wheelchairs.
Proceedings of the 10th Performance Metrics for Intelligent Systems Workshop, 2010

PAC-Bayesian Model Selection for Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Variable resolution decomposition for robotic navigation under a POMDP framework.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Multi-tasking SLAM.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Automatically suggesting topics for augmenting text documents.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Treating Epilepsy by Reinforcement Learning Via Manifold-Based Simulation.
Proceedings of the Manifold Learning and Its Applications, 2010

2009
Development and Validation of a Robust Speech Interface for Improved Human-Robot Interaction.
I. J. Social Robotics, 2009

Treating Epilepsy via Adaptive Neurostimulation: a Reinforcement Learning Approach.
Int. J. Neural Syst., 2009

AAAI 2008 Workshop Reports.
AI Magazine, 2009

Manifold Embeddings for Model-Based Reinforcement Learning under Partial Observability.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

A bayesian reinforcement learning approach for customizing human-robot interfaces.
Proceedings of the 14th International Conference on Intelligent User Interfaces, 2009

Wikispeedia: An Online Game for Inferring Semantic Distances between Concepts.
Proceedings of the IJCAI 2009, 2009

Completing wikipedia's hyperlink structure through dimensionality reduction.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
Online Planning Algorithms for POMDPs.
J. Artif. Intell. Res., 2008

Model-Based Bayesian Reinforcement Learning in Large Structured Domains.
Proceedings of the UAI 2008, 2008

MDPs with Non-Deterministic Policies.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Bayes-Adaptive POMDPs: A New Perspective on the Explore-Exploit Tradeoff in Partially Observable Domains.
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2008

Reinforcement Learning with Limited Reinforcement: Using Bayes Risk for Active Learning in POMDPs.
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2008

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation.
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs.
Proceedings of the Machine Learning, 2008

Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

A Variance Analysis for POMDP Policy Evaluation.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
Apprentissage actif dans les processus décisionnels de Markov partiellement observables L'algorithme MEDUSA.
Revue d'Intelligence Artificielle, 2007

Theoretical Analysis of Heuristic Search Methods for Online POMDPs.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Bayes-Adaptive POMDPs.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

A formal framework for robot learning and control under model uncertainty.
Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

Recurrent Boosting for Classification of Natural and Synthetic Time-Series Data.
Proceedings of the Advances in Artificial Intelligence, 2007

SmartWheeler: A Robotic Wheelchair Test-Bed for Investigating New Models of Human-Robot Interaction.
Proceedings of the Multidisciplinary Collaboration for Socially Assistive Robotics, 2007

2006
Planning under uncertainty in robotics.
Robotics and Autonomous Systems, 2006

Anytime Point-Based Approximations for Large POMDPs.
J. Artif. Intell. Res., 2006

PAC-Learning of Markov Models with Hidden State.
Proceedings of the Machine Learning: ECML 2006, 2006

RRT-Plan: A Randomized Algorithm for STRIPS Planning.
Proceedings of the Sixteenth International Conference on Automated Planning and Scheduling, 2006

Representing Systems with Hidden State.
Proceedings of the Proceedings, 2006

2005
POMDP Planning for Robust Robot Control.
Proceedings of the Robotics Research: Results of the 12th International Symposium, 2005

Active Learning in Partially Observable Markov Decision Processes.
Proceedings of the Machine Learning: ECML 2005, 2005

2003
Towards robotic assistants in nursing homes: Challenges and results.
Robotics and Autonomous Systems, 2003

Policy-contingent abstraction for robust robot control.
Proceedings of the UAI '03, 2003

Applying Metric-Trees to Belief-Point POMDPs.
Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003

Point-based value iteration: An anytime algorithm for POMDPs.
Proceedings of the IJCAI-03, 2003

2002
Robotic Assistance During Ambulation by Older Adults.
Proceedings of the AMIA 2002, 2002

Experiences with a Mobile Robotic Guide for the Elderly.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

2000
Fast reinforcement learning of dialog strategies.
Proceedings of the IEEE International Conference on Acoustics, 2000

Spoken Dialogue Management Using Probabilistic Reasoning.
Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, 2000


  Loading...