Matthieu Geist

According to our database, Matthieu Geist authored at least 97 papers between 2008 and 2020.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2020
Munchausen Reinforcement Learning.
CoRR, 2020

Fictitious Play for Mean Field Games: Continuous Time Analysis and Applications.
CoRR, 2020

Show me the Way: Intrinsic Motivation from Demonstrations.
CoRR, 2020

What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study.
CoRR, 2020

Primal Wasserstein Imitation Learning.
CoRR, 2020

Leverage the Average: an Analysis of Regularization in RL.
CoRR, 2020

Self-Attentional Credit Assignment for Transfer in Reinforcement Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Image-Based Place Recognition on Bucolic Environment Across Seasons From Semantic Edge Description.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Modified Actor-Critics.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

CopyCAT: Taking Control of Neural Policies with Constant Attacks.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Momentum in Reinforcement Learning.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Deep Conservative Policy Iteration.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

On the Convergence of Model Free Learning in Mean Field Games.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Stable and Efficient Policy Evaluation.
IEEE Trans. Neural Networks Learn. Syst., 2019

Image-Based Place Recognition on Bucolic Environment Across Seasons From Semantic Edge Description.
CoRR, 2019

On Connections between Constrained Optimization and Reinforcement Learning.
CoRR, 2019

Credit Assignment as a Proxy for Transfer in Reinforcement Learning.
CoRR, 2019

Approximate Fictitious Play for Mean Field Games.
CoRR, 2019

MULEX: Disentangling Exploitation from Exploration in Deep RL.
CoRR, 2019

Foolproof Cooperative Learning.
CoRR, 2019

Targeted Attacks on Deep Reinforcement Learning Agents through Adversarial Observations.
CoRR, 2019

Image-Based Text Classification using 2D Convolutional Neural Networks.
Proceedings of the 2019 IEEE SmartWorld, 2019

Learning Sensor Placement from Demonstration for UAV networks.
Proceedings of the 2019 IEEE Symposium on Computers and Communications, 2019

Semi-supervised Domain Adaptation with Representation Learning for Semantic Segmentation Across Time.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

Learning from a Learner.
Proceedings of the 36th International Conference on Machine Learning, 2019

A Theory of Regularized Markov Decision Processes.
Proceedings of the 36th International Conference on Machine Learning, 2019

ELF: Embedded Localisation of Features in Pre-Trained CNN.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Importance Sampling for Deep System Identification.
Proceedings of the 19th International Conference on Advanced Robotics, 2019

Deep Reinforcement Learning-based Continuous Control for Multicopter Systems.
Proceedings of the 6th International Conference on Control, 2019

2018
Image-based Natural Language Understanding Using 2D Convolutional Neural Networks.
CoRR, 2018

Anderson Acceleration for Reinforcement Learning.
CoRR, 2018

Deep Representation Learning for Domain Adaptation of Semantic Image Segmentation.
CoRR, 2018

A Deep Learning Approach for Privacy Preservation in Assisted Living.
Proceedings of the 2018 IEEE International Conference on Pervasive Computing and Communications Workshops, 2018

2017
Bridging the Gap Between Imitation Learning and Inverse Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., 2017

Reconstruct & Crush Network.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Is the Bellman residual a bad proxy?
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Human Activity Recognition Using Recurrent Neural Networks.
Proceedings of the Machine Learning and Knowledge Extraction, 2017

2016
Difference of Convex Functions Programming Applied to Control with Expert Data.
CoRR, 2016

Should one minimize the expected Bellman residual or maximize the mean value?
CoRR, 2016

Softened Approximate Policy Iteration for Markov Games.
Proceedings of the 33rd International Conference on Machine Learning, 2016

Score-based Inverse Reinforcement Learning.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

2015
Recherche locale de politique dans un espace convexe (Local Policy Search in a Convex Space) [in French].
Rev. d'Intelligence Artif., 2015

Soft-max boosting.
Mach. Learn., 2015

Approximate modified policy iteration and its application to the game of Tetris.
J. Mach. Learn. Res., 2015

Inverse Reinforcement Learning in Relational Domains.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Imitation Learning Applied to Embodied Conversational Agents.
Proceedings of the 4th Workshop on Machine Learning for Interactive Systems, 2015

Convolutional and Recurrent Neural Networks for Activity Recognition in Smart Environment.
Proceedings of the Towards Integrative Machine Learning and Knowledge Extraction, 2015

2014
Off-policy learning with eligibility traces: a survey.
J. Mach. Learn. Res., 2014

Local Policy Search in a Convex Space and Conservative Policy Iteration as Boosted Policy Search.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Boosted Bellman Residual Minimization Handling Expert Demonstrations.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Difference of Convex Functions Programming for Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Predicting when to laugh with structured classification.
Proceedings of the INTERSPEECH 2014, 2014

Boosted and reward-regularized classification for apprenticeship learning.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Co-adaptation in Spoken Dialogue Systems.
Proceedings of the Natural Interaction with Robots, 2014

2013
Algorithmic Survey of Parametric Value Function Approximation.
IEEE Trans. Neural Networks Learn. Syst., 2013

Classification structurée pour l'apprentissage par renforcement inverse (Structured Classification for Inverse Reinforcement Learning) [in French].
Rev. d'Intelligence Artif., 2013

A C++ template-based reinforcement learning library: fitting the code to the mathematics.
J. Mach. Learn. Res., 2013

Policy Search: Any Local Optimum Enjoys a Global Performance Guarantee.
CoRR, 2013

Model-free POMDP optimisation of tutoring systems with echo-state networks.
Proceedings of the SIGDIAL 2013 Conference, 2013

Learning from Demonstrations: Is It Worth Estimating a Reward Function?
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

A Cascaded Supervised Learning Approach to Inverse Reinforcement Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

Particle swarm optimisation of spoken dialogue system strategies.
Proceedings of the INTERSPEECH 2013, 2013

Random projections: A remedy for overfitting issues in time series prediction with echo state networks.
Proceedings of the IEEE International Conference on Acoustics, 2013

Laugh-aware virtual agent and its impact on user amusement.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

2012
A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimization.
IEEE J. Sel. Top. Signal Process., 2012

Optimisation d'un tuteur intelligent à partir d'un jeu de données fixé (Optimization of a tutoring system from a fixed set of data) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Inverse Reinforcement Learning through Structured Classification.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012, 2012

Approximate Modified Policy Iteration.
Proceedings of the 29th International Conference on Machine Learning, 2012

A Dantzig Selector Approach to Temporal Difference Learning.
Proceedings of the 29th International Conference on Machine Learning, 2012

Off-policy learning in large-scale POMDP-based dialogue systems.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Clustering behaviors of Spoken Dialogue Systems users.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Monte-Carlo Swarm Policy Search.
Proceedings of the Swarm and Evolutionary Computation, 2012

Behavior Specific User Simulation in Spoken Dialogue Systems.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

2011
Sample-efficient batch reinforcement learning for dialogue management optimization.
ACM Trans. Speech Lang. Process., 2011

Managing Uncertainty within KTD.
Proceedings of the Active Learning and Experimental Design workshop, 2011

Optimization of a tutoring system from a fixed set of data.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2011

Uncertainty Management for On-Line Optimisation of a POMDP-Based Large-Scale Spoken Dialogue System.
Proceedings of the INTERSPEECH 2011, 2011

User Simulation in Dialogue Systems Using Inverse Reinforcement Learning.
Proceedings of the INTERSPEECH 2011, 2011

Sample Efficient On-Line Learning of Optimal Dialogue Policies with Kalman Temporal Differences.
Proceedings of the IJCAI 2011, 2011

A Non-parametric Approach to Approximate Dynamic Programming.
Proceedings of the 10th International Conference on Machine Learning and Applications and Workshops, 2011

Performance evaluation for particle filters.
Proceedings of the 14th International Conference on Information Fusion, 2011

Recursive Least-Squares Learning with Eligibility Traces.
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

Batch, Off-Policy and Model-Free Apprenticeship Learning.
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

ℓ1-Penalized Projected Bellman Residual.
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

Dynamic neural field optimization using the unscented Kalman filter.
Proceedings of the 2011 IEEE Symposium on Computational Intelligence, 2011

Parametric value function approximation: A unified view.
Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

2010
Différences temporelles de Kalman. Cas déterministe (Kalman Temporal Differences: The Deterministic Case) [in French].
Rev. d'Intelligence Artif., 2010

Kalman Temporal Differences.
J. Artif. Intell. Res., 2010

Sparse Approximate Dynamic Programming for Dialog Management.
Proceedings of the SIGDIAL 2010 Conference, 2010

Revisiting Natural Actor-Critics with Value Function Approximation.
Proceedings of the Modeling Decisions for Artificial Intelligence, 2010

Optimizing spoken dialogue management with fitted value iteration.
Proceedings of the INTERSPEECH 2010, 2010

Eligibility traces through colored noises.
Proceedings of the International Conference on Ultra Modern Telecommunications, 2010

Statistically linearized least-squares temporal differences.
Proceedings of the International Conference on Ultra Modern Telecommunications, 2010

2009
Tracking in Reinforcement Learning.
Proceedings of the Neural Information Processing, 16th International Conference, 2009

Kernelizing Vector Quantization Algorithms.
Proceedings of the ESANN 2009, 2009

Kalman Temporal Differences: The deterministic case.
Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009

2008
Bayesian Reward Filtering.
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008
