Dale Schuurmans

According to our database1, Dale Schuurmans authored at least 176 papers between 1989 and 2018.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2018
Planning and Learning with Stochastic Action Sets.
CoRR, 2018

Variational Rejection Sampling.
CoRR, 2018

Smoothed Action Value Functions for Learning Gaussian Policies.
CoRR, 2018

Planning and Learning with Stochastic Action Sets.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Smoothed Action Value Functions for Learning Gaussian Policies.
Proceedings of the 35th International Conference on Machine Learning, 2018

Variational Rejection Sampling.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017
Generalized Conditional Gradient for Sparse Estimation.
Journal of Machine Learning Research, 2017

Safe Exploration for Identifying Linear Systems via Robust Optimization.
CoRR, 2017

Trust-PCL: An Off-Policy Trust Region Method for Continuous Control.
CoRR, 2017

Bridging the Gap Between Value and Policy Based Reinforcement Learning.
CoRR, 2017

Holographic Feature Representations of Deep Networks.
Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017

Bridging the Gap Between Value and Policy Based Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Multi-view Matrix Factorization for Linear Dynamical System Estimation.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Logistic Markov Decision Processes.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Formalizing Anthropomorphism Through Games: A Study in Deep Neural Networks.
Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Stochastic Neural Networks with Monotonic Activation Functions.
CoRR, 2016

Reward Augmented Maximum Likelihood for Neural Structured Prediction.
CoRR, 2016

Improving Policy Gradient by Exploring Under-appreciated Rewards.
CoRR, 2016

Deep Learning Games.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Reward Augmented Maximum Likelihood for Neural Structured Prediction.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Stochastic Neural Networks with Monotonic Activation Functions.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

Scalable and Sound Low-Rank Tensor Learning.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015
On Time Domain Analysis of Photoplethysmogram Signals for Monitoring Heat Stress.
Sensors, 2015

Learning with a Strong Adversary.
CoRR, 2015

Frequency analysis of photoplethysmogram and its derivatives.
Computer Methods and Programs in Biomedicine, 2015

Scalable Metric Learning for Co-Embedding.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

Generalization in Unsupervised Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

Embedding Inference for Structured Multilabel Prediction.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Correcting Covariate Shift with the Frank-Wolfe Algorithm.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Semi-Supervised Zero-Shot Classification with Label Representation Learning.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Variance Reduction via Antithetic Markov Chains.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

Optimal Estimation of Multivariate ARMA Models.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Generalized Conditional Gradient for Sparse Estimation.
CoRR, 2014

Adaptive Monte Carlo via Bandit Allocation.
CoRR, 2014

Convex Deep Learning via Normalized Kernels.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Adaptive Monte Carlo via Bandit Allocation.
Proceedings of the 31th International Conference on Machine Learning, 2014

Convex Co-embedding.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Reinforcement Ranking
CoRR, 2013

Learning Bayesian Nets that Perform Well
CoRR, 2013

Monte Carlo Inference via Greedy Importance Sampling
CoRR, 2013

Convex Relaxations of Bregman Divergence Clustering.
CoRR, 2013

Exploiting Syntactic, Semantic, and Lexical Regularities in Language Modeling via Directed Markov Random Fields.
Computational Intelligence, 2013

Convex Relaxations of Bregman Divergence Clustering.
Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, 2013

Protein-chemical Interaction Prediction via Kernelized Sparse Learning SVM.
Proceedings of the Biocomputing 2013: Proceedings of the Pacific Symposium, 2013

Multi-label Classification with Output Kernels.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

Polar Operators for Structured Sparse Estimation.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Convex Two-Layer Modeling.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Characterizing the Representer Theorem.
Proceedings of the 30th International Conference on Machine Learning, 2013

Divergence based graph estimation for manifold learning.
Proceedings of the IEEE Global Conference on Signal and Information Processing, 2013

Learning a Metric Space for Neighbourhood Topology Estimation: Application to Manifold Learning.
Proceedings of the Asian Conference on Machine Learning, 2013

2012
The Latent Maximum Entropy Principle.
TKDD, 2012

Generalized Optimal Reverse Prediction.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

An efficient algorithm for maximal margin clustering.
J. Global Optimization, 2012

An experimental methodology for response surface optimization methods.
J. Global Optimization, 2012

Linear Coherent Bi-Clustering via Beam Searching and Sample Set Clustering.
Discrete Math., Alg. and Appl., 2012

Boltzmann Machine Learning with the Latent Maximum Entropy Principle
CoRR, 2012

Monte Carlo Matrix Inversion Policy Evaluation
CoRR, 2012

Maximum Margin Bayesian Networks
CoRR, 2012

Rank/Norm Regularization with Closed-Form Solutions: Application to Subspace Clustering
CoRR, 2012

Convex Structure Learning for Bayesian Networks: Polynomial Feature Selection and Approximate Ordering.
CoRR, 2012

Sparse Learning Based Linear Coherent Bi-clustering.
Proceedings of the Algorithms in Bioinformatics - 12th International Workshop, 2012

Semi-supervised Multi-label Classification - A Simultaneous Large-Margin, Subspace Learning Approach.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2012

Accelerated Training for Matrix-norm Regularization: A Boosting Approach.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

A Polynomial-time Form of Robust Regression.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Convex Multi-view Subspace Learning.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Regularizers versus Losses for Nonlinear Dimensionality Reduction: A Factored View with New Convex Relaxations.
Proceedings of the 29th International Conference on Machine Learning, 2012

2011
Real-Time Discriminative Background Subtraction.
IEEE Trans. Image Processing, 2011

Rank/Norm Regularization with Closed-Form Solutions: Application to Subspace Clustering.
Proceedings of the UAI 2011, 2011

Modular Community Detection in Networks.
Proceedings of the IJCAI 2011, 2011

MapReduce for Parallel Reinforcement Learning.
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

Convex Sparse Coding, Subspace Learning, and Semi-Supervised Extensions.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Adaptive Large Margin Training for Multilabel Classification.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Relaxed Clipping: A Global Training Method for Robust Regression and Classification.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Distributed Flow Algorithms for Scalable Similarity Visualization.
Proceedings of the ICDMW 2010, 2010

Improved Natural Language Learning via Variance-Regularization Support Vector Machines.
Proceedings of the Fourteenth Conference on Computational Natural Language Learning, 2010

Linear Coherent Bi-cluster Discovery via Beam Detection and Sample Set Clustering.
Proceedings of the Combinatorial Optimization and Applications, 2010

Strictly Lexicalised Dependency Parsing.
Proceedings of the Trends in Parsing Technology, 2010

2009
Dual Temporal Difference Learning.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

Learning Exercise Policies for American Options.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

A General Projection Property for Distribution Families.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Convex Relaxation of Mixture Regression with Efficient Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Optimal reverse prediction: a unified perspective on supervised, unsupervised and semi-supervised learning.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Discriminative Maximum Margin Image Object Categorization with Exact Inference.
Proceedings of the Fifth International Conference on Image and Graphics, 2009

Fast normalized cut with linear constraints.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Linear Coherent Bi-cluster Discovery via Line Detection and Sample Majority Voting.
Proceedings of the Combinatorial Optimization and Applications, 2009

Inference of the structural credit risk model using MLE.
Proceedings of the 2009 IEEE Symposium on Computational Intelligence for Financial Engineering, 2009

A Reformulation of Support Vector Machines for General Confidence Functions.
Proceedings of the Advances in Machine Learning, 2009

2008
Policy Iteration for Learning an Exercise Policy for American Options.
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008

Semi-Supervised Convex Training for Dependency Parsing.
Proceedings of the ACL 2008, 2008

2007
Learning Gene Regulatory Networks via Globally Regularized Risk Minimization.
Proceedings of the Comparative Genomics, RECOMB 2007 International Workshop, 2007

Stable Dual Dynamic Programming.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Convex Relaxations of Latent Variable Training.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Discriminative Batch Mode Active Learning.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Simple Training of Dependency Parsers via Structured Boosting.
Proceedings of the IJCAI 2007, 2007

Automatic Gait Optimization with Gaussian Process Regression.
Proceedings of the IJCAI 2007, 2007

2006
Graphical Models and Point Pattern Matching.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

Constraint-based optimization and utility elicitation using the minimax decision criterion.
Artif. Intell., 2006

Convex Structure Learning for Bayesian Networks: Polynomial Feature Selection and Approximate Ordering.
Proceedings of the UAI '06, 2006

Web Communities Identification from Random Walks.
Proceedings of the Knowledge Discovery in Databases: PKDD 2006, 2006

Information Marginalization on Subgraphs.
Proceedings of the Knowledge Discovery in Databases: PKDD 2006, 2006

Learning to Model Spatial Dependency: Semi-Supervised Discriminative Random Fields.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

implicit Online Learning with Kernels.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Discriminative unsupervised learning of structured predictors.
Proceedings of the Machine Learning, 2006

Stochastic Analysis of Lexical and Semantic Enhanced Structural Language Model.
Proceedings of the Grammatical Inference: Algorithms and Applications, 2006

Improved Large Margin Dependency Parsing via Local Constraints and Laplacian Regularization.
Proceedings of the Tenth Conference on Computational Natural Language Learning, 2006

An Online Discriminative Approach to Background Subtraction.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

Semi-Supervised Conditional Random Fields for Improved Sequence Segmentation and Labeling.
Proceedings of the ACL 2006, 2006

Robust Support Vector Machine Training via Convex Outlier Ablation.
Proceedings of the Proceedings, 2006

Compact, Convex Upper Bound Iteration for Approximate POMDP Planning.
Proceedings of the Proceedings, 2006

2005
Combining Statistical Language Models via the Latent Maximum Entropy Principle.
Machine Learning, 2005

Maximum Margin Bayesian Networks.
Proceedings of the UAI '05, 2005

Strictly Lexical Dependency Parsing.
Proceedings of the Ninth International Workshop on Parsing Technology, 2005

Learning Coordination Classifiers.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Regret-based Utility Elicitation in Constraint-based Decision Problems.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Exploiting syntactic, semantic and lexical regularities in language modeling via directed Markov random fields.
Proceedings of the Machine Learning, 2005

Bayesian sparse sampling for on-line reward optimization.
Proceedings of the Machine Learning, 2005

Variational Bayesian image modelling.
Proceedings of the Machine Learning, 2005

Tangent-Corrected Embedding.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Unsupervised and Semi-Supervised Multi-Class Support Vector Machines.
Proceedings of the Proceedings, 2005

2004
Learning mixture models with the regularized latent maximum entropy principle.
IEEE Trans. Neural Networks, 2004

Dynamic Web log session identification with statistical language models.
JASIST, 2004

Augmenting Naive Bayes Classifiers with Statistical Language Models.
Inf. Retr., 2004

Maximum Margin Clustering.
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Exploiting syntactic, semantic and lexical regularities in language modeling via directed Markov random fields.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Transformation-Invariant Embedding for Image Analysis.
Proceedings of the Computer Vision, 2004

2003
Automatic basis selection techniques for RBF networks.
Neural Networks, 2003

Applying Machine Learning to Text Segmentation for Information Retrieval.
Inf. Retr., 2003

Boltzmann Machine Learning with the Latent Maximum Entropy Principle.
Proceedings of the UAI '03, 2003

Monte Carlo Matrix Inversion Policy Evaluation.
Proceedings of the UAI '03, 2003

Language and Task Independent Text Categorization with Simple Language Models.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Text classification in Asian languages without word segmentation.
Proceedings of the Sixth International Workshop on Information Retrieval with Asian Languages, 2003

Learning Mixture Models with the Latent Maximum Entropy Principle.
Proceedings of the Machine Learning, 2003

Semantic n-gram language modeling with the latent maximum entropy principle.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Combining Naive Bayes and n-Gram Language Models for Text Classification.
Proceedings of the Advances in Information Retrieval, 2003

Language Independent Authorship Attribution with Character Level N-Grams.
Proceedings of the EACL 2003, 2003

Face Alignment Using Statistical Models and Wavelet Features.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

Constraint-Based Optimization with the Minimax Decision Criterion.
Proceedings of the Principles and Practice of Constraint Programming, 2003

Learning Continuous Latent Variable Models with Bregman Divergences.
Proceedings of the Algorithmic Learning Theory, 14th International Conference, 2003

Latent Maximum Entropy Approach for Semantic N-gram Language Modeling.
Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics, 2003

Model-Based Least-Squares Policy Evaluation.
Proceedings of the Advances in Artificial Intelligence, 2003

Session Boundary Detection for Association Rule Learning Using n-Gram Language Models.
Proceedings of the Advances in Artificial Intelligence, 2003

2002
Metric-Based Methods for Adaptive Model Selection and Regularization.
Machine Learning, 2002

Guest Introduction: Special Issue on New Methods for Model Selection and Model Combination.
Machine Learning, 2002

Using self-supervised word segmentation in Chinese information retrieval.
Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Waterloo at NTCIR-3: Using Self-supervised Word Segmentation.
Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, 2002

Regularized Greedy Importance Sampling.
Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

Investigating the Maximum Likelihood Alternative to TD(lambda).
Proceedings of the Machine Learning, 2002

Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs.
Proceedings of the Machine Learning, 2002

Investigating the Relationship between Word Segmentation Performance and Retrieval Performance in Chinese IR.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

Piecewise Linear Value Function Approximation for Factored MDPs.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

Greedy Linear Value-Approximation for Factored Markov Decision Processes.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

Data Perturbation for Escaping Local Maxima in Learning.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

2001
General Convergence Results for Linear Discriminant Updates.
Machine Learning, 2001

Local search characteristics of incomplete SAT procedures.
Artif. Intell., 2001

A Hierarchical EM Approach to Word Segmentation.
Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, 2001

A Simple Closed-Class/Open-Class Factorization for Improved Language Modeling.
Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, 2001

Direct value-approximation for factored MDPs.
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001

The Exponentiated Subgradient Algorithm for Heuristic Boolean Programming.
Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

Self-Supervised Chinese Word Segmentation.
Proceedings of the Advances in Intelligent Data Analysis, 4th International Conference, 2001

2000
Monte Carlo inference via greedy importance sampling.
Proceedings of the UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30, 2000

An Adaptive Regularization Criterion for Supervised Learning.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Local Search Characteristics of Incomplete SAT Procedures.
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on on Innovative Applications of Artificial Intelligence, July 30, 2000

1999
Greedy Importance Sampling.
Proceedings of the Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29, 1999

Efficient exploration for optimizing immediate reward.
Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, 1999

1998
Boosting in the Limit: Maximizing the Margin of Learned Ensembles.
Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998

1997
Characterizing Rational Versus Exponential learning Curves.
J. Comput. Syst. Sci., 1997

Learning Bayesian Nets that Perform Well.
Proceedings of the UAI '97: Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence, 1997

Characterizing the generalization performance of model selection strategies.
Proceedings of the Fourteenth International Conference on Machine Learning (ICML 1997), 1997

General Convergence Results for Linear Discriminant Updates.
Proceedings of the Tenth Annual Conference on Computational Learning Theory, 1997

A New Metric-Based Approach to Model Selection.
Proceedings of the Fourteenth National Conference on Artificial Intelligence and Ninth Innovative Applications of Artificial Intelligence Conference, 1997

1995
Practical PAC Learning.
Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1995

Characterizing rational versus exponential learning curves.
Proceedings of the Computational Learning Theory, Second European Conference, 1995

Sequential PAC Learning.
Proceedings of the Eigth Annual Conference on Computational Learning Theory, 1995

1992
Learning Useful Horn Approximations.
Proceedings of the 3rd International Conference on Principles of Knowledge Representation and Reasoning (KR'92). Cambridge, 1992

Learning an Optimally Accurate Representation System.
Proceedings of the Foundation of Knowledge Representation and Reasoning [the book grew out of an ECAI-92 workshop], 1992

1989
Representational Difficulties with Classifier Systems.
Proceedings of the 3rd International Conference on Genetic Algorithms, 1989


  Loading...