Shie Mannor
ORCID: 0000-0003-4439-7647
Affiliations:
 Technion - Israel Institute of Technology, Department of Electrical Engineering, Haifa, Israel (PhD 2002)
 Nvidia Research, Tel Aviv-Yafo, Israel
According to our database, Shie Mannor authored at least 428 papers between 2000 and 2024.
Bibliography
2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes.
CoRR, 2024
CoRR, 2024
Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization.
CoRR, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
Proceedings of the ACM SIGGRAPH 2023 Conference Proceedings, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023
Proceedings of the International Conference on Machine Learning, 2023
Learning Hidden Markov Models When the Locations of Missing Observations are Unknown.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023
Never Worse, Mostly Better: Stable Policy Improvement in Deep Reinforcement Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
SIGMETRICS Perform. Evaluation Rev., 2022
CoRR, 2022
CoRR, 2022
CoRR, 2022
Efficient Policy Iteration for Robust Markov Decision Processes via Regularization.
CoRR, 2022
What's Missing? Learning Hidden Markov Models When the Locations of Missing Observations are Unknown.
CoRR, 2022
CoRR, 2022
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Uncertainty Estimation Using Riemannian Model Dynamics for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms.
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Reinforcement Learning for Extended Intelligence.
Proceedings of the 19th International Conference on Informatics in Control, 2022
Proceedings of the Conference on Robot Learning, 2022
Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Mach. Learn., 2021
CoRR, 2021
Better than the Best: Gradient-based Improper Reinforcement Learning for Network Scheduling.
CoRR, 2021
CoRR, 2021
CoRR, 2021
CoRR, 2021
CoRR, 2021
CoRR, 2021
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021
Proceedings of the Robotics: Science and Systems XVII, Virtual Event, July 12-16, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
On the Volatility of Optimal Control Policies of a Class of Linear Quadratic Regulators.
Proceedings of the 2021 American Control Conference, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems.
CoRR, 2020
CoRR, 2020
How to Stop Epidemics: Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks.
CoRR, 2020
The Pendulum Arrangement: Maximizing the Escape Time of Heterogeneous Random Walks.
CoRR, 2020
CoRR, 2020
CoRR, 2020
CoRR, 2020
CoRR, 2020
CoRR, 2020
CoRR, 2020
CoRR, 2020
CoRR, 2020
Scalable Detection of Offensive and Non-compliant Content / Logo in Product Images.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the Conference on Learning Theory, 2020
Proceedings of the Algorithmic Learning Theory, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
IEEE/ACM Trans. Netw., 2019
CoRR, 2019
CoRR, 2019
CoRR, 2019
CoRR, 2019
CoRR, 2019
Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces.
CoRR, 2019
Image Matters: Detecting Offensive and Non-Compliant Content / Logo in Product Images.
CoRR, 2019
CoRR, 2019
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching.
CoRR, 2019
CoRR, 2019
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019
Distributional Policy Optimization: An Alternative Approach for Continuous Control.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 7th International Conference on Learning Representations, 2019
Batch-Size Independent Regret Bounds for the Combinatorial Multi-Armed Bandit Problem.
Proceedings of the Conference on Learning Theory, 2019
On-Line Learning of Linear Dynamical Systems: Exponential Forgetting in Kalman Filters.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
IEEE Trans. Netw. Sci. Eng., 2018
IEEE Trans. Inf. Theory, 2018
CoRR, 2018
CoRR, 2018
CoRR, 2018
CoRR, 2018
CoRR, 2018
CoRR, 2018
CoRR, 2018
CoRR, 2018
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning.
Proceedings of the Conference On Learning Theory, 2018
Proceedings of the Conference On Learning Theory, 2018
Is a Picture Worth a Thousand Words? A Deep Multi-Modal Architecture for Product Classification in E-Commerce.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017
IEEE Trans. Autom. Control., 2017
SIAM J. Comput., 2017
Learn on Source, Refine on Target: A Model Transfer Learning Framework with Random Forests.
IEEE Trans. Pattern Anal. Mach. Intell., 2017
IEEE J. Sel. Areas Commun., 2017
CoRR, 2017
CoRR, 2017
CoRR, 2017
CoRR, 2017
CoRR, 2017
Concentration Bounds for Two Timescale Stochastic Approximation with Applications to Reinforcement Learning.
CoRR, 2017
CoRR, 2017
Proceedings of the 26th International Conference on World Wide Web Companion, 2017
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the IEEE Power & Energy Society Innovative Smart Grid Technologies Conference, 2017
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 30th Conference on Learning Theory, 2017
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
Math. Oper. Res., 2016
Math. Oper. Res., 2016
J. Mach. Learn. Res., 2016
J. Mach. Learn. Res., 2016
CoRR, 2016
Is a picture worth a thousand words? A Deep Multi-Modal Fusion Architecture for Product Classification in e-commerce.
CoRR, 2016
CoRR, 2016
CoRR, 2016
CoRR, 2016
CoRR, 2016
CoRR, 2016
CoRR, 2016
A Reinforcement Learning System to Encourage Physical Activity in Diabetes Patients.
CoRR, 2016
CoRR, 2016
CoRR, 2016
CoRR, 2016
CoRR, 2016
Distributed scenario-based optimization for asset management in a hierarchical decision making environment.
Proceedings of the Power Systems Computation Conference, 2016
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the 35th Annual IEEE International Conference on Computer Communications, 2016
Proceedings of the 33rd International Conference on Machine Learning, 2016
Proceedings of the 33rd International Conference on Machine Learning, 2016
Proceedings of the 33rd International Conference on Machine Learning, 2016
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
IEEE Trans. Inf. Theory, 2015
IEEE Trans. Pattern Anal. Mach. Intell., 2015
J. Artif. Intell. Res., 2015
Oper. Res., 2015
Found. Trends Mach. Learn., 2015
CoRR, 2015
CoRR, 2015
CoRR, 2015
CoRR, 2015
CoRR, 2015
CoRR, 2015
CoRR, 2015
CoRR, 2015
Learning to coordinate without communication in multi-user multi-armed bandit problems.
CoRR, 2015
Proceedings of the 2015 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015
Proceedings of the 2015 IEEE Conference on Computer Communications, 2015
Proceedings of the 2015 IEEE Conference on Computer Communications, 2015
Proceedings of the 32nd International Conference on Machine Learning, 2015
Proceedings of the 32nd International Conference on Machine Learning, 2015
Proceedings of The 28th Conference on Learning Theory, 2015
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
Proceedings of the Learning for General Competency in Video Games, 2015
2014
High-Throughput Energy-Efficient LDPC Decoders Using Differential Binary Message Passing.
IEEE Trans. Signal Process., 2014
Math. Oper. Res., 2014
J. Mach. Learn. Res., 2014
CoRR, 2014
CoRR, 2014
CoRR, 2014
CoRR, 2014
Proceedings of the ACM Conference on Economics and Computation, 2014
Heterogeneous Stream Processing and Crowdsourcing for Traffic Monitoring: Highlights.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014
Proceedings of the 31st International Conference on Machine Learning, 2014
Proceedings of the 31st International Conference on Machine Learning, 2014
Scaling Up Approximate Value Iteration with Options: Better Policies with Fewer Iterations.
Proceedings of the 31st International Conference on Machine Learning, 2014
Proceedings of the 31st International Conference on Machine Learning, 2014
Proceedings of the 31st International Conference on Machine Learning, 2014
Proceedings of the 31st International Conference on Machine Learning, 2014
Combining a Gauss-Markov model and Gaussian process for traffic prediction in Dublin city center.
Proceedings of the Workshops of the EDBT/ICDT 2014 Joint Conference (EDBT/ICDT 2014), 2014
Proceedings of the 17th International Conference on Extending Database Technology, 2014
Approachability in unknown games: Online learning meets multi-objective optimization.
Proceedings of The 27th Conference on Learning Theory, 2014
Proceedings of the IEEE 25th International Conference on Application-Specific Systems, 2014
2013
IEEE Trans. Inf. Theory, 2013
IEEE Trans. Commun., 2013
IEEE Trans. Commun., 2013
Soc. Netw. Anal. Min., 2013
Pervasive Mob. Comput., 2013
IEEE Trans. Pattern Anal. Mach. Intell., 2013
A State Action Frequency Approach to Throughput Maximization over Uncertain Wireless Channels.
Internet Math., 2013
Games Econ. Behav., 2013
Eur. J. Oper. Res., 2013
CoRR, 2013
Online Learning for Loss Functions with Memory and Applications to Statistical Arbitrage.
CoRR, 2013
CoRR, 2013
Policy Evaluation with Variance Related Risk Criteria in Markov Decision Processes.
CoRR, 2013
CoRR, 2013
CoRR, 2013
CoRR, 2013
CoRR, 2013
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
Proceedings of the Fourteenth ACM International Symposium on Mobile Ad Hoc Networking and Computing, 2013
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013
Proceedings of the 30th International Conference on Machine Learning, 2013
Proceedings of the 30th International Conference on Machine Learning, 2013
Proceedings of the COLT 2013, 2013
Proceedings of the COLT 2013, 2013
Proceedings of the COLT 2013, 2013
2012
IEEE Trans. Commun., 2012
IEEE Trans. Pattern Anal. Mach. Intell., 2012
Math. Oper. Res., 2012
Math. Oper. Res., 2012
Mach. Learn., 2012
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012
Proceedings of the COLT 2012, 2012
Proceedings of the 4th Asian Conference on Machine Learning, 2012
Oper. Res., 2012
More Is Better: Large Scale Partially-supervised Sentiment Classification - Appendix.
CoRR, 2012
CoRR, 2012
CoRR, 2012
Ann. Oper. Res., 2012
Proceedings of the 2012 IEEE Workshop on Signal Processing Systems, 2012
Proceedings of the ACM SIGMETRICS/PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems, 2012
Proceedings of the 29th International Conference on Machine Learning, 2012
Proceedings of the 29th International Conference on Machine Learning, 2012
Proceedings of the 29th International Conference on Machine Learning, 2012
Proceedings of the 51st IEEE Conference on Decision and Control, 2012
Proceedings of the 51st IEEE Conference on Decision and Control, 2012
Proceedings of the 50th Annual Allerton Conference on Communication, 2012
Proceedings of the Reinforcement Learning, 2012
2011
J. Signal Process. Syst., 2011
IEEE Trans. Signal Process., 2011
IEEE Trans. Parallel Distributed Syst., 2011
IEEE Trans. Comput. Intell. AI Games, 2011
J. Mach. Learn. Res., 2011
Proceedings of the COLT 2011, 2011
Proceedings of the COLT 2011, 2011
CoRR, 2011
CoRR, 2011
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011
Proceedings of the IJCAI 2011, 2011
Proceedings of the 28th International Conference on Machine Learning, 2011
Proceedings of the 28th International Conference on Machine Learning, 2011
Proceedings of the 28th International Conference on Machine Learning, 2011
Proceedings of the 28th International Conference on Machine Learning, 2011
Proceedings of the 50th IEEE Conference on Decision and Control and European Control Conference, 2011
Proceedings of the 50th IEEE Conference on Decision and Control and European Control Conference, 2011
Proceedings of the Computational Physiology, 2011
2010
Proceedings of the Encyclopedia of Machine Learning, 2010
IEEE Trans. Signal Process., 2010
IEEE Trans. Signal Process., 2010
IEEE Trans. Inf. Theory, 2010
IEEE Trans. Circuits Syst. II Express Briefs, 2010
Math. Oper. Res., 2010
Oper. Res., 2010
IEEE Commun. Lett., 2010
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010
Proceedings of the First Workshop on Social Media Analytics, 2010
Proceedings of the 2010 International Conference on Distributed Computing Systems, 2010
A novel similarity measure for time series data with applications to gait and activity recognition.
Proceedings of the UbiComp 2010: Ubiquitous Computing, 12th International Conference, 2010
Proceedings of the Global Communications Conference, 2010
Proceedings of the COLT 2010, 2010
Proceedings of the COLT 2010, 2010
Proceedings of the 49th IEEE Conference on Decision and Control, 2010
Proceedings of the 49th IEEE Conference on Decision and Control, 2010
Proceedings of the 48th Annual Allerton Conference on Communication, 2010
Proceedings of the 48th Annual Allerton Conference on Communication, 2010
Proceedings of the 48th Annual Allerton Conference on Communication, 2010
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010
2009
IEEE Trans. Autom. Control., 2009
IEEE Trans. Autom. Control., 2009
Math. Oper. Res., 2009
J. Mach. Learn. Res., 2009
J. Mach. Learn. Res., 2009
Approachability in repeated games: Computational aspects and a Stackelberg variant.
Games Econ. Behav., 2009
Proceedings of the IEEE Workshop on Signal Processing Systems, 2009
Proceedings of the 2009 IEEE Information Theory Workshop, 2009
Proceedings of the 26th Annual International Conference on Machine Learning, 2009
Proceedings of IEEE International Conference on Communications, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the Global Communications Conference, 2009. GLOBECOM 2009, Honolulu, Hawaii, USA, 30 November, 2009
Online learning in Markov decision processes with arbitrarily changing rewards and transitions.
Proceedings of the 1st International Conference on Game Theory for Networks, 2009
Proceedings of the 1st International Conference on Game Theory for Networks, 2009
Proceedings of the COLT 2009, 2009
Proceedings of the 48th IEEE Conference on Decision and Control, 2009
Proceedings of the 48th IEEE Conference on Decision and Control, 2009
Proceedings of the 48th IEEE Conference on Decision and Control, 2009
Regularized Fitted Q-Iteration for planning in continuous-space Markovian decision problems.
Proceedings of the American Control Conference, 2009
2008
IEEE Trans. Signal Process., 2008
Math. Oper. Res., 2008
Games Econ. Behav., 2008
CoRR, 2008
Proceedings of the Internet and Network Economics, 4th International Workshop, 2008
Proceedings of the 3rd International ICST Conference on Performance Evaluation Methodologies and Tools, 2008
Proceedings of the Advances in Neural Information Processing Systems 21, 2008
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2008
Proceedings of the Machine Learning, 2008
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008
Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case.
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008
Proceedings of the 21st Annual Conference on Learning Theory, 2008
Proceedings of the 46th Annual Allerton Conference on Communication, 2008
Proceedings of the 46th Annual Allerton Conference on Communication, 2008
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008
2007
IEEE Trans. Inf. Theory, 2007
Online calibrated forecasts: Memory efficiency versus universality for learning in games.
Mach. Learn., 2007
Manag. Sci., 2007
IEEE J. Sel. Areas Commun., 2007
Artif. Intell., 2007
An Area-Efficient FPGA-Based Architecture for Fully-Parallel Stochastic LDPC Decoding.
Proceedings of the IEEE Workshop on Signal Processing Systems, 2007
Proceedings of the NETWORKING 2007. Ad Hoc and Sensor Networks, 2007
Proceedings of the 37th International Symposium on Multiple-Valued Logic, 2007
Percentile optimization in uncertain Markov decision processes with application to efficient exploration.
Proceedings of the Machine Learning, 2007
Proceedings of the Global Communications Conference, 2007
Proceedings of the 46th IEEE Conference on Decision and Control, 2007
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007
2006
Design of ℓ₁-optimal controllers with flexible disturbance rejection level.
IEEE Trans. Autom. Control., 2006
Action Elimination and Stopping Conditions for the Multi-Armed Bandit and Reinforcement Learning Problems.
J. Mach. Learn. Res., 2006
IEEE Commun. Lett., 2006
Games Econ. Behav., 2006
Proceedings of the Advances in Neural Information Processing Systems 19, 2006
Proceedings of the INFOCOM 2006. 25th IEEE International Conference on Computer Communications, 2006
Automatic basis function construction for approximate dynamic programming and reinforcement learning.
Proceedings of the Machine Learning, 2006
Proceedings of the Learning Theory, 19th Annual Conference on Learning Theory, 2006
Proceedings of the Learning Theory, 19th Annual Conference on Learning Theory, 2006
Proceedings of the American Control Conference, 2006
2005
Efficiency loss in a network resource allocation game: the case of elastic supply.
IEEE Trans. Autom. Control., 2005
On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies.
Math. Oper. Res., 2005
Ann. Oper. Res., 2005
Ann. Oper. Res., 2005
The Workshop Program at the Nineteenth National Conference on Artificial Intelligence.
AI Mag., 2005
Proceedings of the Machine Learning, 2005
Proceedings of the Machine Learning, 2005
2004
IEEE Trans. Signal Process., 2004
J. Mach. Learn. Res., 2004
J. Mach. Learn. Res., 2004
Proceedings of the Machine Learning, 2004
Proceedings of the Machine Learning, 2004
Proceedings of the Learning Theory, 17th Annual Conference on Learning Theory, 2004
Proceedings of the 43rd IEEE Conference on Decision and Control, 2004
2003
The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes.
Math. Oper. Res., 2003
Greedy Algorithms for Classification - Consistency, Convergence Rates, and Adaptivity.
J. Mach. Learn. Res., 2003
Proceedings of the Machine Learning, 2003
Proceedings of the Machine Learning, 2003
Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning.
Proceedings of the Machine Learning, 2003
Lower Bounds on the Sample Complexity of Exploration in the Multi-armed Bandit Problem.
Proceedings of the Computational Learning Theory and Kernel Machines, 2003
Proceedings of the Computational Learning Theory and Kernel Machines, 2003
2002
Mach. Learn., 2002
Proceedings of the Machine Learning: ECML 2002, 2002
Proceedings of the Machine Learning: ECML 2002, 2002
Proceedings of the Computational Learning Theory, 2002
Proceedings of the Computational Learning Theory, 2002
2001
Proceedings of the Advances in Neural Information Processing Systems 14 (Neural Information Processing Systems: Natural and Synthetic), 2001
Learning Embedded Maps of Markov Processes.
Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001
Adaptive Strategies and Regret Minimization in Arbitrarily Varying Markov Environments.
Proceedings of the Computational Learning Theory, 2001
Proceedings of the Computational Learning Theory, 2001
2000
Proceedings of the Advances in Neural Information Processing Systems 13, 2000