Sridhar Mahadevan

Orcid: 0000-0001-6507-9109

Affiliations:
  • Adobe Research, USA
  • University of Massachusetts Amherst, USA


According to our database1, Sridhar Mahadevan authored at least 120 papers between 1985 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
GAIA: Categorical Foundations of Generative AI.
CoRR, 2024

2023
Universal Causality.
Entropy, April, 2023

Zero-th Order Algorithm for Softmax Attention Optimization.
CoRR, 2023

Randomized and Deterministic Attention Sparsification Algorithms for Over-parameterized Feature Dimension.
CoRR, 2023

An Over-parameterized Exponential Regression.
CoRR, 2023

Privacy Aware Experiments without Cookies.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Smoothed Online Combinatorial Optimization Using Imperfect Predictions.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
A Layered Architecture for Universal Causality.
CoRR, 2022

Unifying Causal Inference and Reinforcement Learning using Higher-Order Category Theory.
CoRR, 2022

Categoroids: Universal Conditional Independence.
CoRR, 2022

On The Universality of Diagrams for Causal Inference and The Causal Reproducing Property.
CoRR, 2022

Generating and Controlling Diversity in Image Search.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

2021
Causal Homotopy.
CoRR, 2021

Universal Decision Models.
CoRR, 2021

Causal Inference in Network Economics.
CoRR, 2021

Asymptotic Causal Inference.
CoRR, 2021

Multiscale Manifold Warping.
CoRR, 2021

2020
Finite-Sample Analysis of GTD Algorithms.
CoRR, 2020

Optimizing for the Future in Non-Stationary MDPs.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Personalizing with Human Cognitive Biases.
Proceedings of the Adjunct Publication of the 27th Conference on User Modeling, 2019

2018
Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity.
J. Artif. Intell. Res., 2018

Global Convergence to the Equilibrium of GANs using Variational Inequalities.
CoRR, 2018

A Unified Framework for Domain Adaptation Using Metric Learning on Manifolds.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2018

Imagination Machines: A New Challenge for Artificial Intelligence.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Online Monotone Games.
CoRR, 2017

A Manifold Approach to Learning Mutually Orthogonal Subspaces.
CoRR, 2017

Generative Multi-Adversarial Networks.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
Online Monotone Optimization.
CoRR, 2016

Deep Generative Models for Spectroscopic Analysis on Mars.
CoRR, 2016

Deep Reinforcement Learning With Macro-Actions.
CoRR, 2016

Proximal Gradient Temporal Difference Learning Algorithms.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015
Reasoning about Linguistic Regularities in Word Embeddings using Matrix Manifolds.
CoRR, 2015

Finite-Sample Analysis of Proximal Gradient TD Algorithms.
Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015

Efficient Hyper-parameter Optimization for NLP Applications.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Solving Large Sustainable Supply Chain Networks Using Variational Inequalities.
Proceedings of the Computational Sustainability, 2015

Aligning Mixed Manifolds.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Efficient and Scalable Metadata Management in EB-Scale File Systems.
IEEE Trans. Parallel Distributed Syst., 2014

Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces.
CoRR, 2014

Modeling Context in Cognition Using Variational Inequalities.
Proceedings of the 2014 AAAI Fall Symposia, Arlington, Virginia, USA, November 13-15, 2014, 2014

Manifold Spanning Graphs.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Projected Natural Actor-Critic.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

DROP: Facilitating distributed metadata management in EB-scale storage systems.
Proceedings of the IEEE 29th Symposium on Mass Storage Systems and Technologies, 2013

Manifold Alignment Preserving Global Geometry.
Proceedings of the IJCAI 2013, 2013

Basis Adaptation for Sparse Nonlinear Reinforcement Learning.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

Multiscale Manifold Learning.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Sparse Q-learning with Mirror Descent.
Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Regularized Off-Policy TD-Learning.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Manifold Warping: Manifold Alignment over Time.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
A GPU-Based Approximate SVD Algorithm.
Proceedings of the Parallel Processing and Applied Mathematics, 2011

Jointly Learning Data-Dependent Label and Locality-Preserving Projections.
Proceedings of the IJCAI 2011, 2011

Heterogeneous Domain Adaptation Using Manifold Alignment.
Proceedings of the IJCAI 2011, 2011

2010
Basis Construction from Power Series Expansions of Value Functions.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Basis function construction for hierarchical reinforcement learning.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Compressing POMDPs Using Locality Preserving Non-Negative Matrix Factorization.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

Representation Discovery in Sequential Decision Making.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009
Hybrid least-squares algorithms for approximate policy evaluation.
Mach. Learn., 2009

Learning Representation and Control in Markov Decision Processes: New Frontiers.
Found. Trends Mach. Learn., 2009

Multiscale Analysis of Document Corpora Based on Diffusion Models.
Proceedings of the IJCAI 2009, 2009

Manifold Alignment without Correspondence.
Proceedings of the IJCAI 2009, 2009

Transfer Learning and Representation Discovery in Intelligent Tutoring Systems.
Proceedings of the Artificial Intelligence in Education: Building Learning Systems that Care: From Knowledge Representation to Affective Modelling, 2009

A General Framework for Manifold Alignment.
Proceedings of the Manifold Learning and Its Applications, 2009

2008
Representation Discovery using Harmonic Analysis
Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers, ISBN: 978-3-031-01546-5, 2008

Manifold alignment using Procrustes analysis.
Proceedings of the Machine Learning, 2008

Fast Spectral Learning using Lanczos Eigenspace Projections.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
Proto-value Functions: A Laplacian Framework for Learning Representation and Control in Markov Decision Processes.
J. Mach. Learn. Res., 2007

Hierarchical Average Reward Reinforcement Learning.
J. Mach. Learn. Res., 2007

Learning state-action basis functions for hierarchical MDPs.
Proceedings of the Machine Learning, 2007

Adaptive mesh compression in 3D computer graphics using multiscale manifold learning.
Proceedings of the Machine Learning, 2007

Constructing basis functions from directed graphs for value function approximation.
Proceedings of the Machine Learning, 2007

Learning to Plan Using Harmonic Analysis of Diffusion Models.
Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling, 2007

Repairing Disengagement With Non-Invasive Interventions.
Proceedings of the Artificial Intelligence in Education, 2007

Representation Discovery in Planning using Harmonic Analysis.
Proceedings of the Computational Approaches to Representation Change during Learning and Development, 2007

Compact Spectral Bases for Value Function Approximation Using Kronecker Factorization.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006
Hierarchical multi-agent reinforcement learning.
Auton. Agents Multi Agent Syst., 2006

Estimating Student Proficiency Using an Item Response Theory Model.
Proceedings of the Intelligent Tutoring Systems, 8th International Conference, 2006

Improving Intelligent Tutoring Systems: Using Expectation Maximization to Learn Student Skill Levels.
Proceedings of the Intelligent Tutoring Systems, 8th International Conference, 2006

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes.
Proceedings of the Machine Learning, 2006

Learning Representation and Control in Continuous Markov Decision Processes.
Proceedings of the Proceedings, 2006

2005
Representation Policy Iteration.
Proceedings of the UAI '05, 2005

Switching kalman filters for prediction and tracking in an adaptive meteorological sensing network.
Proceedings of the Second Annual IEEE Communications Society Conference on Sensor and Ad Hoc Communications and Networks, 2005

Value Function Approximation with Diffusion Wavelets and Laplacian Eigenfunctions.
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

Coarticulation: an approach for generating concurrent plans in Markov decision processes.
Proceedings of the Machine Learning, 2005

Proto-value functions: developmental reinforcement learning.
Proceedings of the Machine Learning, 2005

Samuel Meets Amarel: Automating Value Function Approximation Using Global State Space Analysis.
Proceedings of the Proceedings, 2005

A Variational Learning Algorithm for the Abstract Hidden Markov Model.
Proceedings of the Proceedings, 2005

2004
Coarticulation in Markov Decision Processes.
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Learning hierarchical models of activity.
Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

Learning to Communicate and Act Using Hierarchical Reinforcement Learning.
Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

Probabilistic Plan Recognition in Multiagent Systems.
Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling (ICAPS 2004), 2004

2003
Recent Advances in Hierarchical Reinforcement Learning.
Discret. Event Dyn. Syst., 2003

Hierarchical Policy Gradient Algorithms.
Proceedings of the Machine Learning, 2003

2002
Spatiotemporal Abstraction of Stochastic Sequential Processes.
Proceedings of the Abstraction, 2002

Learning to Take Concurrent Actions.
Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

Learning the hierarchical structure of spatial environments using multiresolution statistical models.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, Lausanne, Switzerland, September 30, 2002

Approximate Planning with Hierarchical Partially Observable Markov Decision Process Models for Robot Navigation.
Proceedings of the 2002 IEEE International Conference on Robotics and Automation, 2002

Hierarchically Optimal Average Reward Reinforcement Learning.
Proceedings of the Machine Learning, 2002

A multiagent reinforcement learning algorithm by dynamically merging markov decision processes.
Proceedings of the First International Joint Conference on Autonomous Agents & Multiagent Systems, 2002

2001
Decision-Theoretic Planning with Concurrent Temporally Extended Actions.
Proceedings of the UAI '01: Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 2001

Learning Hierarchical Partially Observable Markov Decision Process Models for Robot Navigation.
Proceedings of the 2001 IEEE International Conference on Robotics and Automation, 2001

Continuous-Time Hierarchical Reinforcement Learning.
Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

A reinforcement learning model of selective visual attention.
Proceedings of the Fifth International Conference on Autonomous Agents, 2001

2000
Hierarchical Memory-Based Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 13, 2000

Face Recognition Using Foveal Vision.
Proceedings of the Biologically Motivated Computer Vision, 2000

1999
Hierarchical Optimization of Policy-Coupled Semi-Markov Decision Processes.
Proceedings of the Sixteenth International Conference on Machine Learning (ICML 1999), Bled, Slovenia, June 27, 1999

1998
Rapid Concept Learning for Mobile Robots.
Auton. Robots, 1998

Optimizing Production Manufacturing Using Reinforcement Learning.
Proceedings of the Eleventh International Florida Artificial Intelligence Research Society Conference, 1998

1996
Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results.
Mach. Learn., 1996

The National Science Foundation Workshop on Reinforcement Learning.
AI Mag., 1996

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning.
Proceedings of the Machine Learning, 1996

An Average-Reward Reinforcement Learning Algorithm for Computing Bias-Optimal Policies.
Proceedings of the Thirteenth National Conference on Artificial Intelligence and Eighth Innovative Applications of Artificial Intelligence Conference, 1996

1994
Quantifying Prior Determination Knowledge Using the PAC Learning Model.
Mach. Learn., 1994

To Discount or Not to Discount in Reinforcement Learning: A Case Study Comparing R Learning and Q Learning.
Proceedings of the Machine Learning, 1994

1993
An Apprentice-Based Approach to Knowledge Acquisition.
Artif. Intell., 1993

1992
Automatic Programming of Behavior-Based Robots Using Reinforcement Learning.
Artif. Intell., 1992

Enhancing Transfer in Reinforcement Learning by Building Stochastic Models of Robot Actions.
Proceedings of the Ninth International Workshop on Machine Learning (ML 1992), 1992

1991
Scaling Reinforcement Learning to Robotics by Exploiting the Subsumption Architecture.
Proceedings of the Eighth International Workshop (ML91), 1991

1989
Using Determinations in EBL: A Solution to the incomplete Theory Problem.
Proceedings of the Sixth International Workshop on Machine Learning (ML 1989), 1989

1988
On the Tractability of Learning from Incomplete Theories.
Proceedings of the Machine Learning, 1988

1985
LEAP: A Learning Apprentice for VLSI Design.
Proceedings of the 9th International Joint Conference on Artificial Intelligence. Los Angeles, 1985

Verification-based Learning: A Generalized Strategy for Inferring Problem-Reduction Methods.
Proceedings of the 9th International Joint Conference on Artificial Intelligence. Los Angeles, 1985


  Loading...