Pradeep Varakantham

Orcid: 0000-0001-7342-5745

According to our database, Pradeep Varakantham authored at least 151 papers between 2002 and 2024.

Bibliography

2024
SubIQ: Inverse Soft-Q Learning for Offline Imitation with Suboptimal Demonstrations.
CoRR, 2024

Handling Long and Richly Constrained Tasks through Constrained Hierarchical Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Unsupervised Training Sequence Design: Efficient and Generalizable Agent Training.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Reward Penalties on Augmented States for Solving Richly Constrained RL Effectively.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Training Reinforcement Learning Agents and Humans With Difficulty-Conditioned Generators.
CoRR, 2023

A Hierarchical Approach to Environment Design with Generative Trajectory Modeling.
CoRR, 2023

Conditioning Hierarchical Reinforcement Learning on Flexible Constraints.
CoRR, 2023

Regret-Based Optimization for Robust Reinforcement Learning.
CoRR, 2023

Diversity Induced Environment Design via Self-Play.
CoRR, 2023

Solving Constrained Reinforcement Learning through Augmented State and Reward Penalties.
CoRR, 2023

Effective Diversity in Unsupervised Environment Design.
CoRR, 2023

Generative Modelling of Stochastic Actions with Arbitrary Constraints in Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Transferable Curricula through Difficulty Conditioned Generators.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Generalization through Diversity: Improving Unsupervised Environment Design.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

On Sustainable Ride Pooling Through Conditional Expected Value Decomposition.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

Knowledge Compilation for Constrained Combinatorial Action Spaces in Reinforcement Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Avoiding Starvation of Arms in Restless Multi-Armed Bandits.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Strategic Planning for Flexible Agent Availability in Large Taxi Fleets.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Future Aware Pricing and Matching for Sustainable On-Demand Ride Pooling.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Constrained Reinforcement Learning in Hard Exploration Problems.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Learning Individual Policies in Large Multi-agent Systems through Local Variance Minimization.
CoRR, 2022

Towards Soft Fairness in Restless Multi-Armed Bandits.
CoRR, 2022

Efficient resource allocation with fairness constraints in restless multi-armed bandits.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Hierarchical Value Decomposition for Effective On-demand Ride-Pooling.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

Preface.
Proceedings of the Thirty-Second International Conference on Automated Planning and Scheduling, 2022

Joint Pricing and Matching for City-Scale Ride-Pooling.
Proceedings of the Thirty-Second International Conference on Automated Planning and Scheduling, 2022

Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-profits in Improving Maternal and Child Health.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Facilitating Human-Wildlife Cohabitation through Conflict Prediction.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Zone pAth Construction (ZAC) based Approaches for Effective Real-Time Ridesharing.
J. Artif. Intell. Res., 2021

Conditional Expectation based Value Decomposition for Scalable On-Demand Ride Pooling.
CoRR, 2021

Selective Intervention Planning using Restless Multi-Armed Bandits to Improve Maternal and Child Health Outcomes.
CoRR, 2021

CLAIM: curriculum learning policy for influence maximization in unknown social networks.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Adaptive Operating Hours for Improved Performance of Taxi Fleets.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Learning Index Policies for Restless Bandits with Application to Maternal Healthcare.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020
Value Variance Minimization for Learning Approximate Equilibrium in Aggregation Systems.
CoRR, 2020

On Solving Cooperative MARL Problems with a Few Good Experiences.
CoRR, 2020

Competitive Ratios for Online Multi-capacity Ridesharing.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Online Traffic Signal Control through Sample-Based Constrained Optimization.
Proceedings of the Thirtieth International Conference on Automated Planning and Scheduling, 2020

Solving Online Threat Screening Games using Constrained Action Space Reinforcement Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Neural Approximate Dynamic Programming for On-Demand Ride-Pooling.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Correlated Learning for Aggregation Systems.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

RE-ORG: An Online Repositioning Guidance Agent.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

A Homophily-Free Community Detection Framework for Trajectories with Delayed Responses.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Entropy Based Independent Learning in Anonymous Multi-Agent Settings.
Proceedings of the Twenty-Ninth International Conference on Automated Planning and Scheduling, 2019

ZAC: A Zone Path Construction Approach for Effective Real-Time Ridesharing.
Proceedings of the Twenty-Ninth International Conference on Automated Planning and Scheduling, 2019

Resource Constrained Deep Reinforcement Learning.
Proceedings of the Twenty-Ninth International Conference on Automated Planning and Scheduling, 2019

2018
Risk-Sensitive Stochastic Orienteering Problems for Trip Optimization in Urban Environments.
ACM Trans. Intell. Syst. Technol., 2018

TuSeRACT: Turn-Sample-Based Real-Time Traffic Signal Control.
CoRR, 2018

Entropy Controlled Non-Stationarity for Improving Performance of Independent Learners in Anonymous MARL Settings.
CoRR, 2018

Online spatio-temporal matching in stochastic and dynamic domains.
Artif. Intell., 2018

Decentralized Planning for Non-dedicated Agent Teams with Submodular Rewards in Uncertain Environments.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

A Driver Guidance System for Taxis in Singapore.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

Bounded Rank Optimization for Effective and Efficient Emergency Response.
Proceedings of the Twenty-Eighth International Conference on Automated Planning and Scheduling, 2018

Reserved Optimisation: Handling Incident Priorities in Emergency Response Systems.
Proceedings of the Twenty-Eighth International Conference on Automated Planning and Scheduling, 2018

Upping the Game of Taxi Driving in the Age of Uber.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Dispatch Guided Allocation Optimization for Effective Emergency Response.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Dynamic Repositioning to Reduce Lost Demand in Bike Sharing Systems.
J. Artif. Intell. Res., 2017

Sampling Based Approaches for Minimizing Regret in Uncertain Markov Decision Processes (MDPs).
J. Artif. Intell. Res., 2017

Artificial Intelligence Research in Singapore: Assisting the Development of a Smart Nation.
AI Mag., 2017

Mechanism Design for Strategic Project Scheduling.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Proactive and Reactive Coordination of Non-dedicated Agent Teams Operating in Uncertain Environments.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Exploiting Anonymity and Homogeneity in Factored Dec-MDPs through Precomputed Binomial Distributions.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Improving Revenues.
Proceedings of the Twenty-Seventh International Conference on Automated Planning and Scheduling, 2017

Online Repositioning in Bike Sharing Systems.
Proceedings of the Twenty-Seventh International Conference on Automated Planning and Scheduling, 2017

Incentivizing the Use of Bike Trailers for Dynamic Repositioning in Bike Sharing Systems.
Proceedings of the Twenty-Seventh International Conference on Automated Planning and Scheduling, 2017

Decentralized Planning in Stochastic Environments with Submodular Rewards.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Sequential Decision Making for Improving Efficiency in Urban Environments.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Robust Repositioning to Counter Unpredictable Demand in Bike Sharing Systems.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Scalable Greedy Algorithms for Task/Resource Constrained Multi-Agent Stochastic Planning.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

An Intelligent System for Personalized Conference Event Recommendation and Scheduling.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

Detecting Communities Using Coordination Games: A Short Paper.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

Robust Influence Maximization: (Extended Abstract).
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

PRESS: PeRsonalized Event Scheduling recommender System (Demonstration).
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Strategic Planning for Setting Up Base Stations in Emergency Medical Systems.
Proceedings of the Twenty-Sixth International Conference on Automated Planning and Scheduling, 2016

Robust Partial Order Schedules for RCPSP/max with Durational Uncertainty.
Proceedings of the Twenty-Sixth International Conference on Automated Planning and Scheduling, 2016

A Proactive Sampling Approach to Project Scheduling under Uncertainty.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

NLU Framework for Voice Enabling Non-Native Applications on Smart Devices.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Robust Decision Making for Stochastic Network Design.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Solving Risk-Sensitive POMDPs With and Without Cost Observations.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Robust execution strategies for project scheduling with unreliable resources and stochastic durations.
J. Sched., 2015

An extended study on addressing defender teamwork while accounting for uncertainty in attacker defender games using iterative Dec-MDPs.
Multiagent Grid Syst., 2015

Learning and Controlling Network Diffusion in Dependent Cascade Models.
Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2015

Incremental DCOP Search Algorithms for Solving Dynamic DCOP Problems.
Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2015

Probabilistic Inference Based Message-Passing for Resource Constrained DCOPs.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

DIRECT: A Scalable Approach for Route Guidance in Selfish Orienteering Problems.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Near-Optimal Decentralized Power Supply Restoration in Smart Grids.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Risk Based Optimization for Improving Emergency Medical Systems.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Dynamic Redeployment to Counter Congestion or Starvation in Vehicle Sharing Systems.
Proceedings of the Artificial Intelligence for Cities, 2015

Solving Uncertain MDPs with Objectives that Are Separable over Instantiations of Model Uncertainty.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
TESLA: an extended study of an energy-saving agent that leverages schedule flexibility.
Auton. Agents Multi Agent Syst., 2014

Marginal Contribution Stochastic Games for Dynamic Resource Allocation.
Proceedings of the PRIMA 2014: Principles and Practice of Multi-Agent Systems, 2014

Unleashing Dec-MDPs in Security Games: Enabling Effective Defender Teamwork.
Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014

Building THINC: user incentivization and meeting rescheduling for energy savings.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

On understanding diffusion dynamics of patrons at a theme park.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Revisiting Risk-Sensitive MDPs: New Algorithms and Results.
Proceedings of the Twenty-Fourth International Conference on Automated Planning and Scheduling, 2014

Decentralized Stochastic Planning with Anonymity in Interactions.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

STREETS: Game-Theoretic Traffic Patrolling with Exploration and Exploitation.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
An agent-based simulation approach to experience management in theme parks.
Proceedings of the Winter Simulations Conference: Simulation Making Decisions in a Complex World, 2013

Regret based Robust Solutions for Uncertain Markov Decision Processes.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Scalable Randomized Patrolling for Securing Rapid Transit Networks.
Proceedings of the Twenty-Fifth Innovative Applications of Artificial Intelligence Conference, 2013

TESLA: an energy-saving agent that leverages schedule flexibility.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Optimization Approaches for Solving Chance Constrained Stochastic Orienteering Problems.
Proceedings of the Algorithmic Decision Theory - Third International Conference, 2013

Budgeted Personalized Incentive Approaches for Smoothing Congestion in Resource Networks.
Proceedings of the Algorithmic Decision Theory - Third International Conference, 2013

2012
Robust Local Search for Solving RCPSP/max with Durational Uncertainty.
J. Artif. Intell. Res., 2012

Reports of the AAAI 2011 Fall Symposia.
AI Mag., 2012

Dynamic Stochastic Orienteering Problems for Risk-Aware Applications.
Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Uncertain Congestion Games with Assorted Human Agent Populations.
Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Active malware analysis using stochastic games.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Prioritized shaping of models for solving DEC-POMDPs.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Delayed observation planning in partially observable domains.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Sustainable multiagent application to conserve energy (demonstration).
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

SAVES: a sustainable multiagent application to conserve building energy considering occupants.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Lagrangian relaxation for large-scale multi-agent planning.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Decision Support for Agent Populations in Uncertain and Congested Environments.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
Decision Support in Organizations: A Case for OrgPOMDPs.
Proceedings of the 2011 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2011

Social Model Shaping for Solving Generic DEC-POMDPs.
Proceedings of the 2011 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2011

Incremental DCOP search algorithms for solving dynamic DCOPs.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Distributed model shaping for scaling to decentralized POMDPs with hundreds of agents.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Adaptive decision support for structured organizations: a case for OrgPOMDPs.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

Decentralized decision support for an agent population in dynamic and uncertain domains.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011

2010
Introducing communication in Dis-POMDPs with locality of interaction.
Web Intell. Agent Syst., 2010

A Decision Theoretic Approach to Data Leakage Prevention.
Proceedings of the 2010 IEEE Second International Conference on Social Computing, 2010

Effect of Human Biases on Human-Agent Teams.
Proceedings of the 2010 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2010

Analyzing the impact of human bias on human-agent teams in resource allocation domains.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Risk-sensitive planning in partially observable environments.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Towards Finding Robust Execution Strategies for RCPSP/max with Durational Uncertainty.
Proceedings of the 20th International Conference on Automated Planning and Scheduling, 2010

2009
Caching schemes for DCOP search algorithms.
Proceedings of the 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

Exploiting Coordination Locales in Distributed POMDPs via Social Model Shaping.
Proceedings of the 19th International Conference on Automated Planning and Scheduling, 2009

2008
Not all agents are equal: scaling up distributed POMDPs for agent networks.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Linear Relaxation Techniques for Task Management in Uncertain Settings.
Proceedings of the Eighteenth International Conference on Automated Planning and Scheduling, 2008

2007
Towards Efficient Computation of Error Bounded Solutions in POMDPs: Expected Value Approximation and Dynamic Disjunctive Beliefs.
Proceedings of the IJCAI 2007, 2007

Letting loose a SPIDER on a network of POMDPs: generating quality guaranteed policies.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

Demonstration of teamwork in uncertain domains using hybrid BDI-POMDP systems.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007

SPIDER Attack on a Network of POMDPs: Towards Quality Bounded Solutions.
Proceedings of the Game Theoretic and Decision Theoretic Agents, 2007

2006
Privacy Loss in Distributed Constraint Reasoning: A Quantitative Framework for Analysis and its Applications.
Auton. Agents Multi Agent Syst., 2006

Asimovian Multiagents: Applying Laws of Robotics to Teams of Humans and Agents.
Proceedings of the Programming Multi-Agent Systems, 4th International Workshop, 2006

Winning back the CUP for distributed POMDPs: planning over continuous belief spaces.
Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2006), 2006

Electric Elves: What Went Wrong and Why.
Proceedings of the What Went Wrong and Why: Lessons from AI Research and Applications, 2006

Exploiting Locality of Interaction in Networked Distributed POMDPs.
Proceedings of the Distributed Plan and Schedule Management, 2006

2005
Implementation Techniques for Solving POMDPs in Personal Assistant Agents.
Proceedings of the Programming Multi-Agent Systems, 2005

Networked Distributed POMDPs: A Synergy of Distributed Constraint Optimization and POMDPs.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Exploiting belief bounds: practical POMDPs for personal assistant agents.
Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

Conflicts in teamwork: hybrids to the rescue.
Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

Practical POMDPs for Personal Assistant Domains.
Proceedings of the Persistent Assistants: Living and Working with AI, 2005

Valuations of Possible States (VPS): A Quantitative Framework for Analysis of Privacy Loss Among Collaborative Personal Assistant Agents.
Proceedings of the Persistent Assistants: Living and Working with AI, 2005

Networked Distributed POMDPs: A Synthesis of Distributed Constraint Optimization and POMDPs.
Proceedings of the Proceedings, 2005

2004
Taking DCOP to the Real World: Efficient Complete Solutions for Distributed Multi-Event Scheduling.
Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

2003
Adjustable Autonomy Challenges in Personal Assistant Agents: A Position Paper.
Proceedings of the Agents and Computational Autonomy - Potential, Risks, and Solutions - Postproceedings of the 1st International Workshop on Computational Autonomy, 2003

2002
On handling component and transaction failures in multi agent systems.
SIGecom Exch., 2002

