Sumitra Ganesh

Orcid: 0000-0003-1695-8574

According to our database1, Sumitra Ganesh authored at least 43 papers between 2002 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Modelling bounded rational decision-making through Wasserstein constraints.
CoRR, April, 2025

Generating Structured Plan Representation of Procedures with LLMs.
CoRR, April, 2025

Monty Hall and Optimized Conformal Prediction to Improve Decision-Making with LLMs.
CoRR, January, 2025

Generative AI Agents for Knowledge Work Augmentation in Finance.
Annu. Rev. Control. Robotics Auton. Syst., 2025

ADAGE: A Generic Two-layer Framework for Adaptive Agent based Modelling.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-Time Alignment.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Partially Observable Contextual Bandits With Linear Payoffs.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Learning in Herding Mean Field Games: Single-Loop Algorithm with Finite-Time Convergence Analysis.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025

Approximate Equivariance in Reinforcement Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2025

Decentralized Convergence to Equilibrium Prices in Trading Networks.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Online MCMC Thinning with Kernelized Stein Discrepancy.
SIAM J. Math. Data Sci., March, 2024

Regularized Proportional Fairness Mechanism for Resource Allocation Without Money.
Trans. Mach. Learn. Res., 2024

In-Context Learning with Topological Information for Knowledge Graph Completion.
CoRR, 2024

Scalable Representation Learning for Multimodal Tabular Transactions.
CoRR, 2024

A Heterogeneous Agent Model of Mortgage Servicing: An Income-based Relief Analysis.
CoRR, 2024

Learning Payment-Free Resource Allocation Mechanisms.
Proceedings of the Winter Simulation Conference, 2024

Information-Directed Pessimism for Offline Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Efficient Inverse Multiagent Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Simulate and Optimise: A two-layer mortgage simulator for designing novel mortgage assistance products.
Proceedings of the 5th ACM International Conference on AI in Finance, 2024

Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-agent Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

2023
Near-Optimal Fair Resource Allocation for Strategic Agents without Money: A Data-Driven Approach.
CoRR, 2023

O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models.
CoRR, 2023

Certifiably Robust Policy Learning against Adversarial Multi-Agent Communication.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Sequential Fair Resource Allocation under a Markov Decision Process Framework.
Proceedings of the 4th ACM International Conference on AI in Finance, 2023

Phantom - A RL-driven Multi-Agent Framework to Model Complex Systems.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning.
CoRR, 2022

Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations.
CoRR, 2022

Phantom - An RL-driven framework for agent-based modeling of complex economic systems and markets.
CoRR, 2022

Certifiably Robust Policy Learning against Adversarial Communication in Multi-agent Systems.
CoRR, 2022

Mixture of basis for interpretable continual learning with distribution shifts.
CoRR, 2022

Consensus Multiplicative Weights Update: Learning to Learn using Projector-based Game Signatures.
Proceedings of the International Conference on Machine Learning, 2022

2021
Causal Policy Gradients.
CoRR, 2021

Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Towards a fully rl-based market simulator.
Proceedings of the ICAIF'21: 2nd ACM International Conference on AI in Finance, Virtual Event, November 3, 2021

2020
Calibration of Shared Equilibria in General Sum Partially Observable Markov Games.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Risk-sensitive reinforcement learning: a martingale approach to reward uncertainty.
Proceedings of the ICAIF '20: The First ACM International Conference on AI in Finance, 2020

2019
Reinforcement Learning for Market Making in a Multi-agent Dealer Market.
CoRR, 2019

2009
Learning and Recognition of Human Actions Using Optimal Control Primitives.
Int. J. Humanoid Robotics, 2009

2008
Recognition of Human Actions using an Optimal Control Based Motor Model.
Proceedings of the 9th IEEE Workshop on Applications of Computer Vision (WACV 2008), 2008

Representation and Recognition of Human Actions - a New Approach based on an Optimal Control Motor Model.
Proceedings of the VISAPP 2008: Proceedings of the Third International Conference on Computer Vision Theory and Applications, Funchal, Madeira, Portugal, January 22-25, 2008, 2008

2007
Composition of Dynamical Systems for Estimation of Human Body Dynamics.
Proceedings of the Hybrid Systems: Computation and Control, 10th International Workshop, 2007

2002
Blind space-time multiuser detector.
Proceedings of the 2002 International Symposium on Circuits and Systems, 2002


  Loading...