Bikramjit Banerjee

ORCID: 0000-0001-7999-0307

According to our database, Bikramjit Banerjee authored at least 66 papers between 2000 and 2024.

Bibliography

2024
Modeling and reinforcement learning in partially observable many-agent systems.
Auton. Agents Multi Agent Syst., June, 2024

2023
Latent Interactive A2C for Improved RL in Open Many-Agent Systems.
CoRR, 2023

2022
Reinforcement learning as a rehearsal for swarm foraging.
Swarm Intell., 2022

Reinforcement learning in many-agent settings under partial observability.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Online Inverse Reinforcement Learning with Learned Observation Model.
Proceedings of the Conference on Robot Learning, 2022

2021
Human-agent transfer from observations.
Knowl. Eng. Rev., 2021

PALO bounds for reinforcement learning in partially observable stochastic games.
Neurocomputing, 2021

Many Agent Reinforcement Learning Under Partial Observability.
CoRR, 2021

I2RL: online inverse reinforcement learning under occlusion.
Auton. Agents Multi Agent Syst., 2021

Min-Max Entropy Inverse RL of Multiple Tasks.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Cooperative-Competitive Reinforcement Learning with History-Dependent Rewards.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020
Maximum Entropy Multi-Task Inverse RL.
CoRR, 2020

2019
Team learning from human demonstration with coordination confidence.
Knowl. Eng. Rev., 2019

Online Inverse Reinforcement Learning Under Occlusion.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Model-Free IRL Using Maximum Likelihood Estimation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
A Framework and Method for Online Inverse Reinforcement Learning.
CoRR, 2018

Autonomous Acquisition of Behavior Trees for Robot Control.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

2017
Multiagent Path Finding With Persistence Conflicts.
IEEE Trans. Comput. Intell. AI Games, 2017

Multirobot Systems.
IEEE Intell. Syst., 2017

Exact and Heuristic Algorithms for Risk-Aware Stochastic Physical Search.
Comput. Intell., 2017

2016
Multi-agent reinforcement learning as a rehearsal for decentralized planning.
Neurocomputing, 2016

Reinforcement Learning in Partially Observable Multiagent Settings: Monte Carlo Exploring Policies with PAC Bounds.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Detection of Plan Deviation in Multi-Agent Systems.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Stackelberg Surveillance.
Informatica (Slovenia), 2015

The complexity of multi-agent plan recognition.
Auton. Agents Multi Agent Syst., 2015

2014
Reinforcement Learning of Informed Initial Policies for Decentralized Planning.
ACM Trans. Auton. Adapt. Syst., 2014

Model AI Assignments 2014.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Concurrent reinforcement learning as a rehearsal for decentralized planning under uncertainty.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Pruning for Monte Carlo Distributed Reinforcement Learning in Decentralized POMDPs.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Strategic best-response learning in multiagent systems.
J. Exp. Theor. Artif. Intell., 2012

Efficient context free parsing of multi-agent activities for team and plan recognition.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Informed Initial Policies for Learning in Dec-POMDPs.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Sample Bounded Distributed Reinforcement Learning for Decentralized POMDPs.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
Action Discovery for Single and Multi-Agent Reinforcement Learning.
Adv. Complex Syst., 2011

Adaptive Multi-robot Team Reconfiguration Using a Policy-Reuse Reinforcement Learning Approach.
Proceedings of the Advanced Agent Technology, 2011

Branch and Price for Multi-Agent Plan Recognition.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Fast A* with Iterative Resolution for Navigation.
Int. J. Artif. Intell. Tools, 2010

Action discovery for reinforcement learning.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Validation of agent based crowd egress simulation.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Coalition structure generation in multi-agent systems with mixed externalities.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Evaluation and Comparison of Multi-agent Based Crowd Simulation Systems.
Proceedings of the Agents for Games and Simulations II, 2010

Multi-Agent Plan Recognition: Formalization and Algorithms.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

Search Performance of Multi-Agent Plan Recognition in a General Model.
Proceedings of the Plan, Activity, and Intent Recognition, 2010

2009
Layered Intelligence for Agent-based Crowd Simulation.
Simul., 2009

2008
Advancing the Layered Approach to Agent-Based Crowd Simulation.
Proceedings of the 22nd International Workshop on Principles of Advanced and Distributed Simulation, 2008

Congestion Avoidance in Multi-Agent-based Egress Simulation.
Proceedings of the 2008 International Conference on Artificial Intelligence, 2008

2007
Generalized multiagent learning with performance bound.
Auton. Agents Multi Agent Syst., 2007

General Game Learning Using Knowledge Transfer.
Proceedings of the IJCAI 2007, 2007

2006
Reactivity and Safe Learning in Multi-Agent Systems.
Adapt. Behav., 2006

RVσ(t): a unifying approach to performance and convergence in online multiagent learning.
Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2006), 2006

2005
Unifying Convergence and No-Regret in Multiagent Learning.
Proceedings of the Learning and Adaption in Multi-Agent Systems, 2005

Efficient learning of multi-step best response.
Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

On the performance of on-line concurrent reinforcement learners.
Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2005), 2005

Efficient No-Regret Multiagent Learning.
Proceedings of the Proceedings, 2005

2004
On-policy concurrent reinforcement learning.
J. Exp. Theor. Artif. Intell., 2004

The Role of Reactivity in Multiagent Learning.
Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

Performance Bounded Reinforcement Learning in Strategic Interactions.
Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

2003
Adaptive policy gradient in multiagent learning.
Proceedings of the Second International Joint Conference on Autonomous Agents & Multiagent Systems, 2003

2002
Kernel Index for Relevance feedback Retrieval.
Proceedings of the FSDK'02, 2002

Convergent Gradient Ascent in General-Sum Games.
Proceedings of the Machine Learning: ECML 2002, 2002

2001
Mining user session data to facilitate user interaction with a customer service knowledge base in RightNow Web.
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, 2001

Fast Concurrent Reinforcement Learners.
Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

2000
Using Bayesian Networks to Model Agent Relationships.
Appl. Artif. Intell., 2000

Combining Multiple Perspectives.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Learning Mutual Trust.
Proceedings of the Fourth International Conference on Autonomous Agents, 2000

Selecting partners.
Proceedings of the Fourth International Conference on Autonomous Agents, 2000

