Harshad Khadilkar

Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

2024

Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Aditya Kapoor

Sushant Swamy

Kale-ab Abebe Tessera

CoRR, 2024

DeepClean: Integrated Distortion Identification and Algorithm Selection for Rectifying Image Corruptions.

[BibT_eX]

[DOI]

Aditya Kapoor

Jayavardhana Gubbi

CoRR, 2024

Transformers are Expressive, But Are They Expressive Enough for Regression?

[BibT_eX]

[DOI]

Swaroop Nath

Sankara Sri Raghava Ravindra Muddu

Pushpak Bhattacharyya

CoRR, 2024

Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization.

[BibT_eX]

[DOI]

Swaroop Nath

Tejpalsingh Siledar

Rupasai Rangaraju

Pushpak Bhattacharyya

Suman Banerjee

Amey Patil

Sudhanshu Shekhar Singh

Muthusamy Chelliah

Nikesh Garera

CoRR, 2024

Linear-Time Optimal Deadlock Detection for Efficient Scheduling in Multi-Track Railway Networks.

[BibT_eX]

[DOI]

Shivaram Kalyanakrishnan

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Guiding Offline Reinforcement Learning Using a Safety Expert.

[BibT_eX]

[DOI]

Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024

Deep reinforcement learning approach for routing and scheduling of trains at railway station.

[BibT_eX]

[DOI]

Sudhir R. Shetiya

Shripad Salsingikar

Proceedings of the 8th International Conference on Data Science and Management of Data (12th ACM IKDD CODS and 30th COMAD), 2024

A Learning Approach for Discovering Cost-Efficient Integrated Sourcing and Routing Strategies in E-Commerce.

[BibT_eX]

[DOI]

Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024

DCT: Dual Channel Training of Action Embeddings for Reinforcement Learning with Large Discrete Action Spaces.

[BibT_eX]

[DOI]

Pranavi Pathakota

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

ORCHID: Offline RL for Control of HVAC in Buildings using Historical and Low-Fidelity Simulation Data.

[BibT_eX]

[DOI]

Richa Verma

Srikar Babu Gadipudi

Srinarayana Nagarathinam

Proceedings of the 4th International Conference on AI-ML Systems, 2024

2023

Multi-Agent Learning of Efficient Fulfilment and Routing Strategies in E-Commerce.

[BibT_eX]

[DOI]

CoRR, 2023

Using General Value Functions to Learn Domain-Backed Inventory Management Policies.

[BibT_eX]

[DOI]

Durgesh Kalwar

CoRR, 2023

Using Linear Regression for Iteratively Training Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2023

Supplementing Gradient-Based Reinforcement Learning with Simple Evolutionary Ideas.

[BibT_eX]

[DOI]

CoRR, 2023

Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Swaroop Nath

Pushpak Bhattacharyya

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Learning to Minimize Cost to Serve for Multi-Node Multi-Product Order Fulfilment in Electronic Commerce.

[BibT_eX]

[DOI]

Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD), 2023

Using Contrastive Samples for Identifying and Leveraging Possible Causal Relationships in Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD), 2023

Classifier Design for Decentralised Sensing with Digital Communication.

[BibT_eX]

[DOI]

Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022

Scalable multi-product inventory control with lead time constraints using reinforcement learning.

[BibT_eX]

[DOI]

Neural Comput. Appl., 2022

A Novel Data Augmentation Technique for Out-of-Distribution Sample Detection using Compounded Corruptions.

[BibT_eX]

[DOI]

Ramya S. Hebbalaguppe

CoRR, 2022

Solving the capacitated vehicle routing problem with timing windows using rollouts and MAX-SAT.

[BibT_eX]

[DOI]

CoRR, 2022

A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management.

[BibT_eX]

[DOI]

CoRR, 2022

A Novel Data Augmentation Technique for Out-of-Distribution Sample Detection Using Compounded Corruptions.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Gatekeeper: A deep reinforcement learning-cum-heuristic based algorithm for scheduling and routing trains in complex environments.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2022

Identifying efficient curricula for reinforcement learning in complex environments with a fixed computational budget.

[BibT_eX]

[DOI]

Proceedings of the CODS-COMAD 2022: 5th Joint International Conference on Data Science & Management of Data (9th ACM IKDD CODS and 27th COMAD), Bangalore, India, January 8, 2022

Performance improvement of reinforcement learning algorithms for online 3D bin packing using FPGA.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on AI-ML Systems, 2022

2021

Fast Approximate Solutions using Reinforcement Learning for Dynamic Capacitated Vehicle Routing with Time Windows.

[BibT_eX]

[DOI]

CoRR, 2021

School of hard knocks: Curriculum analysis for Pommerman with a fixed computational budget.

[BibT_eX]

[DOI]

CoRR, 2021

A Simulation Driven Optimization Algorithm for Scheduling Sorting Center Operations.

[BibT_eX]

[DOI]

Proceedings of the Winter Simulation Conference, 2021

FoLaR: Foggy Latent Representations for Reinforcement Learning with Partial Observability.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2021

Anticipatory Decisions in Retail E-Commerce Warehouses using Reinforcement Learning.

[BibT_eX]

[DOI]

Vinita Baniwal

Proceedings of the CODS-COMAD 2021: 8th ACM IKDD CODS and 26th COMAD, 2021

Revisiting State Augmentation methods for Reinforcement Learning with Stochastic Delays.

[BibT_eX]

[DOI]

Somjit Nath

Mayank Baranwal

Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

SIBRE: Self Improvement Based REwards for Adaptive Feedback in Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020

Sample Efficient Training in Multi-Agent Adversarial Games with Limited Teammate Communication.

[BibT_eX]

[DOI]

CoRR, 2020

A Generalized Reinforcement Learning Algorithm for Online 3D Bin-Packing.

[BibT_eX]

[DOI]

CoRR, 2020

Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains.

[BibT_eX]

[DOI]

CoRR, 2020

SIBRE: Self Improvement Based REwards for Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2020

Optimising Lockdown Policies for Epidemic Control using Reinforcement Learning.

[BibT_eX]

[DOI]

Tanuja Ganu

Deva P. Seetharam

CoRR, 2020

2019

A Scalable Reinforcement Learning Algorithm for Scheduling Railway Lines.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., 2019

A rolling horizon optimisation model for consolidated hump yard operational planning.

[BibT_eX]

[DOI]

J. Rail Transp. Plan. Manag., 2019

Accelerating Training in Pommerman with Imitation and Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Reinforcement Learning for Multi-Objective Optimization of Online Decisions in High-Dimensional Systems.

[BibT_eX]

[DOI]

CoRR, 2019

Reinforcement Learning of Supply Chain Control Policy Using Closed Loop Multi-agent Simulation.

[BibT_eX]

[DOI]

Proceedings of the Multi-Agent-Based Simulation XX - 20th International Workshop, 2019

A Reinforcement Learning Framework for Container Selection and Ship Load Sequencing in Ports.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Actor Based Simulation for Closed Loop Control of Supply Chain using Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

An Imitation Learning Approach for Computing Anticipatory Picking Decisions in Retail Distribution Centres.

[BibT_eX]

[DOI]

Proceedings of the 2019 American Control Conference, 2019

2018

Learning representations for sentiment classification using Multi-task framework.

[BibT_eX]

[DOI]

Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, 2018

2017

Data-Enabled Stochastic Modeling for Evaluating Schedule Robustness of Railway Networks.

[BibT_eX]

[DOI]

Transp. Sci., 2017

Scheduling of vehicle movement in resource-constrained transportation networks using a capacity-aware heuristic.

[BibT_eX]

[DOI]

Proceedings of the 2017 American Control Conference, 2017

2016

Integrated Control of Airport and Terminal Airspace Operations.

[BibT_eX]

[DOI]

Hamsa Balakrishnan

IEEE Trans. Control. Syst. Technol., 2016

SMOME: A framework for evaluating the costs and benefits of instrumentation in smart home systems.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Smart Grid Communications, 2016

Unlocking the hidden potential of data towards efficient buildings: Findings from a pilot study in India.

[BibT_eX]

[DOI]

Proceedings of the IEEE PES Innovative Smart Grid Technologies Conference Europe, 2016

Modelling the impact of control strategy on stochastic delay propagation in transportation networks.

[BibT_eX]

[DOI]

Pg Mohammad Iskandarbin Pg Hj Petra

Proceedings of the 15th European Control Conference, 2016

2015

A socially aware incentive strategy for encouraging residential solar uptake in Brunei.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Smart Grid Communications, 2015

Multi-User Energy Consumption Monitoring and Anomaly Detection with Partial Context Information.

[BibT_eX]

[DOI]

Proceedings of the 2nd ACM International Conference on Embedded Systems for Energy-Efficient Built Environments, 2015

A Framework for Evaluating the Costs and Benefits of Instrumentation in Smart Home Systems.

[BibT_eX]

[DOI]

Proceedings of the 2015 ACM Sixth International Conference on Future Energy Systems, 2015

UrJar: A Device to Address Energy Poverty Using E-Waste.

[BibT_eX]

[DOI]

Proceedings of the 2015 ACM Sixth International Conference on Future Energy Systems, 2015

Individual and Aggregate Electrical Load Forecasting: One for All and All for One.

[BibT_eX]

[DOI]

Sambaran Bandyopadhyay

Tanuja Ganu

Rodzay bin Haji Abdul Wahab

Vijay Arya

Proceedings of the 2015 ACM Sixth International Conference on Future Energy Systems, 2015

2014

Hybrid Communication Protocols and Control Algorithms for NextGen Aircraft Arrivals.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., 2014

High Confidence Networked Control for Next Generation Air Transportation Systems.

[BibT_eX]

[DOI]

IEEE Trans. Autom. Control., 2014

DC Picogrids as power backups for office buildings.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Smart Grid Communications, 2014

Collaborative energy conservation in a microgrid.

[BibT_eX]

[DOI]

Liyanage Chandratilake De Silva

Deva P. Seetharam

Proceedings of the 1st ACM Conference on Embedded Systems for Energy-Efficient Buildings, 2014

Algorithms for upgrading the resolution of aggregate energy meter data.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Future Energy Systems, 2014

UrJar: A Lighting Solution using Discarded Laptop Batteries.

[BibT_eX]

[DOI]

Proceedings of the Fifth ACM Symposium on Computing for Development, 2014

2013

Optimal control of airport operations with gate capacity constraints.

[BibT_eX]

[DOI]

Hamsa Balakrishnan

Proceedings of the 12th European Control Conference, 2013

2012

A network congestion control approach to airport departure management.

[BibT_eX]

[DOI]