Harshad Khadilkar

Orcid: 0000-0003-3601-778X

According to our database1, Harshad Khadilkar authored at least 62 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Transformers are Expressive, But Are They Expressive Enough for Regression?
CoRR, 2024

Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization.
CoRR, 2024

Guiding Offline Reinforcement Learning Using a Safety Expert.
Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024

A Learning Approach for Discovering Cost-Efficient Integrated Sourcing and Routing Strategies in E-Commerce.
Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024

2023
Multi-Agent Learning of Efficient Fulfilment and Routing Strategies in E-Commerce.
CoRR, 2023

Using General Value Functions to Learn Domain-Backed Inventory Management Policies.
CoRR, 2023

Using Linear Regression for Iteratively Training Neural Networks.
CoRR, 2023

DCT: Dual Channel Training of Action Embeddings for Reinforcement Learning with Large Discrete Action Spaces.
CoRR, 2023

Supplementing Gradient-Based Reinforcement Learning with Simple Evolutionary Ideas.
CoRR, 2023

Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Learning to Minimize Cost to Serve for Multi-Node Multi-Product Order Fulfilment in Electronic Commerce.
Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD), 2023

Using Contrastive Samples for Identifying and Leveraging Possible Causal Relationships in Reinforcement Learning.
Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD), 2023

Classifier Design for Decentralised Sensing with Digital Communication.
Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
Scalable multi-product inventory control with lead time constraints using reinforcement learning.
Neural Comput. Appl., 2022

A Novel Data Augmentation Technique for Out-of-Distribution Sample Detection using Compounded Corruptions.
CoRR, 2022

Solving the capacitated vehicle routing problem with timing windows using rollouts and MAX-SAT.
CoRR, 2022

A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management.
CoRR, 2022

A Novel Data Augmentation Technique for Out-of-Distribution Sample Detection Using Compounded Corruptions.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Gatekeeper: A deep reinforcement learning-cum-heuristic based algorithm for scheduling and routing trains in complex environments.
Proceedings of the International Joint Conference on Neural Networks, 2022

Identifying efficient curricula for reinforcement learning in complex environments with a fixed computational budget.
Proceedings of the CODS-COMAD 2022: 5th Joint International Conference on Data Science & Management of Data (9th ACM IKDD CODS and 27th COMAD), Bangalore, India, January 8, 2022

Performance improvement of reinforcement learning algorithms for online 3D bin packing using FPGA.
Proceedings of the Second International Conference on AI-ML Systems, 2022

2021
Fast Approximate Solutions using Reinforcement Learning for Dynamic Capacitated Vehicle Routing with Time Windows.
CoRR, 2021

School of hard knocks: Curriculum analysis for Pommerman with a fixed computational budget.
CoRR, 2021

A Simulation Driven Optimization Algorithm for Scheduling Sorting Center Operations.
Proceedings of the Winter Simulation Conference, 2021

FoLaR: Foggy Latent Representations for Reinforcement Learning with Partial Observability.
Proceedings of the International Joint Conference on Neural Networks, 2021

Anticipatory Decisions in Retail E-Commerce Warehouses using Reinforcement Learning.
Proceedings of the CODS-COMAD 2021: 8th ACM IKDD CODS and 26th COMAD, 2021

Revisiting State Augmentation methods for Reinforcement Learning with Stochastic Delays.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

SIBRE: Self Improvement Based REwards for Adaptive Feedback in Reinforcement Learning.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

2020
Sample Efficient Training in Multi-Agent Adversarial Games with Limited Teammate Communication.
CoRR, 2020

A Generalized Reinforcement Learning Algorithm for Online 3D Bin-Packing.
CoRR, 2020

Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains.
CoRR, 2020

SIBRE: Self Improvement Based REwards for Reinforcement Learning.
CoRR, 2020

Optimising Lockdown Policies for Epidemic Control using Reinforcement Learning.
CoRR, 2020

2019
A Scalable Reinforcement Learning Algorithm for Scheduling Railway Lines.
IEEE Trans. Intell. Transp. Syst., 2019

A rolling horizon optimisation model for consolidated hump yard operational planning.
J. Rail Transp. Plan. Manag., 2019

Accelerating Training in Pommerman with Imitation and Reinforcement Learning.
CoRR, 2019

Reinforcement Learning for Multi-Objective Optimization of Online Decisions in High-Dimensional Systems.
CoRR, 2019

Reinforcement Learning of Supply Chain Control Policy Using Closed Loop Multi-agent Simulation.
Proceedings of the Multi-Agent-Based Simulation XX - 20th International Workshop, 2019

A Reinforcement Learning Framework for Container Selection and Ship Load Sequencing in Ports.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Actor Based Simulation for Closed Loop Control of Supply Chain using Reinforcement Learning.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

An Imitation Learning Approach for Computing Anticipatory Picking Decisions in Retail Distribution Centres.
Proceedings of the 2019 American Control Conference, 2019

2018
Learning representations for sentiment classification using Multi-task framework.
Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, 2018

2017
Data-Enabled Stochastic Modeling for Evaluating Schedule Robustness of Railway Networks.
Transp. Sci., 2017

Scheduling of vehicle movement in resource-constrained transportation networks using a capacity-aware heuristic.
Proceedings of the 2017 American Control Conference, 2017

2016
Integrated Control of Airport and Terminal Airspace Operations.
IEEE Trans. Control. Syst. Technol., 2016

SMOME: A framework for evaluating the costs and benefits of instrumentation in smart home systems.
Proceedings of the 2016 IEEE International Conference on Smart Grid Communications, 2016

Unlocking the hidden potential of data towards efficient buildings: Findings from a pilot study in India.
Proceedings of the IEEE PES Innovative Smart Grid Technologies Conference Europe, 2016

Modelling the impact of control strategy on stochastic delay propagation in transportation networks.
Proceedings of the 15th European Control Conference, 2016

2015
A socially aware incentive strategy for encouraging residential solar uptake in Brunei.
Proceedings of the 2015 IEEE International Conference on Smart Grid Communications, 2015

Multi-User Energy Consumption Monitoring and Anomaly Detection with Partial Context Information.
Proceedings of the 2nd ACM International Conference on Embedded Systems for Energy-Efficient Built Environments, 2015

A Framework for Evaluating the Costs and Benefits of Instrumentation in Smart Home Systems.
Proceedings of the 2015 ACM Sixth International Conference on Future Energy Systems, 2015

UrJar: A Device to Address Energy Poverty Using E-Waste.
Proceedings of the 2015 ACM Sixth International Conference on Future Energy Systems, 2015

Individual and Aggregate Electrical Load Forecasting: One for All and All for One.
Proceedings of the 2015 ACM Sixth International Conference on Future Energy Systems, 2015

2014
Hybrid Communication Protocols and Control Algorithms for NextGen Aircraft Arrivals.
IEEE Trans. Intell. Transp. Syst., 2014

High Confidence Networked Control for Next Generation Air Transportation Systems.
IEEE Trans. Autom. Control., 2014

DC Picogrids as power backups for office buildings.
Proceedings of the 2014 IEEE International Conference on Smart Grid Communications, 2014

Collaborative energy conservation in a microgrid.
Proceedings of the 1st ACM Conference on Embedded Systems for Energy-Efficient Buildings, 2014

Algorithms for upgrading the resolution of aggregate energy meter data.
Proceedings of the Fifth International Conference on Future Energy Systems, 2014

UrJar: A Lighting Solution using Discarded Laptop Batteries.
Proceedings of the Fifth ACM Symposium on Computing for Development, 2014

2013
Optimal control of airport operations with gate capacity constraints.
Proceedings of the 12th European Control Conference, 2013

2012
A network congestion control approach to airport departure management.
Proceedings of the American Control Conference, 2012


  Loading...