Adith Swaminathan

According to our database1, Adith Swaminathan authored at least 39 papers between 2012 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
AutoAttacker: A Large Language Model Guided System to Implement Automatic Cyber-attacks.
CoRR, 2024

2023
LLF-Bench: Benchmark for Interactive Learning from Language Feedback.
CoRR, 2023

Interactive Robot Learning from Verbal Correction.
CoRR, 2023

Hindsight Learning for MDPs with Exogenous Inputs.
Proceedings of the International Conference on Machine Learning, 2023

2022
Hindsight Learning for MDPs with Exogenous Inputs.
CoRR, 2022

2021
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL.
CoRR, 2021

Recommendations as Treatments.
AI Mag., 2021

Heuristic-Guided Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Provably Good Batch Reinforcement Learning Without Great Exploration.
CoRR, 2020

Improved Image Wasserstein Attacks and Defenses.
CoRR, 2020

Active Learning for ML Enhanced Database Systems.
Proceedings of the 2020 International Conference on Management of Data, 2020

REVEAL 2020: Bandit and Reinforcement Learning from User Interactions.
Proceedings of the RecSys 2020: Fourteenth ACM Conference on Recommender Systems, 2020

Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Learning Calibratable Policies using Programmatic Style-Consistency.
Proceedings of the 37th International Conference on Machine Learning, 2020

Working Memory Graphs.
Proceedings of the 37th International Conference on Machine Learning, 2020

Metareasoning in Modular Software Systems: On-the-Fly Configuration Using Reinforcement Learning with Rich Contextual Representations.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Off-Policy Policy Gradient with State Distribution Correction.
CoRR, 2019

Multi-Preference Actor Critic.
CoRR, 2019

NAIL: A General Interactive Fiction Agent.
CoRR, 2019

Off-Policy Policy Gradient with Stationary Distribution Correction.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

REVEAL 2019: closing the loop with the real world: reinforcement and robust estimators for recommendation.
Proceedings of the 13th ACM Conference on Recommender Systems, 2019

A Distillation Approach to Data Efficient Individual Treatment Effect Estimation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
REVEAL 2018: offline evaluation for recommender systems.
Proceedings of the 12th ACM Conference on Recommender Systems, 2018

Deep Learning with Logged Bandit Feedback.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Counterfactual evaluation and learning from logged user feedback.
PhD thesis, 2017

Unbiased Learning-to-Rank with Biased Feedback.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Off-policy evaluation for slate recommendation.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

2016
Large-scale Validation of Counterfactual Learning Methods: A Test-Bed.
CoRR, 2016

Counterfactual Evaluation and Learning for Search, Recommendation and Ad Placement.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Unbiased Comparative Evaluation of Ranking Functions.
Proceedings of the 2016 ACM on International Conference on the Theory of Information Retrieval, 2016

Recommendations as Treatments: Debiasing Learning and Evaluation.
Proceedings of the 33nd International Conference on Machine Learning, 2016

2015
Batch learning from logged bandit feedback through counterfactual risk minimization.
J. Mach. Learn. Res., 2015

Counterfactual Risk Minimization.
Proceedings of the 24th International Conference on World Wide Web Companion, 2015

Unbiased Ranking Evaluation on a Budget.
Proceedings of the 24th International Conference on World Wide Web Companion, 2015

The Self-Normalized Estimator for Counterfactual Learning.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Counterfactual Risk Minimization: Learning from Logged Bandit Feedback.
Proceedings of the 32nd International Conference on Machine Learning, 2015

2014
Mining Videos from the Web for Electronic Textbooks.
Proceedings of the Formal Concept Analysis - 12th International Conference, 2014

2013
Beyond myopic inference in big data pipelines.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

2012
Temporal corpus summarization using submodular word coverage.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012


  Loading...