Yash Chandak

Orcid: 0000-0002-6276-5549

According to our database1, Yash Chandak authored at least 34 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
The GPT Surprise: Offering Large Language Model Chat in a Massive Coding Class Reduced Engagement but Increased Adopters Exam Performances.
CoRR, 2024

Short-Long Policy Evaluation with Novel Actions.
CoRR, 2024

Averaging log-likelihoods in direct alignment.
CoRR, 2024

Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion.
CoRR, 2024

OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators.
CoRR, 2024

Estimating the Causal Treatment Effect of Unproductive Persistence.
Proceedings of the 14th Learning Analytics and Knowledge Conference, 2024

Adaptive Instrument Design for Indirect Experiments.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

A/B testing under Interference with Partial Network Information.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023
Coagent Networks: Generalized and Scaled.
CoRR, 2023

Optimization using Parallel Gradient Evaluations on Multiple Parameters.
CoRR, 2023

Behavior Alignment via Reward Function Optimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Supervised Pretraining Can Learn In-Context Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Understanding Self-Predictive Learning for Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition.
Proceedings of the International Conference on Machine Learning, 2023

Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Scaling Graph Propagation Kernels for Predictive Learning.
Frontiers Big Data, 2022

Factored DRO: Factored Distributionally Robust Policies for Contextual Bandits.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Off-Policy Evaluation for Action-Dependent Non-stationary Environments.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On Optimizing Interventions in Shared Autonomy.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
SOPE: Spectrum of Off-Policy Estimators.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Universal Off-Policy Evaluation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

High Confidence Generalization for Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

High-Confidence Off-Policy (or Counterfactual) Variance Estimation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Reinforcement Learning for Strategic Recommendations.
CoRR, 2020

Towards Safe Policy Improvement for Non-Stationary MDPs.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Evaluating the Performance of Reinforcement Learning Algorithms.
Proceedings of the 37th International Conference on Machine Learning, 2020

Optimizing for the Future in Non-Stationary MDPs.
Proceedings of the 37th International Conference on Machine Learning, 2020

Lifelong Learning with a Changing Action Set.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Reinforcement Learning When All Actions Are Not Always Available.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Classical Policy Gradient: Preserving Bellman's Principle of Optimality.
CoRR, 2019

Learning Action Representations for Reinforcement Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
Fusion Graph Convolutional Networks.
CoRR, 2018

HOPF: Higher Order Propagation Framework for Deep Collective Classification.
CoRR, 2018

2015
On Optimizing Human-Machine Task Assignments.
CoRR, 2015


  Loading...