Subhojyoti Mukherjee

Orcid: 0000-0003-0537-7184

According to our database, Subhojyoti Mukherjee authored at least 40 papers between 2017 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

AdvantageFlow: Advantage-Weighted Least Squares for RL in Flow Models.

CoRR, May, 2026

MOCHA: Multi-Objective Chebyshev Annealing for Agent Skill Optimization.

Md. Mehrab Tanjim

Jayakumar Subramanian

CoRR, May, 2026

Sparse Personalized Text Generation with Multi-Trajectory Reasoning.

CoRR, April, 2026

A Survey on LLM-based Conversational User Simulation.

CoRR, April, 2026

Stepwise Credit Assignment for GRPO on Flow-Matching Models.

CoRR, March, 2026

Agentic Planning with Reasoning for Image Styling via Offline RL.

CoRR, March, 2026

Partial Policy Gradients for RL in LLMs.

CoRR, March, 2026

InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions.

CoRR, March, 2026

Reasoning-Based Personalized Generation for Users with Sparse Data.

CoRR, February, 2026

Human-Aligned MLLM Judges for Fine-Grained Image Editing Evaluation: A Benchmark, Framework, and Analysis.

CoRR, February, 2026

A Survey on LLM-based Conversational User Simulation.

Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025

Learning to Reason in LLMs by Expectation Maximization.

CoRR, December, 2025

StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos.

CoRR, December, 2025

MLLM as a UI Judge: Benchmarking Multimodal LLMs for Predicting Human Perception of User Interfaces.

Cindy Xiong Bearfield

Branislav Kveton

CoRR, October, 2025

A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality.

CoRR, July, 2025

Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning.

Jayakumar Subramanian

Branislav Kveton

CoRR, June, 2025

A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations.

CoRR, May, 2025

Logits are All We Need to Adapt Closed Models.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality.

Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

From Selection to Generation: A Survey of LLM-based Active Learning.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Multi-Objective Alignment of Large Language Models Through Hypervolume Maximization.

CoRR, 2024

Off-Policy Evaluation from Logged Human Feedback.

CoRR, 2024

Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning.

CoRR, 2024

Optimal Design for Human Feedback.

CoRR, 2024

Experimental Design for Active Transductive Inference in Large Language Models.

CoRR, 2024

Optimal Design for Human Preference Elicitation.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP.

Subhojyoti Mukherjee

Josiah P. Hanna

Robert D. Nowak

Proceedings of the Forty-first International Conference on Machine Learning, 2024

SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023

Efficient and Interpretable Bandit Algorithms.

Subhojyoti Mukherjee

Ruihao Zhu

Branislav Kveton

CoRR, 2023

Multi-task Representation Learning for Pure Exploration in Bilinear Bandits.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

ReVar: Strengthening policy evaluation via reduced variance sampling.

Subhojyoti Mukherjee

Josiah P. Hanna

Robert D. Nowak

Proceedings of the Uncertainty in Artificial Intelligence, 2022

Safety aware changepoint detection for piecewise i.i.d. bandits.

Subhojyoti Mukherjee

Proceedings of the Uncertainty in Artificial Intelligence, 2022

Chernoff Sampling for Active Testing and Extension to Active Regression.

Subhojyoti Mukherjee

Ardhendu S. Tripathy

Robert D. Nowak

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Nearly Optimal Algorithms for Level Set Estimation.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

A Unified Approach to Translate Classical Bandit Algorithms to Structured Bandits.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

2020

A Unified Approach to Translate Classical Bandit Algorithms to the Structured Bandit Setting.

IEEE J. Sel. Areas Inf. Theory, 2020

Generalized Chernoff Sampling for Active Learning and Structured Bandit Algorithms.

Subhojyoti Mukherjee

Ardhendu Tripathy

Robert D. Nowak

CoRR, 2020

2019

Distribution-dependent and Time-uniform Bounds for Piecewise i.i.d Bandits.

Subhojyoti Mukherjee

Odalric-Ambrym Maillard

CoRR, 2019

2018

Efficient-UCBV: An Almost Optimal Algorithm Using Variance Estimates.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Thresholding Bandits with Augmented UCB.

Subhojyoti Mukherjee

Kolar Purushothama Naveen

Nandan Sudarsanam

Balaraman Ravindran

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
