Subhojyoti Mukherjee

Orcid: 0000-0003-0537-7184

According to our database1, Subhojyoti Mukherjee authored at least 40 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
AdvantageFlow: Advantage-Weighted Least Squares for RL in Flow Models.
CoRR, May, 2026

MOCHA: Multi-Objective Chebyshev Annealing for Agent Skill Optimization.
CoRR, May, 2026

Sparse Personalized Text Generation with Multi-Trajectory Reasoning.
CoRR, April, 2026

A Survey on LLM-based Conversational User Simulation.
CoRR, April, 2026

Stepwise Credit Assignment for GRPO on Flow-Matching Models.
CoRR, March, 2026

Agentic Planning with Reasoning for Image Styling via Offline RL.
CoRR, March, 2026

Partial Policy Gradients for RL in LLMs.
CoRR, March, 2026

InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions.
CoRR, March, 2026

Reasoning-Based Personalized Generation for Users with Sparse Data.
CoRR, February, 2026

Human-Aligned MLLM Judges for Fine-Grained Image Editing Evaluation: A Benchmark, Framework, and Analysis.
CoRR, February, 2026


2025
Learning to Reason in LLMs by Expectation Maximization.
CoRR, December, 2025

StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos.
CoRR, December, 2025

MLLM as a UI Judge: Benchmarking Multimodal LLMs for Predicting Human Perception of User Interfaces.
CoRR, October, 2025

A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality.
CoRR, July, 2025

Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning.
CoRR, June, 2025

A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations.
CoRR, May, 2025

Logits are All We Need to Adapt Closed Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025



2024
Multi-Objective Alignment of Large Language Models Through Hypervolume Maximization.
CoRR, 2024

Off-Policy Evaluation from Logged Human Feedback.
CoRR, 2024

Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning.
CoRR, 2024

Optimal Design for Human Feedback.
CoRR, 2024

Experimental Design for Active Transductive Inference in Large Language Models.
CoRR, 2024

Optimal Design for Human Preference Elicitation.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023
Efficient and Interpretable Bandit Algorithms.
CoRR, 2023

Multi-task Representation Learning for Pure Exploration in Bilinear Bandits.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
ReVar: Strengthening policy evaluation via reduced variance sampling.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Safety aware changepoint detection for piecewise i.i.d. bandits.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Chernoff Sampling for Active Testing and Extension to Active Regression.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Nearly Optimal Algorithms for Level Set Estimation.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
A Unified Approach to Translate Classical Bandit Algorithms to Structured Bandits.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
A Unified Approach to Translate Classical Bandit Algorithms to the Structured Bandit Setting.
IEEE J. Sel. Areas Inf. Theory, 2020

Generalized Chernoff Sampling for Active Learning and Structured Bandit Algorithms.
CoRR, 2020

2019
Distribution-dependent and Time-uniform Bounds for Piecewise i.i.d Bandits.
CoRR, 2019

2018
Efficient-UCBV: An Almost Optimal Algorithm Using Variance Estimates.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Thresholding Bandits with Augmented UCB.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017


  Loading...