We stand with Ukraine

We stand with Ukraine

Sunishchal Dev

According to our database¹, Sunishchal Dev authored at least 24 papers between 2025 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Seven simple steps for log analysis in AI systems.

[DOI]

CoRR, April, 2026

The science and practice of proportionality in AI risk evaluations.

[DOI]

CoRR, March, 2026

Judge Reliability Harness: Stress Testing the Reliability of LLM Judges.

[DOI]

,

,

,

,

CoRR, March, 2026

Broken Chains: The Cost of Incomplete Reasoning in LLMs.

[DOI]

,

Gaurav Purushothaman

,

,

,

,

,

,

Maheep Chaudhary

CoRR, February, 2026

ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs.

[DOI]

Rohan Subramanian Thomas

,

Shikhar Shiromani

,

Abdullah Chaudhry

,

,

,

,

CoRR, February, 2026

Visualizing and Benchmarking LLM Factual Hallucination Tendencies via Internal State Analysis and Clustering.

[DOI]

,

,

Shreya Shivkumar

,

Parham Sharafoleslami

,

,

CoRR, February, 2026

A Few Bad Neurons: Isolating and Surgically Correcting Sycophancy.

[DOI]

,

,

,

,

,

,

,

,

CoRR, January, 2026

2025

Emergent Persuasion: Will LLMs Persuade Without Being Prompted?

[DOI]

,

,

,

,

,

,

CoRR, December, 2025

Peek-a-Boo Reasoning: Contrastive Region Masking in MLLMs.

[DOI]

Isha Chaturvedi

,

,

,

Adhitya Rajendra Kumar

,

,

,

,

CoRR, December, 2025

COMPASS: Context-Modulated PID Attention Steering System for Hallucination Mitigation.

[DOI]

,

,

,

,

Shikhar Shiromani

,

,

CoRR, November, 2025

Sumudu Neural Operator for ODEs and PDEs.

[DOI]

,

Saibilila Abudukelimu

,

,

,

CoRR, November, 2025

Modeling and Predicting Multi-Turn Answer Instability in Large Language Models.

[DOI]

,

Rishi Ramachandran

,

Neel Ramachandran

,

,

,

,

,

Aryan Shrivastava

CoRR, November, 2025

SALT: Steering Activations towards Leakage-free Thinking in Chain of Thought.

[DOI]

,

,

,

Shashank Kesineni

,

,

,

,

,

Maheep Chaudhary

CoRR, November, 2025

Inference-Time Chain-of-Thought Pruning with Latent Informativeness Signals.

[DOI]

,

,

,

,

,

,

CoRR, November, 2025

DuoLens: A Framework for Robust Detection of Machine-Generated Multilingual Text and Code.

[DOI]

Shriyansh Agrawal

,

,

,

,

,

,

CoRR, October, 2025

AgentChangeBench: A Multi-Dimensional Evaluation Framework for Goal-Shift Robustness in Conversational AI.

[DOI]

,

,

Anotida Expected Msiiwa

,

,

,

,

,

CoRR, October, 2025

Limits of Emergent Reasoning of Large Language Models in Agentic Frameworks for Deterministic Games.

[DOI]

,

,

Matheus Marques

,

,

,

CoRR, October, 2025

Emergent Misalignment via In-Context Learning: Narrow in-context examples can produce broadly misaligned LLMs.

[DOI]

,

Nikita Andriyanov

,

Nikhil Bageshpura

,

,

,

,

,

Alexander Panchenko

,

,

Elena Tutubalina

,

Mikhail Seleznyov

CoRR, October, 2025

PALADIN: Self-Correcting Language Model Agents to Cure Tool-Failure Cases.

[DOI]

Sri Vatsa Vuddanti

,

,

Satwik Kumar Chittiprolu

,

,

,

,

Maheep Chaudhary

CoRR, September, 2025

Amortized Latent Steering: Low-Cost Alternative to Test-Time Optimization.

[DOI]

,

,

,

,

Maheep Chaudhary

CoRR, September, 2025

FRIT: Using Causal Importance to Improve Chain-of-Thought Faithfulness.

[DOI]

,

,

Saksham Uboweja

,

Adiliia Uzdenova

,

,

,

,

,

,

Maheep Chaudhary

CoRR, September, 2025

Recommendations and Reporting Checklist for Rigorous & Transparent Human Baselines in Model Evaluations.

[DOI]

,

Patricia Paskov

,

,

Michael J. Byun

,

,

Xavier Roberts-Gaal

,

,

,

Chinmay Deshpande

CoRR, June, 2025

RAPTOR: Reasoned Agentic Portfolio Trading with Orchestrated Rebalancing.

[DOI]

,

Matthew Caliboso

,

,

,

,

Mithil Srungarapu

,

,

,

Proceedings of the 1st Workshop on Knowledge Graphs & Agentic Systems Interplay (NORA 2025) co-located with the Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025), 2025

Position: Human Baselines in Model Evaluations Need Rigor and Transparency (With Recommendations & Reporting Checklist).

[DOI]

,

Patricia Paskov

,

,

Michael J. Byun

,

,

Xavier Roberts-Gaal

,

,

,

Chinmay Deshpande

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Loading...