Sunishchal Dev

According to our database1, Sunishchal Dev authored at least 24 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Seven simple steps for log analysis in AI systems.
CoRR, April, 2026

The science and practice of proportionality in AI risk evaluations.
CoRR, March, 2026

Judge Reliability Harness: Stress Testing the Reliability of LLM Judges.
CoRR, March, 2026

Broken Chains: The Cost of Incomplete Reasoning in LLMs.
CoRR, February, 2026

ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs.
CoRR, February, 2026

Visualizing and Benchmarking LLM Factual Hallucination Tendencies via Internal State Analysis and Clustering.
CoRR, February, 2026

A Few Bad Neurons: Isolating and Surgically Correcting Sycophancy.
CoRR, January, 2026

2025
Emergent Persuasion: Will LLMs Persuade Without Being Prompted?
CoRR, December, 2025

Peek-a-Boo Reasoning: Contrastive Region Masking in MLLMs.
CoRR, December, 2025

COMPASS: Context-Modulated PID Attention Steering System for Hallucination Mitigation.
CoRR, November, 2025

Sumudu Neural Operator for ODEs and PDEs.
CoRR, November, 2025

Modeling and Predicting Multi-Turn Answer Instability in Large Language Models.
CoRR, November, 2025

SALT: Steering Activations towards Leakage-free Thinking in Chain of Thought.
CoRR, November, 2025

Inference-Time Chain-of-Thought Pruning with Latent Informativeness Signals.
CoRR, November, 2025

DuoLens: A Framework for Robust Detection of Machine-Generated Multilingual Text and Code.
CoRR, October, 2025

AgentChangeBench: A Multi-Dimensional Evaluation Framework for Goal-Shift Robustness in Conversational AI.
CoRR, October, 2025

Limits of Emergent Reasoning of Large Language Models in Agentic Frameworks for Deterministic Games.
CoRR, October, 2025

Emergent Misalignment via In-Context Learning: Narrow in-context examples can produce broadly misaligned LLMs.
CoRR, October, 2025

PALADIN: Self-Correcting Language Model Agents to Cure Tool-Failure Cases.
CoRR, September, 2025

Amortized Latent Steering: Low-Cost Alternative to Test-Time Optimization.
CoRR, September, 2025

FRIT: Using Causal Importance to Improve Chain-of-Thought Faithfulness.
CoRR, September, 2025

Recommendations and Reporting Checklist for Rigorous & Transparent Human Baselines in Model Evaluations.
CoRR, June, 2025

RAPTOR: Reasoned Agentic Portfolio Trading with Orchestrated Rebalancing.
Proceedings of the 1st Workshop on Knowledge Graphs & Agentic Systems Interplay (NORA 2025) co-located with the Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025), 2025

Position: Human Baselines in Model Evaluations Need Rigor and Transparency (With Recommendations & Reporting Checklist).
Proceedings of the Forty-second International Conference on Machine Learning, 2025


  Loading...