Vinija Jain
According to our database, Vinija Jain authored at least 85 papers between 2021 and 2026.
Bibliography
2026
SPHERICAL KV: Angle-Domain Attention and Rate-Distortion Retention for Efficient Long-Context Inference.
CoRR, May, 2026
SleepWalk: A Three-Tier Benchmark for Stress-Testing Instruction-Guided Vision-Language Navigation.
CoRR, May, 2026
Moral Sensitivity in LLMs: A Tiered Evaluation of Contextual Bias via Behavioral Profiling and Mechanistic Interpretability.
CoRR, May, 2026
Personality Shapes Gender Bias in Persona-Conditioned LLM Narratives Across English and Hindi: An Empirical Investigation.
CoRR, April, 2026
PermaFrost-Attack: Stealth Pretraining Seeding(SPS) for planting Logic Landmines During LLM Training.
CoRR, April, 2026
CONSCIENTIA: Can LLM Agents Learn to Strategize? Emergent Deception and Trust in a Multi-Agent NYC Simulation.
CoRR, April, 2026
Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models.
CoRR, March, 2026
The Reasoning Trap - Logical Reasoning as a Mechanistic Pathway to Situational Awareness.
CoRR, March, 2026
SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement.
CoRR, March, 2026
When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning.
CoRR, March, 2026
I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift.
CoRR, March, 2026
CoRR, March, 2026
CoRR, February, 2026
Neural FOXP2 - Language Specific Neuron Steering for Targeted Language Improvement in LLMs.
CoRR, February, 2026
Stochastic CHAOS: Why Deterministic Inference Kills, and Distributional Variability Is the Heartbeat of Artifical Cognition.
CoRR, January, 2026
CoRR, January, 2026
ECLIPTICA - A Framework for Switchable LLM Alignment via CITA - Contrastive Instruction-Tuned Alignment.
CoRR, January, 2026
CoRR, January, 2026
Proceedings of the 59th Hawaii International Conference on System Sciences, 2026
2025
AlignMerge - Alignment-Preserving Large Language Model Merging via Fisher-Guided Geometric Constraints.
CoRR, December, 2025
D-STEER - Preference Alignment Techniques Learn to Behave, not to Believe - Beneath the Surface, DPO as Steering Vector Perturbation in Activation Space.
CoRR, December, 2025
Catch Me If You Can: How Smaller Reasoning Models Pretend to Reason with Mathematical Fidelity.
CoRR, December, 2025
CoRR, November, 2025
Alignment Faking - the Train -> Deploy Asymmetry: Through a Game-Theoretic Lens with Bayesian-Stackelberg Equilibria.
CoRR, November, 2025
A Novel XAI-Enhanced Quantum Adversarial Networks for Velocity Dispersion Modeling in MaNGA Galaxies.
CoRR, October, 2025
CoRR, October, 2025
DeHate: A Stable Diffusion-based Multimodal Approach to Mitigate Hate Speech in Images.
CoRR, September, 2025
AMBEDKAR-A Multi-level Bias Elimination through a Decoding Approach with Knowledge Augmentation for Robust Constitutional Alignment of Language Models.
CoRR, September, 2025
AlignGuard-LoRA: Alignment-Preserving Fine-Tuning via Fisher-Guided Decomposition and Riemannian-Geodesic Collision Regularization.
CoRR, August, 2025
TRACEALIGN - Tracing the Drift: Attributing Alignment Failures to Training-Time Belief Sources in LLMs.
CoRR, August, 2025
CoRR, July, 2025
MOD-X: A Modular Open Decentralized eXchange Framework proposal for Heterogeneous Interoperable Artificial Intelligence Agents.
CoRR, July, 2025
RADIANT: Retrieval AugmenteD entIty-context AligNmenT - Introducing RAG-ability and Entity-Context Divergence.
CoRR, July, 2025
Peccavi: Visual Paraphrase Attack Safe and Distortion Free Image Watermarking Technique for AI-Generated Images.
CoRR, June, 2025
QuickSilver - Speeding up LLM Inference through Dynamic Token Halting, KV Skipping, Contextual Token Fusion, and Adaptive Matryoshka Quantization.
CoRR, June, 2025
AdversariaL attacK sAfety aLIgnment(ALKALI): Safeguarding LLMs through GRACE: Geometric Representation-Aware Contrastive Enhancement- Introducing Adversarial Vulnerability Quality Index (AVQI).
CoRR, June, 2025
YINYANG-ALIGN: Benchmarking Contradictory Objectives and Proposing Multi-Objective Optimization based DPO for Text-to-Image Alignment.
CoRR, February, 2025
Multilingual State Space Models for Structured Question Answering in Indic Languages.
CoRR, February, 2025
IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding.
CoRR, January, 2025
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025
Proceedings of the 6th IEEE International Conference on Image Processing, 2025
The Visual Counter Turing Test (VCT²): A Benchmark for Evaluating AI-Generated Image Detection and the Visual AI Index (V_AI).
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025
PHAnToM: Persona-Based Prompting Has an Effect on Theory-of-Mind Reasoning in Large Language Models.
Proceedings of the Nineteenth International AAAI Conference on Web and Social Media, 2025
Alignment Quality Index (AQI) : Beyond Refusals: AQI as an Intrinsic Alignment Diagnostic via Latent Geometry, Cluster Divergence, and Layer wise Pooled Representations.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
KnowledgePrompts: Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
LLMsAgainstHate@NLU of Devanagari Script Languages 2025: Hate Speech Detection and Target Identification in Devanagari Languages via Parameter Efficient Fine-Tuning of LLMs.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Exploring the Impact of Large Language Models on Recommender Systems: An Extensive Review.
Proceedings of the IEEE International Conference on Big Data, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), 2025
DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
YinYang-Align: A new Benchmark for Competing Objectives and Introducing Multi-Objective Preference based Text-to-Image Alignment.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
2024
Visual Counter Turing Test (VCT²): Discovering the Challenges for AI-Generated Image Detection and Introducing Visual AI Index (V_AI).
CoRR, 2024
ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models.
CoRR, 2024
The Brittleness of AI-Generated Image Watermarking Techniques: Examining Their Robustness Against Visual Paraphrasing Attacks.
CoRR, 2024
Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models.
CoRR, 2024
Evaluating Open Language Models Across Task Types, Application Domains, and Reasoning Types: An In-Depth Experimental Analysis.
CoRR, 2024
CoRR, 2024
A Comprehensive Survey of Accelerated Generation Techniques in Large Language Models.
CoRR, 2024
Unveiling Hallucination in Text, Image, Video, and Audio Foundation Models: A Comprehensive Survey.
CoRR, 2024
CoRR, 2024
Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions.
CoRR, 2024
CoRR, 2024
Breaking Down the Defenses: A Comparative Survey of Attacks on Large Language Models.
CoRR, 2024
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models.
CoRR, 2024
A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications.
CoRR, 2024
The What, Why, and How of Context Length Extension Techniques in Large Language Models - A Detailed Survey.
CoRR, 2024
A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models.
CoRR, 2024
IEEE Access, 2024
A Comprehensive Survey of Hallucination in Large Language, Image, Video and Audio Foundation Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024
2023
CoRR, 2023
Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think - Introducing AI Detectability Index.
CoRR, 2023
CONFLATOR: Incorporating Switching Point based Rotatory Positional Encodings for Code-Mixed Language Modeling.
CoRR, 2023
Counter Turing Test (CT²): AI-Generated Text Detection is Not as Easy as You May Think - Introducing AI Detectability Index (ADI).
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
2021
iReason: Multimodal Commonsense Reasoning using Videos and Natural Language with Interpretability.
CoRR, 2021