Vasu Sharma

Orcid: 0009-0006-4348-7412

According to our database1, Vasu Sharma authored at least 56 papers between 2015 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Universal Neurons in GPT-2: Emergence, Persistence, and Functional Impact.
CoRR, August, 2025

The Geometry of Harmfulness in LLMs through Subconcept Probing.
CoRR, July, 2025

Rewrite-to-Rank: Optimizing Ad Visibility via Retrieval-Aware Text Rewriting.
CoRR, July, 2025

Causal Language Control in Multilingual Transformers via Sparse Feature Steering.
CoRR, July, 2025

COREVQA: A Crowd Observation and Reasoning Entailment Visual Question Answering Benchmark.
CoRR, July, 2025

Understanding Trade offs When Conditioning Synthetic Data.
CoRR, July, 2025

Peccavi: Visual Paraphrase Attack Safe and Distortion Free Image Watermarking Technique for AI-Generated Images.
CoRR, June, 2025

Seamless Interaction: Dyadic Audiovisual Motion Modeling and Large-Scale Dataset.
CoRR, June, 2025

Alignment Quality Index (AQI) : Beyond Refusals: AQI as an Intrinsic Alignment Diagnostic via Latent Geometry, Cluster Divergence, and Layer wise Pooled Representations.
CoRR, June, 2025

AdversariaL attacK sAfety aLIgnment(ALKALI): Safeguarding LLMs through GRACE: Geometric Representation-Aware Contrastive Enhancement- Introducing Adversarial Vulnerability Quality Index (AVQI).
CoRR, June, 2025

Pruning for Performance: Efficient Idiom and Metaphor Classification in Low-Resource Konkani Using mBERT.
CoRR, June, 2025

NovelHopQA: Diagnosing Multi-Hop Reasoning Failures in Long Narrative Contexts.
CoRR, June, 2025

Sarc7: Evaluating Sarcasm Detection and Generation with Seven Types and Emotion-Informed Techniques.
CoRR, June, 2025

From Directions to Cones: Exploring Multidimensional Representations of Propositional Facts in LLMs.
CoRR, May, 2025

Distill CLIP (DCLIP): Enhancing Image-Text Retrieval via Cross-Modal Transformer Distillation.
CoRR, May, 2025

Advancing Uto-Aztecan Language Technologies: A Case Study on the Endangered Comanche Language.
CoRR, May, 2025

Deconstructing Bias: A Multifaceted Framework for Diagnosing Cultural and Compositional Inequities in Text-to-Image Generative Models.
CoRR, May, 2025

Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning.
CoRR, May, 2025

Incorporating a Deep Neural Network into Moving Horizon Estimation for Embedded Thermal Torque Derating of an Electric Machine.
CoRR, April, 2025

TRUTH DECAY: Quantifying Multi-Turn Sycophancy in Language Models.
CoRR, March, 2025

Pause-Tuning for Long-Context Comprehension: A Lightweight Approach to LLM Attention Recalibration.
CoRR, February, 2025

Safe Reinforcement Learning-based Control for Hydrogen Diesel Dual-Fuel Engines.
CoRR, February, 2025

YINYANG-ALIGN: Benchmarking Contradictory Objectives and Proposing Multi-Objective Optimization based DPO for Text-to-Image Alignment.
CoRR, February, 2025

Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

YinYang-Align: A new Benchmark for Competing Objectives and Introducing Multi-Objective Preference based Text-to-Image Alignment.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
DINOv2: Learning Robust Visual Features without Supervision.
Trans. Mach. Learn. Res., 2024

The Brittleness of AI-Generated Image Watermarking Techniques: Examining Their Robustness Against Visual Paraphrasing Attacks.
CoRR, 2024

An Introduction to Vision-Language Modeling.
CoRR, 2024

Text Quality-Based Pruning for Efficient Training of Language Models.
CoRR, 2024

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM.
CoRR, 2024

ε-ViLM : Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2024

Demystifying CLIP Data.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
E-ViLM: Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer.
CoRR, 2023

Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning.
CoRR, 2023

Alexa, play with robot: Introducing the First Alexa Prize SimBot Challenge on Embodied AI.
CoRR, 2023

Alexa Arena: A User-Centric Interactive Platform for Embodied AI.
CoRR, 2023

Alexa Arena: A User-Centric Interactive Platform for Embodied AI.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MAViL: Masked Audio-Video Learners.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Shimmy: Accelerating Inter-Container Communication for the IoT Edge.
Proceedings of the IEEE Global Communications Conference, 2023

Flap: Fast Language-Audio Pre-Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
CH-MARL: A Multimodal Benchmark for Cooperative, Heterogeneous Multi-Agent Reinforcement Learning.
CoRR, 2022

PISA: PoIncaré Saliency-Aware Interpolative Augmentation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Tweet Based Reach Aware Temporal Attention Network for NFT Valuation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2019
Induced Attention Invariance: Defending VQA Models against Adversarial Attacks.
Proceedings of the Visually Grounded Interaction and Language (ViGIL), 2019

Multimodal Behavioral Markers Exploring Suicidal Intent in Social Media Videos.
Proceedings of the International Conference on Multimodal Interaction, 2019

Community Regularization of Visually-Grounded Dialog.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018
Mind Your Language: Learning Visually Grounded Dialog in a Multi-Agent Setting.
CoRR, 2018

Cyclegen: Cyclic consistency based product review generator from attributes.
Proceedings of the 11th International Conference on Natural Language Generation, 2018

BioAMA: Towards an End to End BioMedical Question Answering System.
Proceedings of the BioNLP 2018 workshop, Melbourne, Australia, July 19, 2018, 2018

2017
Segmentation Guided Attention Networks for Visual Question Answering.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Automatic tagging and retrieval of E-Commerce products based on visual features.
Proceedings of the Student Research Workshop, 2016

2015
Analyzing Newspaper Crime Reports for Identification of Safe Transit Paths.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Image summarization using topic modelling.
Proceedings of the 2015 IEEE International Conference on Signal and Image Processing Applications, 2015

A Deep Neural Network based approach for vocal extraction from songs.
Proceedings of the 2015 IEEE International Conference on Signal and Image Processing Applications, 2015


  Loading...