Vasudev Lal

Orcid: 0000-0002-5907-9898

According to our database, Vasudev Lal authored at least 53 papers between 2021 and 2025.

Bibliography

2025
Probing the Representational Power of Sparse Autoencoders in Vision Models.
CoRR, August, 2025

DPO Learning with LLMs-Judge Signal for Computer Use Agents.
CoRR, June, 2025

Cultural Awareness in Vision-Language Models: A Cross-Country Exploration.
CoRR, May, 2025

Learning from Reasoning Failures via Synthetic Data Generation.
CoRR, April, 2025

Quantifying Interpretability in CLIP Models with Concept Consistency.
CoRR, March, 2025

Is Your Paper Being Reviewed by an LLM? A New Benchmark Dataset and Approach for Detecting AI Text in Peer Review.
CoRR, February, 2025

Semantic Specialization in MoE Appears with Scale: A Study of DeepSeek R1 Expert Specialization.
CoRR, February, 2025

LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Analyzing Hierarchical Structure in Vision Models with Sparse Autoencoders.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Training-Free Mitigation of Language Reasoning Degradation After Multimodal Instruction Tuning.
Proceedings of the 2025 AAAI Spring Symposium Series, 2025

2024
FiVL: A Framework for Improved Vision-Language Alignment.
CoRR, 2024

Causal World Representation in the GPT Model.
CoRR, 2024

Steering Large Language Models to Evaluate and Amplify Creativity.
CoRR, 2024

FastRM: An efficient and automatic explainability framework for multimodal generative models.
CoRR, 2024

Debias your Large Multi-Modal Model at Test-Time with Non-Contrastive Visual Attribute Steering.
CoRR, 2024

Distill-SynthKG: Distilling Knowledge Graph Synthesis Workflow for Improved Coverage and Efficiency.
CoRR, 2024

Debiasing Large Vision-Language Models by Ablating Protected Attribute Representations.
CoRR, 2024

Is Your Paper Being Reviewed by an LLM? Investigating AI Text Detectability in Peer Review.
CoRR, 2024

Quantifying and Enabling the Interpretability of CLIP-like Models.
CoRR, 2024

ClimDetect: A Benchmark Dataset for Climate Change Detection and Attribution.
CoRR, 2024

SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs.
CoRR, 2024

LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model.
CoRR, 2024

LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models.
Proceedings of the 3rd Explainable AI for Computer Vision (XAI4CV) Workshop, 2024

NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Why do LLaVA Vision-Language Models Reply to Images in English?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Getting it Right: Improving Spatial Consistency in Text-to-Image Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

ICSVR: Investigating Compositional and Syntactic Understanding in Video Retrieval Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SocialCounterfactuals: Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

L-MAGIC: Language Model Assisted Generation of Images with Coherence.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
MuMUR: Multilingual Multimodal Universal Retrieval.
Inf. Retr. J., June, 2023

Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples.
CoRR, 2023

LDM3D-VR: Latent Diffusion Model for 3D VR.
CoRR, 2023

Analyzing Zero-Shot Abilities of Vision-Language Models on Video Understanding Tasks.
CoRR, 2023

Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples.
CoRR, 2023

ICSVR: Investigating Compositional and Semantic Understanding in Video Retrieval Models.
CoRR, 2023

LDM3D: Latent Diffusion Model for 3D.
CoRR, 2023

Is multi-modal vision supervision beneficial to language?
CoRR, 2023

Brain encoding models based on multimodal transformers can transfer across language and vision.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Improving Video Retrieval Using Multilingual Knowledge Transfer.
Proceedings of the Advances in Information Retrieval, 2023

Is Multimodal Vision Supervision Beneficial to Language?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

BridgeTower: Building Bridges between Encoders in Vision-Language Representation Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Bridge-Tower: Building Bridges Between Encoders in Vision-Language Representation Learning.
CoRR, 2022

Opinion-based Relational Pivoting for Cross-domain Aspect Term Extraction.
Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, 2022

KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data Augmentation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Cross-Domain Aspect Extraction using Transformers Augmented with Knowledge Graphs.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Thrill-K Architecture: Towards a Solution to the Problem of Knowledge Based Understanding.
Proceedings of the Artificial General Intelligence - 15th International Conference, 2022

2021
InterpreT: An Interactive Visualization Tool for Interpreting Transformers.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

First Workshop on Knowledge Injection in Neural Networks (KINN).
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

