We stand with Ukraine

Vasudev Lal

Orcid: 0000-0002-5907-9898

According to our database¹, Vasudev Lal authored at least 54 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2025

Probing the Representational Power of Sparse Autoencoders in Vision Models.

Matthew Lyle Olson

,

,

,

,

,

,

CoRR, August, 2025

DPO Learning with LLMs-Judge Signal for Computer Use Agents.

,

,

,

Shachar Rosenman

,

,

,

CoRR, June, 2025

Cultural Awareness in Vision-Language Models: A Cross-Country Exploration.

,

,

CoRR, May, 2025

Learning from Reasoning Failures via Synthetic Data Generation.

Gabriela Ben Melech Stan

,

,

,

,

CoRR, April, 2025

Quantifying Interpretability in CLIP Models with Concept Consistency.

,

,

CoRR, March, 2025

Is Your Paper Being Reviewed by an LLM? A New Benchmark Dataset and Approach for Detecting AI Text in Peer Review.

,

,

,

,

CoRR, February, 2025

Semantic Specialization in MoE Appears with Scale: A Study of DeepSeek R1 Expert Specialization.

Matthew Lyle Olson

,

,

,

,

,

,

CoRR, February, 2025

LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression.

,

Anahita Bhiwandiwalla

,

,

,

,

Sharath Nittur Sridhar

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

A Causal World Model Underlying Next Token Prediction: Exploring GPT in a Controlled Environment.

Raanan Yehezkel Rohekar

,

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs.

,

,

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Analyzing Hierarchical Structure in Vision Models with Sparse Autoencoders.

Matthew Lyle Olson

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Training-Free Mitigation of Language Reasoning Degradation After Multimodal Instruction Tuning.

,

,

,

,

Proceedings of the 2025 AAAI Spring Symposium Series, 2025

2024

FiVL: A Framework for Improved Vision-Language Alignment.

,

Gabriela Ben Melech Stan

,

,

,

Shachar Rosenman

,

,

,

CoRR, 2024

Causal World Representation in the GPT Model.

Raanan Y. Rohekar

,

,

,

CoRR, 2024

Steering Large Language Models to Evaluate and Amplify Creativity.

Matthew Lyle Olson

,

,

,

,

CoRR, 2024

FastRM: An efficient and automatic explainability framework for multimodal generative models.

Gabriela Ben Melech Stan

,

,

,

Shachar Rosenman

,

,

,

,

CoRR, 2024

Debias your Large Multi-Modal Model at Test-Time with Non-Contrastive Visual Attribute Steering.

,

Matthew Lyle Olson

,

,

,

,

,

CoRR, 2024

Distill-SynthKG: Distilling Knowledge Graph Synthesis Workflow for Improved Coverage and Efficiency.

Prafulla Kumar Choubey

,

,

,

,

,

,

Shachar Rosenman

,

,

,

Ricky Ho Yin Chan

,

,

CoRR, 2024

Debiasing Large Vision-Language Models by Ablating Protected Attribute Representations.

,

Matthew Lyle Olson

,

,

,

,

CoRR, 2024

Is Your Paper Being Reviewed by an LLM? Investigating AI Text Detectability in Peer Review.

,

,

,

,

CoRR, 2024

Quantifying and Enabling the Interpretability of CLIP-like Models.

,

Yossi Gandelsman

,

,

CoRR, 2024

ClimDetect: A Benchmark Dataset for Climate Change Detection and Attribution.

,

,

Anahita Bhiwandiwalla

,

,

Matthew Lyle Olson

,

,

CoRR, 2024

LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model.

,

Matthew L. Olson

,

,

,

CoRR, 2024

LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models.

Gabriela Ben Melech Stan

,

,

Raanan Yehezkel Rohekar

,

Anahita Bhiwandiwalla

,

,

Matthew Lyle Olson

,

,

,

,

Proceedings of the 3rd Explainable AI for Computer Vision (XAI4CV) Workshop, 2024

NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge.

,

,

,

,

,

Swabha Swayamdipta

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Why do LLaVA Vision-Language Models Reply to Images in English?

,

Carolin Holtermann

,

Matthew L. Olson

,

Florian Schneider

,

,

Anahita Bhiwandiwalla

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Getting it Right: Improving Spatial Consistency in Text-to-Image Models.

Agneet Chatterjee

,

Gabriela Ben Melech Stan

,

,

,

,

,

,

Hannaneh Hajishirzi

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation.

Shachar Rosenman

,

,

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

ICSVR: Investigating Compositional and Syntactic Understanding in Video Retrieval Models.

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SocialCounterfactuals: Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples.

,

,

,

Gustavo A. Lujan-Moreno

,

Anahita Bhiwandiwalla

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

L-MAGIC: Language Model Assisted Generation of Images with Coherence.

,

Matthias Mueller

,

,

,

,

,

Gabriela Ben Melech Stan

,

,

Michael Paulitsch

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

MuMUR: Multilingual Multimodal Universal Retrieval.

,

,

Gabriela Ben Melech Stan

,

Shachar Rosenman

,

,

Gedas Bertasius

,

Inf. Retr. J., June, 2023

Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples.

,

,

,

Gustavo A. Lujan-Moreno

,

Anahita Bhiwandiwalla

,

CoRR, 2023

LDM3D-VR: Latent Diffusion Model for 3D VR.

Gabriela Ben Melech Stan

,

,

,

,

,

Michael Paulitsch

,

CoRR, 2023

Analyzing Zero-Shot Abilities of Vision-Language Models on Video Understanding Tasks.

,

Anahita Bhiwandiwalla

,

CoRR, 2023

Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples.

,

,

,

Gustavo A. Lujan-Moreno

,

CoRR, 2023

ICSVR: Investigating Compositional and Semantic Understanding in Video Retrieval Models.

,

CoRR, 2023

LDM3D: Latent Diffusion Model for 3D.

Gabriela Ben Melech Stan

,

,

,

,

,

,

,

,

,

Matthias Müller

,

CoRR, 2023

Is multi-modal vision supervision beneficial to language?

,

CoRR, 2023

Brain encoding models based on multimodal transformers can transfer across language and vision.

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs.

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Improving Video Retrieval Using Multilingual Knowledge Transfer.

,

,

Gabriela Ben Melech Stan

,

,

Gedas Bertasius

,

Proceedings of the Advances in Information Retrieval, 2023

Is Multimodal Vision Supervision Beneficial to Language?

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning.

,

,

,

,

Anahita Bhiwandiwalla

,

Shachar Rosenman

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

BridgeTower: Building Bridges between Encoders in Vision-Language Representation Learning.

,

,

Shachar Rosenman

,

,

,

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Bridge-Tower: Building Bridges Between Encoders in Vision-Language Representation Learning.

,

,

Shachar Rosenman

,

,

CoRR, 2022

Opinion-based Relational Pivoting for Cross-domain Aspect Term Extraction.

,

,

,

,

Moshe Wasserblat

,

Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, 2022

KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation.

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data Augmentation.

,

,

,

,

Swabha Swayamdipta

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers.

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Cross-Domain Aspect Extraction using Transformers Augmented with Knowledge Graphs.

,

,

,

Ana Paula Simões

,

,

,

Moshe Wasserblat

,

Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Thrill-K Architecture: Towards a Solution to the Problem of Knowledge Based Understanding.

,

,

Tetiana Grinberg

,

,

Phillip Ryan Howard

,

,

Proceedings of the Artificial General Intelligence - 15th International Conference, 2022

2021

InterpreT: An Interactive Visualization Tool for Interpreting Transformers.

,

,

,

,

Ana Paula Simões

,

,

,

,

Moshe Wasserblat

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

First Workshop on Knowledge Injection in Neural Networks (KINN).

,

,

,

Pasquale Minervini

,

Sandya Mannarswamy

Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021
