Zaid Khan

Orcid: 0000-0003-0743-2992

Affiliations:
  • Northeastern University, Department of Electrical and Computer Engineering, Boston, MA, USA


According to our database1, Zaid Khan authored at least 17 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
OpenThoughts: Data Recipes for Reasoning Models.
CoRR, June, 2025

Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems.
CoRR, April, 2025

DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning.
CoRR, March, 2025

MutaGReP: Execution-Free Repository-Grounded Plan Search for Code-Use.
CoRR, February, 2025

Learning to Generate Unit Tests for Automated Debugging.
CoRR, February, 2025

DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Exploring Question Decomposition for Zero-Shot VQA.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Families in Wild Multimedia: A Multimodal Database for Recognizing Kinship.
IEEE Trans. Multim., 2022

Single-Stream Multi-level Alignment for Vision-Language Pretraining.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Exploiting BERT for Multimodal Target Sentiment Classification through Input Space Translation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

One Label, One Billion Faces: Usage and Consistency of Racial Categories in Computer Vision.
Proceedings of the FAccT '21: 2021 ACM Conference on Fairness, 2021

2020
Families In Wild Multimedia (FIW-MM): A Multi-Modal Database for Recognizing Kinship.
CoRR, 2020

Recognizing Families In the Wild (RFIW): The 4th Edition.
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020


  Loading...