David Fan

Orcid: 0000-0002-9217-5451

Affiliations:

Meta Fundamental AI Research (FAIR), New York, NY, USA
Amazon Prime Video, Seattle, WA, USA (former)
Princeton University, Vision and Learning Lab, Princeton, NJ, USA (former)

According to our database¹, David Fan authored at least 18 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

Beyond Language Modeling: An Exploration of Multimodal Pretraining.

[BibT_eX]

[DOI]

CoRR, March, 2026

A Lightweight Library for Energy-Based Joint-Embedding Predictive Architectures.

[BibT_eX]

[DOI]

CoRR, February, 2026

2025

World Models Can Leverage Human Videos for Dexterous Manipulation.

[BibT_eX]

[DOI]

Raktim Gautam Goswami

Prashanth Krishnamurthy

Michael Rabbat

Farshad Khorrami

Yann LeCun

CoRR, December, 2025

Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training.

[BibT_eX]

[DOI]

CoRR, September, 2025

V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning.

[BibT_eX]

[DOI]

CoRR, June, 2025

GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-Grained Video-Language Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Now you see Me: Context-Aware Automatic Audio Description.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

MetaMorph: Multimodal Understanding and Generation via Instruction Tuning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Scaling Language-Free Visual Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024

NowYouSee Me: Context-Aware Automatic Audio Description.

[BibT_eX]

[DOI]

CoRR, 2024

Video Token Merging for Long-form Video Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

Video Token Merging for Long Video Understanding.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Text-Guided Video Masked Autoencoder.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Nearest-Neighbor Inter-Intra Contrastive Learning from Unlabeled Videos.

[BibT_eX]

[DOI]

CoRR, 2023

MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation.

[BibT_eX]

[DOI]

Hector J. Santos-Villalobos

Vimal Bhat

Rohith MV

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Motion-Guided Masking for Spatiotemporal Representation Learning.

[BibT_eX]

[DOI]

Hector J. Santos-Villalobos

Rohith MV

Xinyu Li

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2021

Shot Contrastive Self-Supervised Learning for Scene Boundary Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

OASIS: A Large-Scale Dataset for Single Image 3D in the Wild.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

David Fan

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...