Yue Yang

Affiliations:

University of Pennsylvania, Philadelphia, PA, USA

According to our database¹, Yue Yang authored at least 21 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web.

[BibT_eX]

[DOI]

CoRR, April, 2026

MolmoPoint: Better Pointing for VLMs with Grounding Tokens.

[BibT_eX]

[DOI]

CoRR, March, 2026

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs.

[BibT_eX]

[DOI]

CoRR, March, 2026

Connecting the Dots: Surfacing Structure in Documents through AI-Generated Cross-Modal Links.

[BibT_eX]

[DOI]

CoRR, February, 2026

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding.

[BibT_eX]

[DOI]

CoRR, January, 2026

2025

HOLODECK 2.0: Vision-Language-Guided 3D World Generation with Editing.

[BibT_eX]

[DOI]

CoRR, August, 2025

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models.

[BibT_eX]

[DOI]

CoRR, 2024

A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

MiRAGeNews: Multimodal Realistic AI-Generated News Detection.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

CoMo: Controllable Motion Generation Through Language Guided Pose Code Editing.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Holodeck: Language Guided Generation of 3D Embodied AI Environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Interpretable-by-Design Text Classification with Iteratively Generated Concept Bottleneck.

[BibT_eX]

[DOI]

CoRR, 2023

Causal Reasoning of Entities and Events in Procedural Texts.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

Visualizing the Obvious: A Concreteness-based Ensemble Model for Noun Property Prediction.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Induce, Edit, Retrieve: Language Grounded Multimodal Schema for Instructional Video Retrieval.

[BibT_eX]

[DOI]

CoRR, 2021

Visual Goal-Step Inference using wikiHow.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Yue Yang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...