Winson Han

According to our database1, Winson Han authored at least 24 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
WildDet3D: Scaling Promptable 3D Detection in the Wild.
CoRR, April, 2026

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web.
CoRR, April, 2026

MolmoPoint: Better Pointing for VLMs with Grounding Tokens.
CoRR, March, 2026

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs.
CoRR, March, 2026

MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation.
CoRR, March, 2026

Synthetic Visual Genome 2: Extracting Large-scale Spatio-Temporal Scene Graphs from Videos.
CoRR, February, 2026

MolmoSpaces: A Large-Scale Open Ecosystem for Robot Navigation and Manipulation.
CoRR, February, 2026

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding.
CoRR, January, 2026

2025
Visual Representations inside the Language Model.
CoRR, October, 2025

MolmoAct: Action Reasoning Models that can Reason in Space.
CoRR, August, 2025

GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation.
CoRR, May, 2025

2024
The One RING: a Robotic Indoor Navigation Generalist.
CoRR, 2024

Holodeck: Language Guided Generation of 3D Embodied AI Environments.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World.
CoRR, 2023

EXCALIBUR: Encouraging and Evaluating Embodied Exploration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Retrospectives on the Embodied AI Workshop.
CoRR, 2022

ProcTHOR: Large-Scale Embodied AI Using Procedural Generation.
CoRR, 2022

🏘️ ProcTHOR: Large-Scale Embodied AI Using Procedural Generation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Learning Generalizable Visual Representations via Interactive Gameplay.
Proceedings of the 9th International Conference on Learning Representations, 2021

ManipulaTHOR: A Framework for Visual Object Manipulation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

RoboTHOR: An Open Simulation-to-Real Embodied AI Platform.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Artificial Agents Learn Flexible Visual Representations by Playing a Hiding Game.
CoRR, 2019


  Loading...