Artem Zholus

Orcid: 0000-0003-3167-3585

According to our database1, Artem Zholus authored at least 22 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Reconstruction or Semantics? What Makes a Latent Space Useful for Robotic World Models.
CoRR, May, 2026

TAPNext++: What's Next for Tracking Any Point (TAP)?
CoRR, April, 2026

Hierarchical Planning with Latent World Models.
CoRR, April, 2026

TRecViT: A Recurrent Video Transformer.
Trans. Mach. Learn. Res., 2026

2025
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning.
CoRR, June, 2025

IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Tapnext: Tracking Any Point (Tap) as Next Token Prediction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation.
CoRR, 2024

Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent.
CoRR, 2024

Mastering Memory Tasks with World Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Transforming Human-Centered AI Collaboration: Redefining Embodied Agents Capabilities through Interactive Grounded Language Instructions.
CoRR, 2023

2022
Collecting Interactive Multi-modal Datasets for Grounded Language Understanding.
CoRR, 2022

Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions.
CoRR, 2022

IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents.
CoRR, 2022

IGLU 2022: Interactive Grounded Language Understanding in a Collaborative Environment at NeurIPS 2022.
CoRR, 2022

2021
Multitask Adaptation by Retrospective Exploration with Learned World Models.
CoRR, 2021

NeurIPS 2021 Competition IGLU: Interactive Grounded Language Understanding in a Collaborative Environment.
CoRR, 2021



Case-Based Task Generalization in Model-Based Reinforcement Learning.
Proceedings of the Artificial General Intelligence - 14th International Conference, 2021

2020
Continuous Histogram Loss: Beyond Neural Similarity.
CoRR, 2020


  Loading...