Shaofei Cai

Orcid: 0000-0002-1195-7276

According to our database¹, Shaofei Cai authored at least 33 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

CREATE: Cross-Layer Resilience Characterization and Optimization for Efficient yet Reliable Embodied AI Systems.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

Steering Visuomotor Policy in Open Worlds via Cross-View Goal Alignment.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents.

[BibT_eX]

[DOI]

CoRR, December, 2025

CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent.

[BibT_eX]

[DOI]

CoRR, October, 2025

UniCode: A Framework for Generating High Quality Competitive Coding Problems.

[BibT_eX]

[DOI]

CoRR, October, 2025

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, September, 2025

Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents.

[BibT_eX]

[DOI]

CoRR, July, 2025

A Survey on Vision-Language-Action Models: An Action Tokenization Perspective.

[BibT_eX]

[DOI]

CoRR, July, 2025

Toward Memory-Aided World Models: Benchmarking via Spatial Consistency.

[BibT_eX]

[DOI]

CoRR, May, 2025

JARVIS-1: Open-World Multi-Task Agents With Memory-Augmented Multimodal Language Models.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., March, 2025

Open-World Skill Discovery from Unsegmented Demonstrations.

[BibT_eX]

[DOI]

CoRR, March, 2025

ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment.

[BibT_eX]

[DOI]

CoRR, March, 2025

Semantic and Correlation Disentangled Graph Convolutions for Multilabel Image Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., January, 2025

GROOT-2: Weakly Supervised Multimodal Instruction Following Agents.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Open-World Skill Discovery from Unsegmented Demonstration Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Inductive State-Relabeling Adversarial Active Learning With Heuristic Clique Rescaling.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

MineStudio: A Streamlined Package for Minecraft AI Agent Development.

[BibT_eX]

[DOI]

CoRR, 2024

GROOT-2: Weakly Supervised Multi-Modal Instruction Following Agents.

[BibT_eX]

[DOI]

CoRR, 2024

Optimizing Latent Goal by Learning from Trajectory Preference.

[BibT_eX]

[DOI]

CoRR, 2024

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting.

[BibT_eX]

[DOI]

CoRR, 2024

OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

GROOT: Learning to Follow Instructions by Watching Gameplay Videos.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents.

[BibT_eX]

[DOI]

CoRR, 2023

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editings.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Automatic Relation-aware Graph Network Proliferation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing.

[BibT_eX]

[DOI]

CoRR, 2021

Edge-featured Graph Neural Architecture Search.

[BibT_eX]

[DOI]

CoRR, 2021

Rethinking Graph Neural Network Search from Message-passing.

[BibT_eX]

[DOI]

CoRR, 2021

Rethinking Graph Neural Architecture Search From Message-Passing.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Shaofei Cai

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...