Shaofei Cai

Orcid: 0000-0002-1195-7276

According to our database, Shaofei Cai authored at least 33 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
CREATE: Cross-Layer Resilience Characterization and Optimization for Efficient yet Reliable Embodied AI Systems.
Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

Steering Visuomotor Policy in Open Worlds via Cross-View Goal Alignment.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents.
CoRR, December, 2025

CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent.
CoRR, October, 2025

UniCode: A Framework for Generating High Quality Competitive Coding Problems.
CoRR, October, 2025

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning.
CoRR, September, 2025

Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents.
CoRR, July, 2025

A Survey on Vision-Language-Action Models: An Action Tokenization Perspective.
CoRR, July, 2025

Toward Memory-Aided World Models: Benchmarking via Spatial Consistency.
CoRR, May, 2025

JARVIS-1: Open-World Multi-Task Agents With Memory-Augmented Multimodal Language Models.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2025

Open-World Skill Discovery from Unsegmented Demonstrations.
CoRR, March, 2025

ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment.
CoRR, March, 2025

Semantic and Correlation Disentangled Graph Convolutions for Multilabel Image Recognition.
IEEE Trans. Neural Networks Learn. Syst., January, 2025

GROOT-2: Weakly Supervised Multimodal Instruction Following Agents.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Open-World Skill Discovery from Unsegmented Demonstration Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Inductive State-Relabeling Adversarial Active Learning With Heuristic Clique Rescaling.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

MineStudio: A Streamlined Package for Minecraft AI Agent Development.
CoRR, 2024

GROOT-2: Weakly Supervised Multi-Modal Instruction Following Agents.
CoRR, 2024

Optimizing Latent Goal by Learning from Trajectory Preference.
CoRR, 2024

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting.
CoRR, 2024

OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

GROOT: Learning to Follow Instructions by Watching Gameplay Videos.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents.
CoRR, 2023

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editings.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Automatic Relation-aware Graph Network Proliferation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing.
CoRR, 2021

Edge-featured Graph Neural Architecture Search.
CoRR, 2021

Rethinking Graph Neural Network Search from Message-passing.
CoRR, 2021

Rethinking Graph Neural Architecture Search From Message-Passing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
