Zekun Qi

ORCID: 0009-0001-2554-5141

According to our database, Zekun Qi authored at least 22 papers between 2016 and 2026.

Bibliography

2026
Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining.
CoRR, April, 2026

Learning Athletic Humanoid Tennis Skills from Imperfect Human Motion Data.
CoRR, March, 2026

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model.
CoRR, February, 2026

ReWorld: Multi-Dimensional Reward Modeling for Embodied World Models.
CoRR, January, 2026

2025
Switch-JustDance: Benchmarking Whole Body Motion Tracking Policies Using a Commercial Console Game.
CoRR, November, 2025

Reasoning in Space via Grounding in the World.
CoRR, October, 2025

MM-Nav: Multi-View VLA Model for Robust Visual Navigation via Multi-Expert Learning.
CoRR, October, 2025

DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge.
CoRR, July, 2025

OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models.
CoRR, June, 2025

SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Positional Prompt Tuning for Efficient 3D Representation Learning.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Hybrid-Grained Feature Aggregation with Coarse-to-Fine Language Guidance for Self-Supervised Monocular Depth Estimation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

DexVLG: Dexterous Vision-Language-Grasp Model at Scale.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction.
CoRR, 2024

Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DreamLLM: Synergistic Multimodal Comprehension and Creation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ShapeLLM: Universal 3D Object Understanding for Embodied Interaction.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining.
Proceedings of the International Conference on Machine Learning, 2023

Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2016
Bidirectional transformation between BPMN and BPEL with graph grammar.
Comput. Electr. Eng., 2016

