Zuyan Liu

Orcid: 0009-0002-6943-3085

According to our database¹, Zuyan Liu authored at least 21 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Bibliography

2026

ProtoComp++: Diverse Point Cloud Completion With Controllable Prototype.

IEEE Trans. Pattern Anal. Mach. Intell., July, 2026

Point2Seq: Quantized Serialization Encoding for Object Point Cloud Pretraining.

Int. J. Comput. Vis., June, 2026

GEM: Generative Supervision Helps Embodied Intelligence.

CoRR, May, 2026

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents.

CoRR, April, 2026

PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning.

CoRR, March, 2026

Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models.

CoRR, March, 2026

Efficient High-Order Spatial Interactions for Visual Perception.

IEEE Trans. Pattern Anal. Mach. Intell., January, 2026

2025

GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization.

CoRR, November, 2025

Vision Generalist Model: A Survey.

Int. J. Comput. Vis., October, 2025

RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark.

CoRR, September, 2025

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment.

CoRR, February, 2025

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models.

CoRR, 2024

Efficient Inference of Vision Instruction-Following Models with Elastic Cache.

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Dynamic Spatial Sparsification for Efficient Vision Transformers and Convolutional Neural Networks.

IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation.

CoRR, 2023

Unleashing Text-to-Image Diffusion Models for Visual Perception.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DiffSwap: High-Fidelity and Controllable Face Swapping via 3D-Aware Masked Diffusion.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2021

PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
