Zuyan Liu

Orcid: 0009-0002-6943-3085

According to our database1, Zuyan Liu authored at least 21 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
ProtoComp++: Diverse Point Cloud Completion With Controllable Prototype.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2026

Point2Seq: Quantized Serialization Encoding for Object Point Cloud Pretraining.
Int. J. Comput. Vis., June, 2026

GEM: Generative Supervision Helps Embodied Intelligence.
CoRR, May, 2026

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents.
CoRR, April, 2026

PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning.
CoRR, March, 2026

Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models.
CoRR, March, 2026

Efficient High-Order Spatial Interactions for Visual Perception.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2026

2025
GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization.
CoRR, November, 2025

Vision Generalist Model: A Survey.
Int. J. Comput. Vis., October, 2025

RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark.
CoRR, September, 2025

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment.
CoRR, February, 2025

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models.
CoRR, 2024

Efficient Inference of Vision Instruction-Following Models with Elastic Cache.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Dynamic Spatial Sparsification for Efficient Vision Transformers and Convolutional Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation.
CoRR, 2023

Unleashing Text-to-Image Diffusion Models for Visual Perception.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DiffSwap: High-Fidelity and Controllable Face Swapping via 3D-Aware Masked Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2021
PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021


  Loading...