Yatian Pang

Orcid: 0000-0002-9714-9068

According to our database1, Yatian Pang authored at least 18 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models.
IEEE Trans. Multim., 2026

Next Patch Prediction for AutoRegressive Visual Generation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation.
CoRR, June, 2025

SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video.
CoRR, March, 2025

E-4DGS: High-Fidelity Dynamic Reconstruction from the Multi-view Event Cameras.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Masked Autoencoders for 3D Point Cloud Self-supervised Learning.
World Sci. Annu. Rev. Artif. Intell., 2024

Next Patch Prediction for Autoregressive Visual Generation.
CoRR, 2024

VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation.
CoRR, 2024

DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses.
CoRR, 2024

Open-Sora Plan: Open-Source Large Video Generation Model.
CoRR, 2024

Envision3D: One Image to 3D with Anchor Views Interpolation.
CoRR, 2024

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Repaint123: Fast and High-Quality One Image to 3D Generation with Progressive Controllable Repainting.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting.
CoRR, 2023

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment.
CoRR, 2023

2022
Masked Autoencoders for Point Cloud Self-supervised Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022


  Loading...