Yatian Pang

ORCID: 0000-0002-9714-9068

According to our database, Yatian Pang authored at least 18 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number of five.
Erdős number of four.

Bibliography

2026

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models.

IEEE Trans. Multim., 2026

Next Patch Prediction for AutoRegressive Visual Generation.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation.

CoRR, June, 2025

SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video.

CoRR, March, 2025

E-4DGS: High-Fidelity Dynamic Reconstruction from the Multi-view Event Cameras.

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Masked Autoencoders for 3D Point Cloud Self-supervised Learning.

World Sci. Annu. Rev. Artif. Intell., 2024

Next Patch Prediction for Autoregressive Visual Generation.

CoRR, 2024

VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation.

CoRR, 2024

DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses.

CoRR, 2024

Open-Sora Plan: Open-Source Large Video Generation Model.

CoRR, 2024

Envision3D: One Image to 3D with Anchor Views Interpolation.

CoRR, 2024

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Repaint123: Fast and High-Quality One Image to 3D Generation with Progressive Controllable Repainting.

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting.

CoRR, 2023

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment.

CoRR, 2023

2022

Masked Autoencoders for Point Cloud Self-supervised Learning.

Proceedings of the Computer Vision - ECCV 2022, 2022
