Yangguang Li

Orcid: 0000-0002-6090-3899

Affiliations:
  • VAST, Beijing, China


According to our database1, Yangguang Li authored at least 46 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models.
CoRR, June, 2025

Flow-GRPO: Training Flow Matching Models via Online RL.
CoRR, May, 2025

DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models.
CoRR, April, 2025

HoloPart: Generative 3D Part Amodal Segmentation.
CoRR, April, 2025

MeshCraft: Exploring Efficient and Controllable Mesh Generation with Flow-based DiTs.
CoRR, March, 2025

SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling.
CoRR, March, 2025

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models.
CoRR, February, 2025

DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

PSHuman: Photorealistic Single-image 3D Human Reconstruction using Cross-Scale Multiview Diffusion and Explicit Remeshing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
TEXGen: a Generative Diffusion Model for Mesh Textures.
ACM Trans. Graph., December, 2024

Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction.
CoRR, 2024

DetailGen3D: Generative 3D Geometry Enhancement via Data-Dependent Flow.
CoRR, 2024

TEXGen: a Generative Diffusion Model for Mesh Textures.
CoRR, 2024

DreamCraft3D++: Efficient Hierarchical 3D Generation with Multi-Plane Reconstruction Model.
CoRR, 2024

PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion.
CoRR, 2024

Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT.
CoRR, 2024

TripoSR: Fast 3D Object Reconstruction from a Single Image.
CoRR, 2024

Lumina-Next : Making Lumina-T2X Stronger and Faster with Next-DiT.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Text-to-3D with Classifier Score Distillation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

GVGEN: Text-to-3D Generation with Volumetric Representation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Exploring Text-to-Motion Generation with Human Preference.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Exploring Temporal Feature Correlation for Efficient and Stable Video Semantic Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Task-balanced distillation for object detection.
Pattern Recognit., May, 2023

UniG3D: A Unified 3D Object Generation Dataset.
CoRR, 2023

Mask Hierarchical Features For Self-Supervised Learning.
CoRR, 2023

Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception.
CoRR, 2023

Parallel Reasoning Network for Human-Object Interaction Detection.
CoRR, 2023

2022
BEVBert: Topo-Metric Map Pre-training for Language-guided Navigation.
CoRR, 2022

R<sup>2</sup>F: A General Retrieval, Reading and Fusion Framework for Document-level Natural Language Inference.
CoRR, 2022

1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022).
CoRR, 2022

Democratizing Contrastive Language-Image Pre-training: A CLIP Benchmark of Data, Model, and Supervision.
CoRR, 2022

SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples.
CoRR, 2022

A Mixture Of Surprises for Unsupervised Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

RePre: Improving Self-Supervised Vision Transformer with Reconstructive Pre-training.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm.
Proceedings of the Tenth International Conference on Learning Representations, 2022

R2F: A General Retrieval, Reading and Fusion Framework for Document-level Natural Language Inference.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies.
Proceedings of the Computer Vision - ECCV 2022, 2022

IMCI: Integrate Multi-view Contextual Information for Fact Extraction and Verification.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Neighbor Regularized Bayesian Optimization for Hyperparameter Optimization.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
INTERN: A New Learning Paradigm Towards General Vision.
CoRR, 2021


  Loading...