Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

PSHuman: Photorealistic Single-image 3D Human Reconstruction using Cross-Scale Multiview Diffusion and Explicit Remeshing.

Peng Li

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

TEXGen: a Generative Diffusion Model for Mesh Textures.

ACM Trans. Graph., December, 2024

Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline.

IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction.

CoRR, 2024

DetailGen3D: Generative 3D Geometry Enhancement via Data-Dependent Flow.

CoRR, 2024

TEXGen: a Generative Diffusion Model for Mesh Textures.

CoRR, 2024

DreamCraft3D++: Efficient Hierarchical 3D Generation with Multi-Plane Reconstruction Model.

CoRR, 2024

PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion.

CoRR, 2024

Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT.

CoRR, 2024

TripoSR: Fast 3D Object Reconstruction from a Single Image.

CoRR, 2024

Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Text-to-3D with Classifier Score Distillation.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation.

Proceedings of the Computer Vision - ECCV 2024, 2024

GVGEN: Text-to-3D Generation with Volumetric Representation.

Proceedings of the Computer Vision - ECCV 2024, 2024

Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Exploring Text-to-Motion Generation with Human Preference.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Exploring Temporal Feature Correlation for Efficient and Stable Video Semantic Segmentation.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Task-balanced distillation for object detection.

Pattern Recognit., May, 2023

UniG3D: A Unified 3D Object Generation Dataset.

CoRR, 2023

Mask Hierarchical Features For Self-Supervised Learning.

CoRR, 2023

Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception.

CoRR, 2023

Parallel Reasoning Network for Human-Object Interaction Detection.

CoRR, 2023

2022

BEVBert: Topo-Metric Map Pre-training for Language-guided Navigation.

CoRR, 2022

R²F: A General Retrieval, Reading and Fusion Framework for Document-level Natural Language Inference.

CoRR, 2022

1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022).

CoRR, 2022

Democratizing Contrastive Language-Image Pre-training: A CLIP Benchmark of Data, Model, and Supervision.

CoRR, 2022

SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples.

CoRR, 2022

A Mixture Of Surprises for Unsupervised Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

RePre: Improving Self-Supervised Vision Transformer with Reconstructive Pre-training.

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm.

Proceedings of the Tenth International Conference on Learning Representations, 2022

R2F: A General Retrieval, Reading and Fusion Framework for Document-level Natural Language Inference.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies.

Proceedings of the Computer Vision - ECCV 2022, 2022

IMCI: Integrate Multi-view Contextual Information for Fact Extraction and Verification.

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Neighbor Regularized Bayesian Optimization for Hyperparameter Optimization.

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021

INTERN: A New Learning Paradigm Towards General Vision.

CoRR, 2021

2017

Depth map super-resolution via low-resolution depth guided joint trilateral up-sampling.

J. Vis. Commun. Image Represent., 2017

Yangguang Li
