Yuang Peng

Orcid: 0000-0002-1448-5489

According to our database1, Yuang Peng authored at least 17 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale.
CoRR, August, 2025

Automated HEMT Model Construction from Datasheets via Multi-Modal Intelligence and Prior-Knowledge-Free Optimization.
CoRR, July, 2025

Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning.
CoRR, July, 2025

DGAE: Diffusion-Guided Autoencoder for Efficient Latent Representation Learning.
CoRR, June, 2025

Step1X-Edit: A Practical Framework for General Image Editing.
CoRR, April, 2025

Perception-R1: Pioneering Perception Policy with Reinforcement Learning.
CoRR, April, 2025

Perception in Reflection.
CoRR, April, 2025

Taming Teacher Forcing for Masked Autoregressive Video Generation.
CoRR, January, 2025

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Taming Teacher Forcing for Masked Autoregressive Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Modeling Uncertainty in Composed Image Retrieval via Probabilistic Embeddings.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Exploring Recurrent Long-Term Temporal Fusion for Multi-View 3D Perception.
IEEE Robotics Autom. Lett., July, 2024

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model.
CoRR, 2024

ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

GladCoder: Stylized QR Code Generation with Grayscale-Aware Denoising Process.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

DreamLLM: Synergistic Multimodal Comprehension and Creation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

WaterDiff: Perceptual Image Watermarks Via Diffusion Model.
Proceedings of the IEEE International Conference on Acoustics, 2024


  Loading...