Lutao Jiang

Orcid: 0000-0002-1775-2765

According to our database1, Lutao Jiang authored at least 26 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
SAP: Segment Any 4K Panorama.
CoRR, March, 2026

EgoIntent: An Egocentric Step-level Benchmark for Understanding What, Why, and Next.
CoRR, March, 2026

StruVis: Enhancing Reasoning-based Text-to-Image Generation via Thinking with Structured Vision.
CoRR, March, 2026

BrightDreamer: Generic 3D Gaussian Generative Framework for Fast Text-to-3D Synthesis.
Trans. Mach. Learn. Res., 2026

T-Rex-Omni: Integrating Negative Visual Prompt in Generic Object Detection.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement.
CoRR, December, 2025

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks.
CoRR, October, 2025

PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs.
CoRR, October, 2025

Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods.
CoRR, October, 2025

Understanding-in-Generation: Reinforcing Generative Capability of Unified Model via Infusing Understanding into Generation.
CoRR, September, 2025

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era.
CoRR, September, 2025

MLLMs are Deeply Affected by Modality Bias.
CoRR, May, 2025

DiMeR: Disentangled Mesh Reconstruction Model.
CoRR, April, 2025

Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook.
CoRR, March, 2025

RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Reducing Unimodal Bias in Multi-Modal Semantic Segmentation With Multi-Scale Functional Entropy Regularization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Sat2City: 3D City Generation from a Single Satellite Image with Cascaded Latent Diffusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
Correction: PIDray: A Large-Scale X-ray Benchmark for Real-World Prohibited Item Detection.
Int. J. Comput. Vis., January, 2024

MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection.
CoRR, 2024

Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation.
CoRR, 2024

G-NeLF: Memory- and Data-Efficient Hybrid Neural Light Field for Novel View Synthesis.
CoRR, 2024

A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness.
CoRR, 2024

BrightDreamer: Generic 3D Gaussian Generative Framework for Fast Text-to-3D Synthesis.
CoRR, 2024

A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

2023
PIDray: A Large-Scale X-ray Benchmark for Real-World Prohibited Item Detection.
Int. J. Comput. Vis., December, 2023

SDF-3DGAN: A 3D Object Generative Method Based on Implicit Signed Distance Function.
CoRR, 2023


  Loading...