Yizhi Song

Orcid: 0000-0003-0525-0415

According to our database, Yizhi Song authored at least 15 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Advancing Multimodal LLMs by Large-Scale 3D Visual Instruction Dataset Generation.
CoRR, July, 2025

MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models.
CoRR, May, 2025

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting.
CoRR, April, 2025

Generative AI for Cel-Animation: A Survey.
CoRR, January, 2025

TrCL-AGS: A Universal Sequential Triple-Stage Contrastive Learning Framework for Bacterial Detection With Across-Growth-Stage Information.
IEEE Internet Things J., 2025

Refine-by-Align: Reference-Guided Artifacts Refinement through Semantic Alignment.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
GroundingBooth: Grounding Text-to-Image Customization.
CoRR, 2024

Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation.
CoRR, 2024

Nested and Interleaved Ticketing for Multiple Travelers.
Proceedings of the Frontiers of Algorithmics - 18th International Joint Conference, 2024

Thinking Outside the BBox: Unconstrained Generative Object Compositing.
Proceedings of the Computer Vision - ECCV 2024, 2024

IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
ObjectStitch: Object Compositing with Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
ObjectStitch: Generative Object Compositing.
CoRR, 2022

2019
A three-stage real-time detector for traffic signs in large panoramas.
Comput. Vis. Media, 2019

2017
Properties on n-dimensional convolution for image deconvolution.
CoRR, 2017

