Yizhi Song

Orcid: 0000-0002-3005-382X

According to our database¹, Yizhi Song authored at least 19 papers between 2017 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2026

GMDM-MoE: A biologically-inspired growth-to-morphology and dual-magnification mixture-of-experts for bacterial detection.

[BibT_eX]

[DOI]

Biomed. Signal Process. Control., 2026

Bacterial perception-enhanced detection transformer in time-lapse images.

[BibT_eX]

[DOI]

Biomed. Signal Process. Control., 2026

2025

Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models.

[BibT_eX]

[DOI]

CoRR, October, 2025

DuetUI: A Bidirectional Context Loop for Human-Agent Co-Generation of Task-Oriented Interfaces.

[BibT_eX]

[DOI]

CoRR, September, 2025

Advancing Multimodal LLMs by Large-Scale 3D Visual Instruction Dataset Generation.

[BibT_eX]

[DOI]

CoRR, July, 2025

MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models.

[BibT_eX]

[DOI]

CoRR, May, 2025

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting.

[BibT_eX]

[DOI]

CoRR, April, 2025

Generative AI for Cel-Animation: A Survey.

[BibT_eX]

[DOI]

CoRR, January, 2025

TrCL-AGS: A Universal Sequential Triple-Stage Contrastive Learning Framework for Bacterial Detection With Across-Growth-Stage Information.

[BibT_eX]

[DOI]

IEEE Internet Things J., 2025

Refine-by-Align: Reference-Guided Artifacts Refinement through Semantic Alignment.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

GroundingBooth: Grounding Text-to-Image Customization.

[BibT_eX]

[DOI]

CoRR, 2024

Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation.

[BibT_eX]

[DOI]

CoRR, 2024

Nested and Interleaved Ticketing for Multiple Travelers.

[BibT_eX]

[DOI]

Dongyu Lv

Yizhi Song

Chao Xu

Proceedings of the Frontiers of Algorithmics - 18th International Joint Conference, 2024

Thinking Outside the BBox: Unconstrained Generative Object Compositing.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

ObjectStitch: Object Compositing with Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

ObjectStitch: Generative Object Compositing.

[BibT_eX]

[DOI]

CoRR, 2022

2019

A three-stage real-time detector for traffic signs in large panoramas.

[BibT_eX]

[DOI]

Comput. Vis. Media, 2019

2017

Properties on n-dimensional convolution for image deconvolution.

[BibT_eX]

[DOI]

CoRR, 2017

Yizhi Song

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...