Yipeng Zhang

Orcid: 0009-0002-0886-8296

Affiliations:
  • Tsinghua University, Beijing, China


According to our database1, Yipeng Zhang authored at least 13 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Cross-Scale Collaboration between LLMs and Lightweight Sequential Recommenders with Domain-Specific Latent Reasoning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
ScenarioDiff: Text-to-video Generation with Dynamic Transformations of Scene Conditions.
Int. J. Comput. Vis., July, 2025

VideoDreamer: Customized Multi-Subject Text-to-Video Generation With Disen-Mix Finetuning on Language-Video Foundation Models.
IEEE Trans. Multim., 2025

ModuleTeam: Open-Set Multi-Conditional Image Generation with Training-Free Latent Mixture of Any Control Module.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
DisenDreamer: Subject-Driven Text-to-Image Generation With Sample-Aware Disentangled Tuning.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyond.
CoRR, 2024

DisenStudio: Customized Multi-Subject Text-to-Video Generation with Disentangled Spatial Control.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Large Language Model with Curriculum Reasoning for Visual Concept Recognition.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning.
CoRR, 2023

DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation.
CoRR, 2023

Adaptive Disentangled Transformer for Sequential Recommendation.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023


  Loading...