Summary of Mastering Text-to-image Diffusion: Recaptioning, Planning, and Generating with Multimodal Llms, by Ling Yang et al.
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMsby Ling Yang, Zhaochen Yu, Chenlin…