Summary of Ctrlsynth: Controllable Image Text Synthesis For Data-efficient Multimodal Learning, by Qingqing Cao et al.
CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learningby Qingqing Cao, Mahyar Najibi, Sachin MehtaFirst…