Summary of World to Code: Multi-modal Data Generation Via Self-instructed Compositional Captioning and Filtering, by Jiacong Wang et al.
World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering by Jiacong Wang, Bohong…