Summary of COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training, by Sanghwan Kim et al.
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training, by Sanghwan Kim, Rui Xiao, Mariana-Iuliana Georgescu, Stephan Alaniz,…