Summary of Pre-Trained Vision-Language Models as Partial Annotators, by Qian-Wei Wang et al.
Pre-Trained Vision-Language Models as Partial Annotators by Qian-Wei Wang, Yuqiu Xie, Letian Zhang, Zimo Liu, Shu-Tao…
Decoding Decision Reasoning: A Counterfactual-Powered Model for Knowledge Discovery by Yingying Fang, Zihao Jin, Xiaodan Xing,…
Pseudo-label Based Domain Adaptation for Zero-Shot Text Steganalysis by Yufei Luo, Zhen Yang, Ru Zhang, Jianyi…
FLOW: Fusing and Shuffling Global and Local Views for Cross-User Human Activity Recognition with IMUs by…
Negative Prototypes Guided Contrastive Learning for WSOD by Yu Zhang, Chuang Zhu, Guoqing Yang, Siqi Chen. First…
Assessment of Sentinel-2 spatial and temporal coverage based on the scene classification layer by Cristhian Sanchez,…
Flexible ViG: Learning the Self-Saliency for Flexible Object Recognition by Lin Zuo, Kunshan Yang, Xianlong Tian,…
Nomic Embed Vision: Expanding the Latent Space by Zach Nussbaum, Brandon Duderstadt, Andriy Mulyar. First submitted to…
Improving Execution Concurrency in Partial-Order Plans via Block-Substitution by Sabah Binte Noor, Fazlul Hasan Siddiqui. First submitted…
MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data by William Berman, Alexander Peysakhovich. First submitted to arXiv…