Summary of LookupViT: Compressing Visual Information to a Limited Number of Tokens, by Rajat Koner et al.
LookupViT: Compressing visual information to a limited number of tokens, by Rajat Koner, Gagan Jain, Prateek…
Contrastive Adversarial Training for Unsupervised Domain Adaptation, by Jiahong Chen, Zhilin Zhang, Lucy Li, Behzad Shahrasbi,…
A Scalable Real-Time Data Assimilation Framework for Predicting Turbulent Atmosphere Dynamics, by Junqi Yin, Siming Liang,…
When to Accept Automated Predictions and When to Defer to Human Judgment? by Daniel Sikar, Artur…
Transfer Learning with Self-Supervised Vision Transformers for Snake Identification, by Anthony Miyaguchi, Murilo Gustineli, Austin Fischer,…
Multi-Label Plant Species Classification with Self-Supervised Vision Transformers, by Murilo Gustineli, Anthony Miyaguchi, Ian Stalter. First submitted…
QMViT: A Mushroom is worth 16x16 Words, by Siddhant Dutta, Hemant Singh, Kalpita Shankhdhar, Sridhar Iyer. First…
Multi-modal Masked Siamese Network Improves Chest X-Ray Representation Learning, by Saeed Shurrab, Alejandro Guerra-Manzanares, Farah E.…
Guided Context Gating: Learning to leverage salient lesions in retinal fundus images, by Teja Krishna Cherukuri,…
When Will Gradient Regularization Be Harmful? by Yang Zhao, Hao Zhang, Xiuyuan Hu. First submitted to arXiv…