Summary of Better Language Models Exhibit Higher Visual Alignment, by Jona Ruthardt et al.
Better Language Models Exhibit Higher Visual Alignmentby Jona Ruthardt, Gertjan J. Burghouts, Serge Belongie, Yuki…
Better Language Models Exhibit Higher Visual Alignmentby Jona Ruthardt, Gertjan J. Burghouts, Serge Belongie, Yuki…
Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance?by Fumiya…
MentalArena: Self-play Training of Language Models for Diagnosis and Treatment of Mental Health Disordersby Cheng…
Epsilon-VAE: Denoising as Visual Decodingby Long Zhao, Sanghyun Woo, Ziyu Wan, Yandong Li, Han Zhang,…
From Reading to Compressing: Exploring the Multi-document Reader for Prompt Compressionby Eunseong Choi, Sunkyung Lee,…
Grounding 3D Scene Affordance From Egocentric Interactionsby Cuiyu Liu, Wei Zhai, Yuhang Yang, Hongchen Luo,…
See then Tell: Enhancing Key Information Extraction with Vision Groundingby Shuhang Liu, Zhenrong Zhang, Pengfei…
LifeGPT: Topology-Agnostic Generative Pretrained Transformer Model for Cellular Automataby Jaime A. Berkovich, Markus J. BuehlerFirst…
GCA-SUN: A Gated Context-Aware Swin-UNet for Exemplar-Free Countingby Yuzhe Wu, Yipeng Xu, Tianyu Xu, Jialu…
MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototypingby Amirreza Fateh, Mohammad Reza Mohammadi,…