Summary of Losing Visual Needles in Image Haystacks: Vision Language Models Are Easily Distracted in Short and Long Contexts, by Aditya Sharma et al.
Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and…