Summary of Multi-frame, Lightweight & Efficient Vision-language Models For Question Answering in Autonomous Driving, by Akshay Gopalkrishnan et al.
Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Drivingby Akshay Gopalkrishnan, Ross…