Summary of VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection, by Songhao Han et al.
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection by Songhao Han,…