Summary of Video-CCAM: Enhancing Video-Language Understanding with Causal Cross-Attention Masks for Short and Long Videos, by Jiajun Fei et al.
Video-CCAM: Enhancing Video-Language Understanding with Causal Cross-Attention Masks for Short and Long Videos by Jiajun Fei,…