Summary of Modelling Visual Semantics Via Image Captioning to Extract Enhanced Multi-level Cross-modal Semantic Incongruity Representation with Attention For Multimodal Sarcasm Detection, by Sajal Aggarwal et al.
Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with…