Image captioning – Page 4 – GrooveSquid.com

Loading Now

July 13, 2025

Summary of Explainable Image Captioning Using Cnn- Cnn Architecture and Hierarchical Attention, by Rishi Kesav Mohan et al.

Explainable Image Captioning using CNN- CNN architecture and Hierarchical Attentionby Rishi Kesav Mohan, Sanjay Sureshkumar,…

July 13, 2025

Summary of Pseudo-ris: Distinctive Pseudo-supervision Generation For Referring Image Segmentation, by Seonghoon Yu et al.

Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentationby Seonghoon Yu, Paul Hongsuck Seo, Jeany SonFirst…

July 13, 2025

Summary of Raven: Multitask Retrieval Augmented Vision-language Learning, by Varun Nagaraj Rao et al.

RAVEN: Multitask Retrieval Augmented Vision-Language Learningby Varun Nagaraj Rao, Siddharth Choudhary, Aditya Deshpande, Ravi Kumar…

July 13, 2025

Summary of Do More Details Always Introduce More Hallucinations in Lvlm-based Image Captioning?, by Mingqian Feng et al.

Do More Details Always Introduce More Hallucinations in LVLM-based Image Captioning?by Mingqian Feng, Yunlong Tang,…

July 13, 2025

Summary of Ospc: Detecting Harmful Memes with Large Language Model As a Catalyst, by Jingtao Cao et al.

OSPC: Detecting Harmful Memes with Large Language Model as a Catalystby Jingtao Cao, Zheng Zhang,…

July 13, 2025

Summary of From Redundancy to Relevance: Information Flow in Lvlms Across Reasoning Tasks, by Xiaofeng Zhang et al.

From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasksby Xiaofeng Zhang, Yihao Quan,…

July 13, 2025

Summary of Fleur: An Explainable Reference-free Evaluation Metric For Image Captioning Using a Large Multimodal Model, by Yebin Lee et al.

FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Modelby Yebin…

July 13, 2025

Summary of Text-only Synthesis For Image Captioning, by Qing Zhou et al.

Text-only Synthesis for Image Captioningby Qing Zhou, Junlin Huang, Qiang Li, Junyu Gao, Qi WangFirst…

July 13, 2025

Summary of Class-conditional Self-reward Mechanism For Improved Text-to-image Models, by Safouane El Ghazouali et al.

Class-Conditional self-reward mechanism for improved Text-to-Image modelsby Safouane El Ghazouali, Arnaud Gucciardi, Umberto MichelucciFirst submitted…

July 13, 2025

Summary of Towards Retrieval-augmented Architectures For Image Captioning, by Sara Sarto et al.

Towards Retrieval-Augmented Architectures for Image Captioningby Sara Sarto, Marcella Cornia, Lorenzo Baraldi, Alessandro Nicolosi, Rita…