Gemini – Page 11 – GrooveSquid.com

July 13, 2025

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Modelsby Yanwei Li, Yuechen Zhang, Chengyao Wang,…

July 13, 2025

Evaluating the Efficacy of Prompt-Engineered Large Multimodal Models Versus Fine-Tuned Vision Transformers in Image-Based Security…

July 13, 2025

How Far Are We on the Decision-Making of LLMs? Evaluating LLMs’ Gaming Ability in Multi-Agent…

July 13, 2025

Gemma: Open Models Based on Gemini Research and Technologyby Gemma Team, Thomas Mesnard, Cassidy Hardin,…

July 13, 2025

How Well Do Multi-modal LLMs Interpret CT Scans? An Auto-Evaluation Framework for Analysesby Qingqing Zhu,…

July 13, 2025

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of contextby Gemini Team Google, Petko…

July 13, 2025

Can Large Language Models do Analytical Reasoning?by Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang,…

July 13, 2025

Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoningby Deepanway Ghosal,…

July 13, 2025

LLMs in Political Science: Heralding a New Era of Visual Analysisby Yu WangFirst submitted to…

July 13, 2025

GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluationby Yi Zong, Xipeng QiuFirst submitted to…