Summary of Benchmarking Vision Language Models For Cultural Understanding, by Shravan Nayak et al.
Benchmarking Vision Language Models for Cultural Understandingby Shravan Nayak, Kanishk Jain, Rabiul Awal, Siva Reddy,…
Benchmarking Vision Language Models for Cultural Understandingby Shravan Nayak, Kanishk Jain, Rabiul Awal, Siva Reddy,…
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?by Ruisheng Cao,…
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenesby Yaoting Wang, Peiwen Sun, Dongzhan Zhou, Guangyao…
Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusionby Yongyuan Liang, Tingqiang Xu, Kaizhe Hu,…
Building Artificial Intelligence with Creative Agency and Self-hoodby Liane Gabora, Joscha BachFirst submitted to arxiv…
Do Large Language Models Understand Verbal Indicators of Romantic Attraction?by Sandra C. Matz, Heinrich Peters,…
MedBench: A Comprehensive, Standardized, and Reliable Benchmarking System for Evaluating Chinese Medical Large Language Modelsby…
LionGuard: Building a Contextualized Moderation Classifier to Tackle Localized Unsafe Contentby Jessica Foo, Shaun KhooFirst…
TALEC: Teach Your LLM to Evaluate in Specific Domain with In-house Criteria by Criteria Division…
Visualization Literacy of Multimodal Large Language Models: A Comparative Studyby Zhimin Li, Haichao Miao, Valerio…