Summary of Benchmarking Vision Language Models For Cultural Understanding, by Shravan Nayak et al.
Benchmarking Vision Language Models for Cultural Understandingby Shravan Nayak, Kanishk Jain, Rabiul Awal, Siva Reddy,…
Benchmarking Vision Language Models for Cultural Understandingby Shravan Nayak, Kanishk Jain, Rabiul Awal, Siva Reddy,…
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenesby Yaoting Wang, Peiwen Sun, Dongzhan Zhou, Guangyao…
Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusionby Yongyuan Liang, Tingqiang Xu, Kaizhe Hu,…
LAB-Bench: Measuring Capabilities of Language Models for Biology Researchby Jon M. Laurent, Joseph D. Janizek,…
An Empirical Study of Mamba-based Pedestrian Attribute Recognitionby Xiao Wang, Weizhe Kong, Jiandong Jin, Shiao…
NTSEBENCH: Cognitive Reasoning Benchmark for Vision Language Modelsby Pranshu Pandya, Vatsal Gupta, Agney S Talwarr,…
Cooperative Reward Shaping for Multi-Agent Pathfindingby Zhenyu Song, Ronghao Zheng, Senlin Zhang, Meiqin LiuFirst submitted…
Expanding the Scope: Inductive Knowledge Graph Reasoning with Multi-Starting Progressive Propagationby Zhoutian Shao, Yuanning Cui,…
Melon Fruit Detection and Quality Assessment Using Generative AI-Based Image Data Augmentationby Seungri Yoon, Yunseong…
A Multi-Stage Framework for 3D Individual Tooth Segmentation in Dental CBCTby Chunshi Wang, Bin Zhao,…