Summary of Cmmath: a Chinese Multi-modal Math Skill Evaluation Benchmark For Foundation Models, by Zhong-zhi Li et al.
CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Modelsby Zhong-Zhi Li, Ming-Liang Zhang,…
CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Modelsby Zhong-Zhi Li, Ming-Liang Zhang,…
NinjaLLM: Fast, Scalable and Cost-effective RAG using Amazon SageMaker and AWS Trainium and Inferentia2by Tengfei…
The Art of Saying No: Contextual Noncompliance in Language Modelsby Faeze Brahman, Sachin Kumar, Vidhisha…
Thorns and Algorithms: Navigating Generative AI Challenges Inspired by Giraffes and Acaciasby Waqar HussainFirst submitted…
Reliable Reasoning Beyond Natural Languageby Nasim Borazjanizadeh, Steven T. PiantadosiFirst submitted to arxiv on: 16…
Large Vision-Language Models as Emotion Recognizers in Context Awarenessby Yuxuan Lei, Dingkang Yang, Zhaoyu Chen,…
EARN Fairness: Explaining, Asking, Reviewing, and Negotiating Artificial Intelligence Fairness Metrics Among Stakeholdersby Lin Luo,…
Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Promptsby Jianhao Li, Tianyu Sun,…
TM-PATHVQA:90000+ Textless Multilingual Questions for Medical Visual Question Answeringby Tonmoy Rajkhowa, Amartya Roy Chowdhury, Sankalp…
Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlightsby Shunqi Mao, Chaoyi Zhang,…