Summary of Cmmath: a Chinese Multi-modal Math Skill Evaluation Benchmark For Foundation Models, by Zhong-zhi Li et al.
CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Modelsby Zhong-Zhi Li, Ming-Liang Zhang,…
CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Modelsby Zhong-Zhi Li, Ming-Liang Zhang,…
The Pitfalls of Publishing in the Age of LLMs: Strange and Surprising Adventures with a…
Comprehensive Performance Evaluation of YOLOv12, YOLO11, YOLOv10, YOLOv9 and YOLOv8 on Detecting and Counting Fruitlet…
The Art of Saying No: Contextual Noncompliance in Language Modelsby Faeze Brahman, Sachin Kumar, Vidhisha…
NinjaLLM: Fast, Scalable and Cost-effective RAG using Amazon SageMaker and AWS Trainium and Inferentia2by Tengfei…
Relational Representation Distillationby Nikolaos Giakoumoglou, Tania StathakiFirst submitted to arxiv on: 16 Jul 2024CategoriesMain: Computer…
GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compressionby Daniel Goldstein, Fares…
Better RAG using Relevant Information Gainby Marc Pickett, Jeremy Hartman, Ayan Kumar Bhowmick, Raquib-ul Alam,…
LLMs-in-the-loop Part-1: Expert Small AI Models for Bio-Medical Text Translationby Bunyamin Keles, Murat Gunay, Serdar…
Predicting Emotion Intensity in Polish Political Texts: Comparing Supervised Models and Large Language Models in…