Summary of Codenames As a Benchmark For Large Language Models, by Matthew Stephenson et al.
Codenames as a Benchmark for Large Language Modelsby Matthew Stephenson, Matthew Sidji, Benoît RonvalFirst submitted…
Codenames as a Benchmark for Large Language Modelsby Matthew Stephenson, Matthew Sidji, Benoît RonvalFirst submitted…
Beyond Discrete Personas: Personality Modeling Through Journal Intensive Conversationsby Sayantan Pal, Souvik Das, Rohini K.…
Learning to Verify Summary Facts with Fine-Grained LLM Feedbackby Jihwan Oh, Jeonghwan Choi, Nicole Hee-Yeon…
VLR-Bench: Multilingual Benchmark Dataset for Vision-Language Retrieval Augmented Generationby Hyeonseok Lim, Dongjae Shin, Seohyun Song,…
Benchmarking LLMs for Mimicking Child-Caregiver Language in Interactionby Jing Liu, Abdellah FourtassiFirst submitted to arxiv…
Assessing Personalized AI Mentoring with Large Language Models in the Computing Fieldby Xiao Luo, Sean…
Generating Knowledge Graphs from Large Language Models: A Comparative Study of GPT-4, LLaMA 2, and…
PromptRefine: Enhancing Few-Shot Performance on Low-Resource Indic Languages with Example Selection from Related Example Banksby…
Enhancing Cross-Language Code Translation via Task-Specific Embedding Alignment in Retrieval-Augmented Generationby Manish Bhattarai, Minh Vu,…
Neuro-Symbolic Data Generation for Math Reasoningby Zenan Li, Zhi Zhou, Yuan Yao, Yu-Feng Li, Chun…