Summary of TimeSeriesExam: A Time Series Understanding Exam, by Yifu Cai et al.
TimeSeriesExam: A time series understanding exam by Yifu Cai, Arjun Choudhry, Mononito Goswami, Artur Dubrawski. First submitted…
LAR-ECHR: A New Legal Argument Reasoning Task and Dataset for Cases of the European Court…
FaithBench: A Diverse Hallucination Benchmark for Summarization by Modern LLMs by Forrest Sheng Bao, Miaoran Li,…
Large Language Models for Medical OSCE Assessment: A Novel Approach to Transcript Analysis by Ameer Hamza…
Investigating Implicit Bias in Large Language Models: A Large-Scale Study of Over 50 LLMs by Divyanshu…
MIRROR: A Novel Approach for the Automated Evaluation of Open-Ended Question Generation by Aniket Deroy, Subhankar…
Evaluating Morphological Compositional Generalization in Large Language Models by Mete Ismayilzada, Defne Circi, Jonne Sälevä, Hale…
Identifying Task Groupings for Multi-Task Learning Using Pointwise V-Usable Information by Yingya Li, Timothy Miller, Steven…
Capturing Bias Diversity in LLMs by Purva Prasad Gosavi, Vaishnavi Murlidhar Kulkarni, Alan F. Smeaton. First submitted…
Prompt Engineering a Schizophrenia Chatbot: Utilizing a Multi-Agent Approach for Enhanced Compliance with Prompt Instructions by…