Summary of Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems, by Junyi Ye et al.
Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems by Junyi Ye, Jingyi…
Evaluating AI-Generated Essays with GRE Analytical Writing Assessment by Yang Zhong, Jiangang Hao, Michael Fauss, Chen…
Revealing Hidden Bias in AI: Lessons from Large Language Models by Django Beatty, Kritsada Masanthia, Teepakorn…
Polymath: A Challenging Multi-modal Mathematical Reasoning Benchmark by Himanshu Gupta, Shreyas Verma, Ujjwala Anantheswaran, Kevin Scaria,…
TimeSeriesExam: A time series understanding exam by Yifu Cai, Arjun Choudhry, Mononito Goswami, Artur Dubrawski. First submitted…
MIRROR: A Novel Approach for the Automated Evaluation of Open-Ended Question Generation by Aniket Deroy, Subhankar…
Evaluating Morphological Compositional Generalization in Large Language Models by Mete Ismayilzada, Defne Circi, Jonne Sälevä, Hale…
OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalities by Lichang Chen, Hexiang Hu, Mingda Zhang,…
Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructions by Yuhan Fu, Ruobing Xie, Jiazhen Liu,…
Evidence of Cognitive Deficits and Developmental Advances in Generative AI: A Clock Drawing Test Analysis by Isaac…