Summary of Assessing the Creativity Of Llms in Proposing Novel Solutions to Mathematical Problems, by Junyi Ye et al.
Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problemsby Junyi Ye, Jingyi…
Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problemsby Junyi Ye, Jingyi…
Evaluating AI-Generated Essays with GRE Analytical Writing Assessmentby Yang Zhong, Jiangang Hao, Michael Fauss, Chen…
Revealing Hidden Bias in AI: Lessons from Large Language Modelsby Django Beatty, Kritsada Masanthia, Teepakorn…
TimeSeriesExam: A time series understanding examby Yifu Cai, Arjun Choudhry, Mononito Goswami, Artur DubrawskiFirst submitted…
Polymath: A Challenging Multi-modal Mathematical Reasoning Benchmarkby Himanshu Gupta, Shreyas Verma, Ujjwala Anantheswaran, Kevin Scaria,…
MIRROR: A Novel Approach for the Automated Evaluation of Open-Ended Question Generationby Aniket Deroy, Subhankar…
Evaluating Morphological Compositional Generalization in Large Language Modelsby Mete Ismayilzada, Defne Circi, Jonne Sälevä, Hale…
OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalitiesby Lichang Chen, Hexiang Hu, Mingda Zhang,…
Evidence of Cognitive Deficits andDevelopmental Advances in Generative AI: A Clock Drawing Test Analysisby Isaac…
Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructionsby Yuhan Fu, Ruobing Xie, Jiazhen Liu,…