Summary of Evaluating Ai-generated Essays with Gre Analytical Writing Assessment, by Yang Zhong et al.
Evaluating AI-Generated Essays with GRE Analytical Writing Assessmentby Yang Zhong, Jiangang Hao, Michael Fauss, Chen…
Evaluating AI-Generated Essays with GRE Analytical Writing Assessmentby Yang Zhong, Jiangang Hao, Michael Fauss, Chen…
In Context Learning and Reasoning for Symbolic Regression with Large Language Modelsby Samiha Sharlin, Tyler…
CLR-Bench: Evaluating Large Language Models in College-level Reasoningby Junnan Dong, Zijin Hong, Yuanchen Bei, Feiran…
VoiceBench: Benchmarking LLM-Based Voice Assistantsby Yiming Chen, Xianghu Yue, Chen Zhang, Xiaoxue Gao, Robby T.…
Exploring Possibilities of AI-Powered Legal Assistance in Bangladesh through Large Language Modelingby Azmine Toushik Wasi,…
Revealing Hidden Bias in AI: Lessons from Large Language Modelsby Django Beatty, Kritsada Masanthia, Teepakorn…
An Eye for an AI: Evaluating GPT-4o’s Visual Perception Skills and Geometric Reasoning Skills Using…
Towards a Reliable Offline Personal AI Assistant for Long Duration Spaceflightby Oliver Bensch, Leonie Bensch,…
Language Model Probabilities are Not Calibrated in Numeric Contextsby Charles Lovering, Michael Krumdick, Viet Dac…
Improve Vision Language Model Chain-of-thought Reasoningby Ruohong Zhang, Bowen Zhang, Yanghao Li, Haotian Zhang, Zhiqing…