Summary of A Notso Simple Way to Beat Simple Bench, by Soham Sane and Angus Mclean
A NotSo Simple Way to Beat Simple Benchby Soham Sane, Angus McLeanFirst submitted to arxiv…
A NotSo Simple Way to Beat Simple Benchby Soham Sane, Angus McLeanFirst submitted to arxiv…
OpenReviewer: A Specialized Large Language Model for Generating Critical Scientific Paper Reviewsby Maximilian Idahl, Zahra…
Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detectionby Guangsheng Bao,…
Seeing the Forest and the Trees: Solving Visual Graph and Tree Based Data Structure Problems…
Codenames as a Benchmark for Large Language Modelsby Matthew Stephenson, Matthew Sidji, Benoît RonvalFirst submitted…
LLMs-in-the-Loop Part 2: Expert Small AI Models for Anonymization and De-identification of PHI Across Multiple…
MedG-KRP: Medical Graph Knowledge Representation Probingby Gabriel R. Rosenbaum, Lavender Yao Jiang, Ivaxi Sheth, Jaden…
Evaluation of GPT-4o and GPT-4o-mini’s Vision Capabilities for Compositional Analysis from Dried Solution Dropsby Deven…
Evaluating Robustness of LLMs on Crisis-Related Microblogs across Events, Information Types, and Linguistic Featuresby Muhammad…
GPTDrawer: Enhancing Visual Synthesis through ChatGPTby Kun Li, Xinwei Chen, Tianyou Song, Hansong Zhang, Wenzhe…