GPT – Page 73 – GrooveSquid.com

July 13, 2025

Evaluating and Optimizing Educational Content with Large Language Model Judgments by Joy He-Yueya, Noah D. Goodman,…

July 13, 2025

VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT by Yifang Xu, Yunzhuo Sun, Zien Xie, Benxiang…

July 13, 2025

NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese…

July 13, 2025

SoftTiger: A Clinical Foundation Model for Healthcare Workflows by Ye Chen, Igor Couto, Wei Cai, Cong…

July 13, 2025

Executing Natural Language-Described Algorithms with Large Language Models: An Investigation by Xin Zheng, Qiming Zhu, Hongyu…

July 13, 2025

Loose LIPS Sink Ships: Asking Questions in Battleship with Language-Informed Program Sampling by Gabriel Grand, Valerio…

July 13, 2025

Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese by…

July 13, 2025

Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data by…

July 13, 2025

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web by…

July 13, 2025

Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents by Corby Rosset, Ho-Lam…