Summary of Super: Evaluating Agents on Setting Up and Executing Tasks From Research Repositories, by Ben Bogin et al.
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositoriesby Ben Bogin, Kejuan…
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositoriesby Ben Bogin, Kejuan…
Questioning Internal Knowledge Structure of Large Language Models Through the Lens of the Olympic Gamesby…
RIRAG: Regulatory Information Retrieval and Answer Generationby Tuba Gokhan, Kexin Wang, Iryna Gurevych, Ted BriscoeFirst…