Summary of Diagnosing Robotics Systems Issues with Large Language Models, by Jordis Emilia Herrmann et al.
Diagnosing Robotics Systems Issues with Large Language Models, by Jordis Emilia Herrmann, Aswath Mandakath Gopinath, Mikael…
Synth-SONAR: Sonar Image Synthesis with Enhanced Diversity and Realism via Dual Diffusion Models and GPT…
Benchmarking Agentic Workflow Generation, by Shuofei Qiao, Runnan Fang, Zhisong Qiu, Xiaobin Wang, Ningyu Zhang, Yong…
PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiency, by Preferred Elements, Kenshin Abe, Kaizaburo Chubachi,…
SAGE: Scalable Ground Truth Evaluations for Large Sparse Autoencoders, by Constantin Venhoff, Anisoara Calinescu, Philip Torr,…
Do great minds think alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA, by Maharshi Gor,…
ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time, by Yi Ding, Bolian…
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple…
Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack, by Leo McKee-Reid, Christoph…
TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data, by Jeremy Andrew Irvin, Emily Ruoyu…