Summary of G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o, by Tony Cheng Tong et al.
G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o by Tony Cheng Tong,…
LLM-SEM: A Sentiment-Based Student Engagement Metric Using LLMS for E-Learning Platforms by Ali Hamdi, Ahmed Abdelmoneim…
Experience of Training a 1.7B-Parameter LLaMa Model From Scratch by Miles Q. Li, Benjamin C. M.…
GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models by Mukai Li, Lei…
How Different AI Chatbots Behave? Benchmarking Large Language Models in Behavioral Economics Games by Yutong Xie,…
Explainable Procedural Mistake Detection by Shane Storks, Itamar Bar-Yossef, Yayuan Li, Zheyuan Zhang, Jason J. Corso,…
OpenReviewer: A Specialized Large Language Model for Generating Critical Scientific Paper Reviews by Maximilian Idahl, Zahra…
Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents by Wonje Choi, Woo Kyung Kim,…
Embodied CoT Distillation From LLM To Off-the-shelf Agents by Wonje Choi, Woo Kyung Kim, Minjong Yoo,…
Bias Vector: Mitigating Biases in Language Models with Task Arithmetic Approach by Daiki Shirafuji, Makoto Takenaka,…