Summary of Llmr: Knowledge Distillation with a Large Language Model-induced Reward, by Dongheng Li et al.
LLMR: Knowledge Distillation with a Large Language Model-Induced Rewardby Dongheng Li, Yongchang Hao, Lili MouFirst…
LLMR: Knowledge Distillation with a Large Language Model-Induced Rewardby Dongheng Li, Yongchang Hao, Lili MouFirst…
CamelEval: Advancing Culturally Aligned Arabic Language Models and Benchmarksby Zhaozhi Qian, Faroq Altam, Muhammad Alqurishi,…
Enhancing Construction Site Safety: A Lightweight Convolutional Network for Effective Helmet Detectionby Mujadded Al Rabbani…
Connecting Ideas in ‘Lower-Resource’ Scenarios: NLP for National Varieties, Creoles and Other Low-resource Scenariosby Aditya…
Optical Flow Matters: an Empirical Comparative Study on Fusing Monocular Extracted Modalities for Better Steeringby…
Fine Tuning Large Language Models for Medicine: The Role and Importance of Direct Preference Optimizationby…
GaRField++: Reinforced Gaussian Radiance Fields for Large-Scale 3D Scene Reconstructionby Hanyue Zhang, Zhiliu Yang, Xinhe…
Evaluating Image Hallucination in Text-to-Image Generation with Question-Answeringby Youngsun Lim, Hojun Choi, Hyunjung ShimFirst submitted…
KnowFormer: Revisiting Transformers for Knowledge Graph Reasoningby Junnan Liu, Qianren Mao, Weifeng Jiang, Jianxin LiFirst…
FoodPuzzle: Developing Large Language Model Agents as Flavor Scientistsby Tenghao Huang, Donghee Lee, John Sweeney,…