Summary of What Is the Best Model? Application-driven Evaluation For Large Language Models, by Shiguo Lian et al.
What is the best model? Application-driven Evaluation for Large Language Models by Shiguo Lian, Kaikai Zhao,…
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs by Rui Yang, Ruomeng Ding, Yong…
Multi-Modal Retrieval For Large Language Model Based Speech Recognition by Jari Kolehmainen, Aditya Gourav, Prashanth Gurunath…
Talking Heads: Understanding Inter-layer Communication in Transformer Language Models by Jack Merullo, Carsten Eickhoff, Ellie Pavlick First…
OLMES: A Standard for Language Model Evaluations by Yuling Gu, Oyvind Tafjord, Bailey Kuehl, Dany Haddad,…
Language Model Council: Democratically Benchmarking Foundation Models on Highly Subjective Tasks by Justin Zhao, Flor Miriam…
AustroTox: A Dataset for Target-Based Austrian German Offensive Language Detection by Pia Pachinger, Janis Goldzycher, Anna…
Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling by Zile Qiao, Wei Ye, Yong Jiang, Tong Mo,…
Collective Constitutional AI: Aligning a Language Model with Public Input by Saffron Huang, Divya Siddarth, Liane…
BvSP: Broad-view Soft Prompting for Few-Shot Aspect Sentiment Quad Prediction by Yinhao Bai, Yalan Xie, Xiaoyi…