Summary of Vibecheck: Discover and Quantify Qualitative Differences in Large Language Models, by Lisa Dunlap et al.
VibeCheck: Discover and Quantify Qualitative Differences in Large Language Modelsby Lisa Dunlap, Krishna Mandal, Trevor…
VibeCheck: Discover and Quantify Qualitative Differences in Large Language Modelsby Lisa Dunlap, Krishna Mandal, Trevor…
Investigating Implicit Bias in Large Language Models: A Large-Scale Study of Over 50 LLMsby Divyanshu…
Identifying Task Groupings for Multi-Task Learning Using Pointwise V-Usable Informationby Yingya Li, Timothy Miller, Steven…
ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilitiesby Zhenchao Jin, Mengchen Liu,…
A Recipe For Building a Compliant Real Estate Chatbotby Navid Madani, Anusha Bagalkotkar, Supriya Anand,…
Applying Refusal-Vector Ablation to Llama 3.1 70B Agentsby Simon Lermen, Mateusz Dziemian, Govind PimpaleFirst submitted…
SimpleStrat: Diversifying Language Model Generation with Stratificationby Justin Wong, Yury Orlovskiy, Michael Luo, Sanjit A.…
Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machinesby Junyu Lai, Jiahe Xu, Yao Yang,…
Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent Systemby Weize Chen, Jiarui Yuan, Chen Qian,…
SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layersby Viktoriia Chekalina, Anna Rudenko, Gleb…