Summary of Towards Interpreting Language Models: A Case Study in Multi-Hop Reasoning, by Mansi Sakarvadia
Towards Interpreting Language Models: A Case Study in Multi-Hop Reasoning, by Mansi Sakarvadia. First submitted to arXiv…
VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models, by Ming Cheng, Jiaying Gong, Chenhan Yuan,…
Beyond Model Adaptation at Test Time: A Survey, by Zehao Xiao, Cees G. M. Snoek. First submitted…
Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection, by Geng Yu, Jianing Zhu, Jiangchao Yao, Bo…
DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models, by Ying Zhou, Xinyao Wang, Yulei Niu,…
Proxy-informed Bayesian transfer learning with unknown sources, by Sabina J. Sloman, Julien Martinelli, Samuel Kaski. First submitted…
Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment, by Jason Vega, Junsheng Huang,…
Conditional Vendi Score: An Information-Theoretic Approach to Diversity Evaluation of Prompt-based Generative Models, by Mohammad Jalali,…
Pretrained transformer efficiently learns low-dimensional target functions in-context, by Kazusato Oko, Yujin Song, Taiji Suzuki, Denny…
Defining and Evaluating Physical Safety for Large Language Models, by Yung-Chen Tang, Pin-Yu Chen, Tsung-Yi Ho. First…