Summary of To Believe or Not to Believe Your LLM, by Yasin Abbasi Yadkori et al.
To Believe or Not to Believe Your LLM by Yasin Abbasi Yadkori, Ilja Kuzborskij, András György,…
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models by Marianna…
Causal prompting model-based offline reinforcement learning by Xuehui Yu, Yi Guan, Rujia Shen, Xin Li, Chen…
Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction by Xiaoyuan…
Empirical influence functions to understand the logic of fine-tuning by Jordan K. Matelsky, Lyle Ungar, Konrad…
Toward Conversational Agents with Context and Time Sensitive Long-term Memory by Nick Alonso, Tomás Figliolia, Anthony…
Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical Machine Reading Comprehension by Shubham Vatsal, Ayush…
Crafting Interpretable Embeddings by Asking LLMs Questions by Vinamra Benara, Chandan Singh, John X. Morris, Richard…
Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models by Abhishek…
Chain-of-Thought Prompting for Demographic Inference with Large Multimodal Models by Yongsheng Yu, Jiebo Luo. First submitted to…