Summary of Robust Agents Learn Causal World Models, by Jonathan Richens et al.
Robust agents learn causal world modelsby Jonathan Richens, Tom EverittFirst submitted to arxiv on: 16…
Robust agents learn causal world modelsby Jonathan Richens, Tom EverittFirst submitted to arxiv on: 16…
Explainability for Machine Learning Models: From Data Adaptability to User Perceptionby julien DelaunayFirst submitted to…
Multi-modal Preference Alignment Remedies Degradation of Visual Instruction Tuning on Language Modelsby Shengzhi Li, Rongyu…
Active Preference Optimization for Sample Efficient RLHFby Nirjhar Das, Souradip Chakraborty, Aldo Pacchiano, Sayak Ray…
Can Transformers Predict Vibrations?by Fusataka Kuniyoshi, Yoshihide SawadaFirst submitted to arxiv on: 16 Feb 2024CategoriesMain:…
Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMsby Yeonhong Park, Jake Hyun, SangLyul Cho, Bonggeun…
Properties and Challenges of LLM-Generated Explanationsby Jenny Kunz, Marco KuhlmannFirst submitted to arxiv on: 16…
Personalised Drug Identifier for Cancer Treatment with Transformers using Auxiliary Informationby Aishwarya Jayagopal, Hansheng Xue,…
Direct Preference Optimization with an Offsetby Afra Amini, Tim Vieira, Ryan CotterellFirst submitted to arxiv…
Optimizing Adaptive Experiments: A Unified Approach to Regret Minimization and Best-Arm Identificationby Chao Qin, Daniel…