Summary of Evidence-driven Retrieval Augmented Response Generation For Online Misinformation, by Zhenrui Yue et al.
Evidence-Driven Retrieval Augmented Response Generation for Online Misinformation by Zhenrui Yue, Huimin Zeng, Yimeng Lu, Lanyu…
MedAide: Leveraging Large Language Models for On-Premise Medical Assistance on Edge Devices by Abdul Basit, Khizar…
Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization by…
Direct Language Model Alignment from Online AI Feedback by Shangmin Guo, Biao Zhang, Tianlin Liu, Tianqi…
Investigating Bias Representations in Llama 2 Chat via Activation Steering by Dawn Lu, Nina Rimsky. First submitted…
The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts by Lingfeng Shen, Weiting Tan,…
Reinforcement learning for question answering in programming domain using public community scoring as a human…
Low-Rank Contextual Reinforcement Learning from Heterogeneous Human Feedback by Seong Jin Lee, Will Wei Sun, Yufeng…
Comparing Few to Rank Many: Active Human Preference Learning using Randomized Frank-Wolfe by Kiran Koshy Thekumparampil,…
MPPO: Multi Pair-wise Preference Optimization for LLMs with Arbitrary Negative Samples by Shuo Xie, Fangzhi Zhu,…