Summary of Self-rewarding Language Models, by Weizhe Yuan et al.
Self-Rewarding Language Modelsby Weizhe Yuan, Richard Yuanzhe Pang, Kyunghyun Cho, Xian Li, Sainbayar Sukhbaatar, Jing…
Self-Rewarding Language Modelsby Weizhe Yuan, Richard Yuanzhe Pang, Kyunghyun Cho, Xian Li, Sainbayar Sukhbaatar, Jing…
Hallucination Detection and Hallucination Mitigation: An Investigationby Junliang Luo, Tianyu Li, Di Wu, Michael Jenkin,…
PersianMind: A Cross-Lingual Persian-English Large Language Modelby Pedram Rostami, Ali Salemi, Mohammad Javad DoustiFirst submitted…
SH2: Self-Highlighted Hesitation Helps You Decode More Truthfullyby Jushi Kai, Tianhang Zhang, Hai Hu, Zhouhan…
How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by…
Fighting Fire with Fire: Adversarial Prompting to Generate a Misinformation Detection Datasetby Shrey Satapara, Parth…
Revisiting Zero-Shot Abstractive Summarization in the Era of Large Language Models from the Perspective of…
TinyLlama: An Open-Source Small Language Modelby Peiyuan Zhang, Guangtao Zeng, Tianduo Wang, Wei LuFirst submitted…
Large Legal Fictions: Profiling Legal Hallucinations in Large Language Modelsby Matthew Dahl, Varun Magesh, Mirac…
LLaMA Beyond English: An Empirical Study on Language Capability Transferby Jun Zhao, Zhihao Zhang, Luhui…