Summary of FaithBench: A Diverse Hallucination Benchmark for Summarization by Modern LLMs, by Forrest Sheng Bao et al.
FaithBench: A Diverse Hallucination Benchmark for Summarization by Modern LLMs, by Forrest Sheng Bao, Miaoran Li,…
Anchored Alignment for Self-Explanations Enhancement, by Luis Felipe Villa-Arenas, Ata Nizamoglu, Qianli Wang, Sebastian Möller, Vera…
CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy, by Mian Zhang, Xianjun Yang, Xinlu…
Research on Travel Route Planing Problems Based on Greedy Algorithm, by Yiquan Wang. First submitted to arXiv…
SPIN: Self-Supervised Prompt INjection, by Leon Zhou, Junfeng Yang, Chengzhi Mao. First submitted to arXiv on: 17…
Large Language Models are Easily Confused: A Quantitative Metric, Security Implications and Typological Analysis, by Yiyi…
Atomic Calibration of LLMs in Long-Form Generations, by Caiqi Zhang, Ruihan Yang, Zhisong Zhang, Xinting Huang,…
Automatic Translation Alignment Pipeline for Multilingual Digital Editions of Literary Works, by Maria Levchenko. First submitted to…
Roadmap towards Superhuman Speech Understanding using Large Language Models, by Fan Bu, Yuhao Zhang, Xidong Wang,…
Advancing Large Language Model Attribution through Self-Improving, by Lei Huang, Xiaocheng Feng, Weitao Ma, Liang Zhao,…