Summary of UserSumBench: A Benchmark Framework for Evaluating User Summarization Approaches, by Chao Wang et al.
UserSumBench: A Benchmark Framework for Evaluating User Summarization Approaches by Chao Wang, Neo Wu, Lin Ning,…
ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Models by Yeji…
SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination Detection by Mengya Hu, Rui Xu,…
MAPLE: Enhancing Review Generation with Multi-Aspect Prompt LEarning in Explainable Recommendation by Ching-Wen Yang, Che Wei…
Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language…
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability by Jiri Hron,…
Dialogue Ontology Relation Extraction via Constrained Chain-of-Thought Decoding by Renato Vukovic, David Arps, Carel van Niekerk,…
Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering by Danfeng Guo, Demetri…
Cost-Effective Hallucination Detection for LLMs by Simon Valentin, Jinmiao Fu, Gianluca Detommaso, Shaoyuan Xu, Giovanni Zappella,…
The Need for Guardrails with Large Language Models in Medical Safety-Critical Settings: An Artificial Intelligence…