Summary of Benchmarking Cognitive Domains For Llms: Insights From Taiwanese Hakka Culture, by Chen-chi Chang et al.
Benchmarking Cognitive Domains for LLMs: Insights from Taiwanese Hakka Cultureby Chen-Chi Chang, Ching-Yuan Chen, Hung-Shin…
Benchmarking Cognitive Domains for LLMs: Insights from Taiwanese Hakka Cultureby Chen-Chi Chang, Ching-Yuan Chen, Hung-Shin…
GenAI-powered Multi-Agent Paradigm for Smart Urban Mobility: Opportunities and Challenges for Integrating Large Language Models…
Evaluating ChatGPT on Nuclear Domain-Specific Databy Muhammad Anwar, Mischa de Costa, Issam Hammad, Daniel LauFirst…
LRP4RAG: Detecting Hallucinations in Retrieval-Augmented Generation via Layer-wise Relevance Propagationby Haichuan Hu, Yuhan Sun, Quanjun…
Probing Causality Manipulation of Large Language Modelsby Chenyang Zhang, Haibo Tong, Bin Zhang, Dongyu ZhangFirst…
Evidence-backed Fact Checking using RAG and Few-Shot In-Context Learning with LLMsby Ronit Singhal, Pransh Patwa,…
Xinyu: An Efficient LLM-based System for Commentary Generationby Yiquan Wu, Bo Tang, Chenyang Xi, Yu…
LegalBench-RAG: A Benchmark for Retrieval-Augmented Generation in the Legal Domainby Nicholas Pipitone, Ghita Houir AlamiFirst…
Graph Retrieval-Augmented Generation: A Surveyby Boci Peng, Yun Zhu, Yongchao Liu, Xiaohe Bo, Haizhou Shi,…
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generationby Dongyu Ru, Lin Qiu, Xiangkun Hu, Tianhang…