Summary of WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia, by Yufang Hou et al.
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia by Yufang Hou, Alessandra…