Summary of Real-time Fake News From Adversarial Feedback, by Sanxing Chen et al.
Real-time Fake News from Adversarial Feedback by Sanxing Chen, Yukun Huang, Bhuwan Dhingra. First submitted to arxiv…
LAR-ECHR: A New Legal Argument Reasoning Task and Dataset for Cases of the European Court…
MIRROR: A Novel Approach for the Automated Evaluation of Open-Ended Question Generation by Aniket Deroy, Subhankar…
FaithBench: A Diverse Hallucination Benchmark for Summarization by Modern LLMs by Forrest Sheng Bao, Miaoran Li,…
Capturing Bias Diversity in LLMs by Purva Prasad Gosavi, Vaishnavi Murlidhar Kulkarni, Alan F. Smeaton. First submitted…
VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models by Lisa Dunlap, Krishna Mandal, Trevor…
Prompt Engineering a Schizophrenia Chatbot: Utilizing a Multi-Agent Approach for Enhanced Compliance with Prompt Instructions by…
Large Language Models for Medical OSCE Assessment: A Novel Approach to Transcript Analysis by Ameer Hamza…
Investigating Implicit Bias in Large Language Models: A Large-Scale Study of Over 50 LLMs by Divyanshu…
Evaluating Morphological Compositional Generalization in Large Language Models by Mete Ismayilzada, Defne Circi, Jonne Sälevä, Hale…