Summary of Hallucination Benchmark in Medical Visual Question Answering, by Jinge Wu et al.
Hallucination Benchmark in Medical Visual Question Answering, by Jinge Wu, Yunsoo Kim, Honghan Wu. First submitted to…
Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models, by Matthew Dahl, Varun Magesh, Mirac…
Question-Answering Based Summarization of Electronic Health Records using Retrieval Augmented Generation, by Walid Saba, Suzanne Wendelken,…
Adversarial Transformer Language Models for Contextual Commonsense Inference, by Pedro Colon-Hernandez, Henry Lieberman, Yida Xin, Claire…
An End-to-End Depth-Based Pipeline for Selfie Image Rectification, by Ahmed Alhawwary, Phong Nguyen-Ha, Janne Mustaniemi, Janne…
A Comparative Study of DSPy Teleprompter Algorithms for Aligning Large Language Models Evaluation Metrics to…
Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data, by Xue Wu, Kostas Tsioutsiouliklis. First submitted…
NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries, by Tao Wu, Chuhao Zhou, Yen Heng Wong,…
Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts, by Hazel…
A Graph-Based Approach for Conversational AI-Driven Personal Memory Capture and Retrieval in a Real-world Application, by…