Summary of Tackling Structural Hallucination in Image Translation with Local Diffusion, by Seunghoi Kim et al.

Tackling Structural Hallucination in Image Translation with Local Diffusion

by Seunghoi Kim, Chen Jin, Tom Diethe, Matteo Figini, Henry F. J. Tregidgo, Asher Mullokandov, Philip Teare, Daniel C. Alexander

First submitted to arxiv on: 9 Apr 2024

GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty	Written by	Summary
High	Paper authors	High Difficulty Summary Read the original abstract here
Medium	GrooveSquid.com (original content)	Medium Difficulty Summary Medium Difficulty summary: Recent advancements in diffusion models have significantly improved conditioned image generation; however, they struggle to reconstruct out-of-distribution (OOD) images, such as unseen tumors in medical images, which can lead to “image hallucination” and misdiagnosis. We propose a training-free diffusion framework that reduces hallucination by using multiple Local Diffusion processes, involving OOD estimation followed by a “branching” module for generating local predictions and a “fusion” module for integrating them into one. Our evaluation demonstrates the effectiveness of our method in mitigating hallucinations over baseline models quantitatively and qualitatively, reducing misdiagnosis rates by 40% and 25% in medical and natural image datasets, respectively. This approach is compatible with various pre-trained diffusion models.
Low	GrooveSquid.com (original content)	Low Difficulty Summary Low Difficulty summary: Imagine trying to generate a new image of something you’ve never seen before, like a tumor in an X-ray. Computers are getting better at doing this, but sometimes they make mistakes by creating fake images instead. We want to fix this problem so doctors can get accurate diagnoses. To do this, we developed a new way of using computer models that generates multiple possible images and then combines them into one realistic image. This helps reduce the number of wrong predictions made by these computers, which is important for medical imaging.

Keywords

» Artificial intelligence » Diffusion » Hallucination » Image generation

Tackling Structural Hallucination in Image Translation with Local Diffusion

by Seunghoi Kim, Chen Jin, Tom Diethe, Matteo Figini, Henry F. J. Tregidgo, Asher Mullokandov, Philip Teare, Daniel C. Alexander

Categories

GrooveSquid.com Paper Summaries

Keywords

Summary of A Cyber Manufacturing Iot System For Adaptive Machine Learning Model Deployment by Interactive Causality Enabled Self-labeling, By Yutian Ren et al.

Summary of Pgtnet: a Process Graph Transformer Network For Remaining Time Prediction Of Business Process Instances, by Keyvan Amiri Elyasi et al.

Related Posts