Loading Now

Summary of Imagenet-d: Benchmarking Neural Network Robustness on Diffusion Synthetic Object, by Chenshuang Zhang et al.


ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object

by Chenshuang Zhang, Fei Pan, Junmo Kim, In So Kweon, Chengzhi Mao

First submitted to arxiv on: 27 Mar 2024

Categories

  • Main: Computer Vision and Pattern Recognition (cs.CV)
  • Secondary: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
The paper introduces a new benchmark for visual perception robustness, called ImageNet-D, which generates high-quality synthetic images with diversified backgrounds, textures, and materials. This benchmark is designed to challenge the accuracy of deep learning models, including popular vision models like ResNet, CLIP, and MiniGPT-4, by reducing their performance by up to 60%. The paper leverages diffusion models to generate these challenging images and shows that they are effective in testing vision models.
Low GrooveSquid.com (original content) Low Difficulty Summary
The researchers create a new benchmark for visual perception robustness called ImageNet-D. They use a special type of computer model called a “diffusion model” to make the pictures look more real. This helps test how well computers can understand what’s in pictures, even if they’re tricky or noisy. The results show that popular computer models are not very good at understanding these challenging images.

Keywords

* Artificial intelligence  * Deep learning  * Diffusion model  * Resnet