Summary of Hero: Human-feedback Efficient Reinforcement Learning For Online Diffusion Model Finetuning, by Ayano Hiranaka et al.
HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuningby Ayano Hiranaka, Shang-Fu Chen, Chieh-Hsin…