Summary of A Spark Of Vision-language Intelligence: 2-dimensional Autoregressive Transformer For Efficient Finegrained Image Generation, by Liang Chen et al.
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generationby Liang Chen,…