Summary of Tokenflow: Unified Image Tokenizer For Multimodal Understanding and Generation, by Liao Qu et al.
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generationby Liao Qu, Huichao Zhang, Yiheng Liu,…
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generationby Liao Qu, Huichao Zhang, Yiheng Liu,…
Scalable Image Tokenization with Index Backpropagation Quantizationby Fengyuan Shi, Zhuoyan Luo, Yixiao Ge, Yujiu Yang,…
Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmarkby Haidong Xu, Meishan Zhang,…
RandAR: Decoder-only Autoregressive Visual Generation in Random Ordersby Ziqi Pang, Tianyuan Zhang, Fujun Luan, Yunze…
Playable Game Generationby Mingyu Yang, Junyou Li, Zhongbin Fang, Sheng Chen, Yangbin Yu, Qiang Fu,…
Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Headsby Siqi Kou, Jiachun Jin, Chang Liu, Ye…
EADReg: Probabilistic Correspondence Generation with Efficient Autoregressive Diffusion Model for Outdoor Point Cloud Registrationby Linrui…
Bi-Mamba: Towards Accurate 1-Bit State Space Modelsby Shengkun Tang, Liqun Ma, Haonan Li, Mingjie Sun,…
A Survey on Vision Autoregressive Modelby Kai Jiang, Jiaxing HuangFirst submitted to arxiv on: 13…
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generationby Yiyang Ma, Xingchao…