Summary of Tokenflow: Unified Image Tokenizer For Multimodal Understanding and Generation, by Liao Qu et al.
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generationby Liao Qu, Huichao Zhang, Yiheng Liu,…
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generationby Liao Qu, Huichao Zhang, Yiheng Liu,…
Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmarkby Haidong Xu, Meishan Zhang,…
Scalable Image Tokenization with Index Backpropagation Quantizationby Fengyuan Shi, Zhuoyan Luo, Yixiao Ge, Yujiu Yang,…
RandAR: Decoder-only Autoregressive Visual Generation in Random Ordersby Ziqi Pang, Tianyuan Zhang, Fujun Luan, Yunze…
Playable Game Generationby Mingyu Yang, Junyou Li, Zhongbin Fang, Sheng Chen, Yangbin Yu, Qiang Fu,…
Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Headsby Siqi Kou, Jiachun Jin, Chang Liu, Ye…
EADReg: Probabilistic Correspondence Generation with Efficient Autoregressive Diffusion Model for Outdoor Point Cloud Registrationby Linrui…
Bi-Mamba: Towards Accurate 1-Bit State Space Modelsby Shengkun Tang, Liqun Ma, Haonan Li, Mingjie Sun,…
A Survey on Vision Autoregressive Modelby Kai Jiang, Jiaxing HuangFirst submitted to arxiv on: 13…
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generationby Yiyang Ma, Xingchao…