Summary of L3tc: Leveraging Rwkv For Learned Lossless Low-complexity Text Compression, by Junxuan Zhang et al.
L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compressionby Junxuan Zhang, Zhengxue Cheng, Yan Zhao,…
L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compressionby Junxuan Zhang, Zhengxue Cheng, Yan Zhao,…
SweetTok: Semantic-Aware Spatial-Temporal Tokenizer for Compact Video Discretizationby Zhentao Tan, Ben Xue, Jian Jia, Junhao…
From Language Models over Tokens to Language Models over Charactersby Tim Vieira, Ben LeBrun, Mario…
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generationby Liao Qu, Huichao Zhang, Yiheng Liu,…
Scaling Image Tokenizers with Grouped Spherical Quantizationby Jiangtao Wang, Zhen Qin, Yifan Zhang, Vincent Tao…
Arabic-Nougat: Fine-Tuning Vision Transformers for Arabic OCR and Markdown Extractionby Mohamed RashadFirst submitted to arxiv…
UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editingby Yiheng Li, Ruibing…
Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languagesby S. Tamang, D. J.…
LARP: Tokenizing Videos with a Learned Autoregressive Generative Priorby Hanyu Wang, Saksham Suri, Yixuan Ren,…
Deep Learning Based Dense Retrieval: A Comparative Studyby Ming Zhong, Zhizhi Wu, Nanako HondaFirst submitted…