Summary of Batching BPE Tokenization Merges, by Alexander P. Morgan
Batching BPE Tokenization Merges by Alexander P. Morgan. First submitted to arXiv on: 5 Aug 2024. Categories: Main: Computation…
Bilingual Adaptation of Monolingual Foundation Models by Gurpreet Gosal, Yishi Xu, Gokul Ramakrishnan, Rituraj Joshi, Avraham…
BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space by Yumeng Zhang,…
Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation by Zhibin Lan, Liqiang Niu, Fandong Meng, Jie…
OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving by Lening Wang, Wenzhao Zheng,…
Super Tiny Language Models by Dylan Hillier, Leon Guertler, Cheston Tan, Palaash Agrawal, Chen Ruirui, Bobby…
Vikhr: Constructing a State-of-the-art Bilingual Open-Source Instruction-Following Large Language Model for Russian by Aleksandr Nikolich, Konstantin…
IGOT: Information Gain Optimized Tokenizer on Domain Adaptive Pretraining by Dawei Feng, Yihai Zhang, Zhixuan Xu. First…
Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge by Khuyagbaatar Batsuren, Ekaterina Vylomova, Verna…
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence by Bo Peng, Daniel Goldstein, Quentin…