Summary of Where Is the Signal in Tokenization Space?, by Renato Lui Geh and Honghua Zhang and Kareem Ahmed and Benjie Wang and Guy Van Den Broeck
Where is the signal in tokenization space?by Renato Lui Geh, Honghua Zhang, Kareem Ahmed, Benjie…
Where is the signal in tokenization space?by Renato Lui Geh, Honghua Zhang, Kareem Ahmed, Benjie…
JPEG-LM: LLMs as Image Generators with Canonical Codec Representationsby Xiaochuang Han, Marjan Ghazvininejad, Pang Wei…
Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inferenceby Rohan Baskar Prabhakar, Hengrui Zhang, David WentzlaffFirst…
Exchangeable Sequence Models Quantify Uncertainty Over Latent Conceptsby Naimeng Ye, Hongseok NamkoongFirst submitted to arxiv…
UniMoT: Unified Molecule-Text Language Model with Discrete Token Representationby Juzheng Zhang, Yatao Bian, Yongqiang Chen,…
On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognitionby…
Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelinesby Yuchen Li,…
Flusion: Integrating multiple data sources for accurate influenza predictionsby Evan L. Ray, Yijin Wang, Russell…
QT-TDM: Planning With Transformer Dynamics Model and Autoregressive Q-Learningby Mostafa Kotb, Cornelius Weber, Muhammad Burhan…
Reorganizing attention-space geometry with expressive attentionby Claudius GrosFirst submitted to arxiv on: 26 Jul 2024CategoriesMain:…