Summary of Team Ryu’s Submission to Sigmorphon 2024 Shared Task on Subword Tokenization, by Zilong Li
Team Ryu’s Submission to SIGMORPHON 2024 Shared Task on Subword Tokenizationby Zilong LiFirst submitted to…
Team Ryu’s Submission to SIGMORPHON 2024 Shared Task on Subword Tokenizationby Zilong LiFirst submitted to…
AutoTrain: No-code training for state-of-the-art modelsby Abhishek ThakurFirst submitted to arxiv on: 21 Oct 2024CategoriesMain:…
GlitchMiner: Mining Glitch Tokens in Large Language Models via Gradient-based Discrete Optimizationby Zihui Wu, Haichang…
Efficient Vision-Language Models by Summarizing Visual Tokens into Compact Registersby Yuxin Wen, Qingqing Cao, Qichen…
Scaled and Inter-token Relation Enhanced Transformer for Sample-restricted Residential NILMby Minhajur Rahman, Yasir ArafatFirst submitted…
Language Models as Semiotic Machines: Reconceptualizing AI Language Systems through Structuralist and Post-Structuralist Theories of…
PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinkingby Markus J.…
Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspectiveby Yongxin Zhu, Bocheng Li,…
Large-scale cloze evaluation reveals that token prediction tasks are neither lexically nor semantically alignedby Cassandra…
Characterizing Model Collapse in Large Language Models Using Semantic Networks and Next-Token Probabilityby Daniele Gambetta,…