Summary of Watermarking Training Data Of Music Generation Models, by Pascal Epple et al.
Watermarking Training Data of Music Generation Modelsby Pascal Epple, Igor Shilov, Bozhidar Stevanoski, Yves-Alexandre de…
Watermarking Training Data of Music Generation Modelsby Pascal Epple, Igor Shilov, Bozhidar Stevanoski, Yves-Alexandre de…
From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessonsby Andrew Szot, Bogdan Mazoure, Omar…
Language-Guided Image Tokenization for Generationby Kaiwen Zha, Lijun Yu, Alireza Fathi, David A. Ross, Cordelia…
Enhancing Foundation Models for Time Series Forecasting via Wavelet-based Tokenizationby Luca Masserano, Abdul Fatir Ansari,…
LinVT: Empower Your Image-level Large Language Model to Understand Videosby Lishuai Gao, Yujie Zhong, Yingsen…
Efficient Long Video Tokenization via Coordinate-based Patch Reconstructionby Huiwon Jang, Sihyun Yu, Jinwoo Shin, Pieter…
Adaptive Length Image Tokenization via Recurrent Allocationby Shivam Duggal, Phillip Isola, Antonio Torralba, William T.…
Adapting Language Models via Token Translationby Zhili Feng, Tanya Marwah, Nicolo Fusi, David Alvarez-Melis, Lester…
MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW Compressionby Noel Elias, Homa Esfahanizadeh, Kaan…
xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMsby Michael…