Summary of dMel: Speech Tokenization Made Simple, by He Bai et al.
dMel: Speech Tokenization made Simple, by He Bai, Tatiana Likhomanenko, Ruixiang Zhang, Zijin Gu, Zakaria Aldeneh,…
Leveraging Large Language Models to Geolocate Linguistic Variations in Social Media Posts, by Davide Savarro, Davide…
Self-training Room Layout Estimation via Geometry-aware Ray-casting, by Bolivar Solarte, Chin-Hsuan Wu, Jin-Cheng Jhang, Jonathan Lee,…
MedSAGa: Few-shot Memory Efficient Medical Image Segmentation using Gradient Low-Rank Projection in SAM, by Navyansh Mahla,…
Multi-Agent Causal Discovery Using Large Language Models, by Hao Duong Le, Xin Xia, Zhang Chen. First submitted…
Rethinking Feature Backbone Fine-tuning for Remote Sensing Object Detection, by Yechan Kim, JongHyun Park, SooYeon Kim,…
DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer, by Jinfeng Wei, Xiaofeng Zhang. First submitted…
Distilling Vision-Language Foundation Models: A Data-Free Approach via Prompt Diversification, by Yunyi Xuan, Weijie Chen, Shicai…
ReAttention: Training-Free Infinite Context with Finite Attention Scope, by Xiaoran Liu, Ruixiao Li, Qipeng Guo, Zhigeng…
Assessing Brittleness of Image-Text Retrieval Benchmarks from Vision-Language Models Perspective, by Mariya Hendriksen, Shuo Zhang, Ridho…