Summary of Ptt5-v2: a Closer Look at Continued Pretraining Of T5 Models For the Portuguese Language, by Marcos Piau et al.
ptt5-v2: A Closer Look at Continued Pretraining of T5 Models for the Portuguese Languageby Marcos…
ptt5-v2: A Closer Look at Continued Pretraining of T5 Models for the Portuguese Languageby Marcos…
Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentationby Pengfei Gu,…
Pandora: Towards General World Model with Natural Language Actions and Video Statesby Jiannan Xiang, Guangyi…
SViTT-Ego: A Sparse Video-Text Transformer for Egocentric Videoby Hector A. Valdez, Kyle Min, Subarna TripathiFirst…
Talking Heads: Understanding Inter-layer Communication in Transformer Language Modelsby Jack Merullo, Carsten Eickhoff, Ellie PavlickFirst…
Let’s Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversationby Se Jin Park, Chae Won…
CTC-based Non-autoregressive Textless Speech-to-Speech Translationby Qingkai Fang, Zhengrui Ma, Yan Zhou, Min Zhang, Yang FengFirst…
Argus: Benchmarking and Enhancing Vision-Language Models for 3D Radiology Report Generationby Che Liu, Zhongwei Wan,…
Robust Latent Representation Tuning for Image-text Classificationby Hao Sun, Yu SongFirst submitted to arxiv on:…
GOMAA-Geo: GOal Modality Agnostic Active Geo-localizationby Anindya Sarkar, Srikumar Sastry, Aleksis Pirinen, Chongjie Zhang, Nathan…