Pretraining – Page 8 – GrooveSquid.com

Loading Now

July 13, 2025

Summary of Ptt5-v2: a Closer Look at Continued Pretraining Of T5 Models For the Portuguese Language, by Marcos Piau et al.

ptt5-v2: A Closer Look at Continued Pretraining of T5 Models for the Portuguese Languageby Marcos…

July 13, 2025

Summary of Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders For 3d Medical Image Segmentation, by Pengfei Gu et al.

Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentationby Pengfei Gu,…

July 13, 2025

Summary of Pandora: Towards General World Model with Natural Language Actions and Video States, by Jiannan Xiang et al.

Pandora: Towards General World Model with Natural Language Actions and Video Statesby Jiannan Xiang, Guangyi…

July 13, 2025

Summary of Svitt-ego: a Sparse Video-text Transformer For Egocentric Video, by Hector A. Valdez and Kyle Min and Subarna Tripathi

SViTT-Ego: A Sparse Video-Text Transformer for Egocentric Videoby Hector A. Valdez, Kyle Min, Subarna TripathiFirst…

July 13, 2025

Summary of Talking Heads: Understanding Inter-layer Communication in Transformer Language Models, by Jack Merullo et al.

Talking Heads: Understanding Inter-layer Communication in Transformer Language Modelsby Jack Merullo, Carsten Eickhoff, Ellie PavlickFirst…

July 13, 2025

Summary of Let’s Go Real Talk: Spoken Dialogue Model For Face-to-face Conversation, by Se Jin Park et al.

Let’s Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversationby Se Jin Park, Chae Won…

July 13, 2025

Summary of Argus: Benchmarking and Enhancing Vision-language Models For 3d Radiology Report Generation, by Che Liu et al.

Argus: Benchmarking and Enhancing Vision-Language Models for 3D Radiology Report Generationby Che Liu, Zhongwei Wan,…

July 13, 2025

Summary of Ctc-based Non-autoregressive Textless Speech-to-speech Translation, by Qingkai Fang et al.

CTC-based Non-autoregressive Textless Speech-to-Speech Translationby Qingkai Fang, Zhengrui Ma, Yan Zhou, Min Zhang, Yang FengFirst…

July 13, 2025

Summary of Robust Latent Representation Tuning For Image-text Classification, by Hao Sun and Yu Song

Robust Latent Representation Tuning for Image-text Classificationby Hao Sun, Yu SongFirst submitted to arxiv on:…

July 13, 2025

Summary of Gomaa-geo: Goal Modality Agnostic Active Geo-localization, by Anindya Sarkar et al.

GOMAA-Geo: GOal Modality Agnostic Active Geo-localizationby Anindya Sarkar, Srikumar Sastry, Aleksis Pirinen, Chongjie Zhang, Nathan…