Cross attention – Page 10 – GrooveSquid.com

July 13, 2025

Cross-Attention Watermarking of Large Language Modelsby Folco Bertini Baldassini, Huy H. Nguyen, Ching-Chung Chang, Isao…

July 13, 2025

FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wildby Zhi-Song Liu, Robin Courant,…

July 13, 2025

VIMI: Vehicle-Infrastructure Multi-view Intermediate Fusion for Camera-based 3D Object Detectionby Zhe Wang, Siqi Fan, Xiaoliang…

July 13, 2025

An Attentive Dual-Encoder Framework Leveraging Multimodal Visual and Semantic Information for Automatic OSAHS Diagnosisby Yingchen…

July 13, 2025

UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generationby Lunhao Duan, Shanshan Zhao, Wenjun…

July 13, 2025

ObitoNet: Multimodal High-Resolution Point Cloud Reconstructionby Apoorv Thapliyal, Vinay Lanka, Swathi BaskaranFirst submitted to arxiv…

July 13, 2025

WiFi CSI Based Temporal Activity Detection via Dual Pyramid Networkby Zhendong Liu, Le Zhang, Bing…

July 13, 2025

A Full Transformer-based Framework for Automatic Pain Estimation using Videosby Stefanos Gkikas, Manolis TsiknakisFirst submitted…

July 13, 2025

Adaptive Prompt Tuning: Vision Guided Prompt Tuning with Cross-Attention for Fine-Grained Few-Shot Learningby Eric Brouwer,…

July 13, 2025

Efficient Scaling of Diffusion Transformers for Text-to-Image Generationby Hao Li, Shamit Lal, Zhiheng Li, Yusheng…