Summary of Multi-scale Temporal Difference Transformer For Video-text Retrieval, by Ni Wang et al.
Multi-Scale Temporal Difference Transformer for Video-Text Retrievalby Ni Wang, Dongliang Liao, Xing XuFirst submitted to…
Multi-Scale Temporal Difference Transformer for Video-Text Retrievalby Ni Wang, Dongliang Liao, Xing XuFirst submitted to…
RouteFinder: Towards Foundation Models for Vehicle Routing Problemsby Federico Berto, Chuanbo Hua, Nayeli Gast Zepeda,…
Infusing clinical knowledge into tokenisers for language modelsby Abul Hasan, Jinge Wu, Quang Ngoc Nguyen,…
Exploring Spatial Representations in the Historical Lake District Texts with LLM-based Relation Extractionby Erum Haris,…
A Pure Transformer Pretraining Framework on Text-attributed Graphsby Yu Song, Haitao Mao, Jiachen Xiao, Jingzhe…
Federating to Grow Transformers with Constrained Resources without Model Sharingby Shikun Shen, Yifei Zou, Yuan…
Enhancing Visible-Infrared Person Re-identification with Modality- and Instance-aware Visual Prompt Learningby Ruiqi Wu, Bingliang Jiao,…
MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformersby Yiwen Chen, Tong He, Di Huang, Weicai Ye,…
DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformerby Wei-Ting Chen, Gurunandan…
Deep Transformer Network for Monocular Pose Estimation of Ship-Based UAVby Maneesha Wickramasuriya, Taeyoung Lee, Murray…