Summary of Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models, by Tongtong Feng et al.
Multi-weather Cross-view Geo-localization Using Denoising Diffusion Modelsby Tongtong Feng, Qing Li, Xin Wang, Mingzi Wang,…
Multi-weather Cross-view Geo-localization Using Denoising Diffusion Modelsby Tongtong Feng, Qing Li, Xin Wang, Mingzi Wang,…
Modelling Visual Semantics via Image Captioning to extract Enhanced Multi-Level Cross-Modal Semantic Incongruity Representation with…
LAM3D: Leveraging Attention for Monocular 3D Object Detectionby Diana-Alexandra Sas, Leandro Di Bella, Yangxintong Lyu,…
From Attributes to Natural Language: A Survey and Foresight on Text-based Person Re-identificationby Fanzhi Jiang,…
Transfer Learning for Wildlife Classification: Evaluating YOLOv8 against DenseNet, ResNet, and VGGNet on a Custom…
A Unified Graph Transformer for Overcoming Isolations in Multi-modal Recommendationby Zixuan Yi, Iadh OunisFirst submitted…
Appformer: A Novel Framework for Mobile App Usage Prediction Leveraging Progressive Multi-Modal Data Fusion and…
Official-NV: An LLM-Generated News Video Dataset for Multimodal Fake News Detectionby Yihao Wang, Lizhi Chen,…
SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognitionby Wenbo Huang, Jinghui…
YOLO-pdd: A Novel Multi-scale PCB Defect Detection Method Using Deep Representations with Sequential Imagesby Bowen…