Summary of MLLMReID: Multimodal Large Language Model-based Person Re-identification, by Shan Yang et al.
MLLMReID: Multimodal Large Language Model-based Person Re-identification by Shan Yang, Yongfei Zhang. First submitted to arXiv on:…
Using Large Language Model for End-to-End Chinese ASR and NER by Yuang Li, Jiawei Yu, Min…
Zoom-shot: Fast and Efficient Unsupervised Zero-Shot Transfer of CLIP to Vision Encoders with Multimodal Loss by…
Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery by Beilei Cui, Mobarakol…
Exploiting Data Hierarchy as a New Modality for Contrastive Learning by Arjun Bhalla, Daniel Levenson, Jan…
MAMI: Multi-Attentional Mutual-Information for Long Sequence Neuron Captioning by Alfirsa Damasyifa Fauzulhaq, Wahyu Parwitayasa, Joseph Ananda…
Graph-attention-based Casual Discovery with Trust Region-navigated Clipping Policy Optimization by Shixuan Liu, Yanghe Feng, Keyu Wu,…
Improving Generative Pre-Training: An In-depth Study of Masked Image Modeling and Denoising Models by Hyesong Choi,…
An Attentive Dual-Encoder Framework Leveraging Multimodal Visual and Semantic Information for Automatic OSAHS Diagnosis by Yingchen…
Unveiling the Threat of Fraud Gangs to Graph Neural Networks: Multi-Target Graph Injection Attacks Against…