Summary of Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image Segmentation, by Zhe Dong et al.
Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image Segmentation by Zhe Dong, Yuzhe Sun, Yanfeng…
Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements by Jingyu Zhang, Ahmed Elgohary, Ahmed Magooda,…
Baichuan-Omni Technical Report by Yadong Li, Haoze Sun, Mingan Lin, Tianpeng Li, Guosheng Dong, Tao Zhang,…
GrabDAE: An Innovative Framework for Unsupervised Domain Adaptation Utilizing Grab-Mask and Denoise Auto-Encoder by Junzhou Chen,…
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization by Yougang Lyu, Lingyong Yan, Zihan Wang, Dawei…
Better Language Models Exhibit Higher Visual Alignment by Jona Ruthardt, Gertjan J. Burghouts, Serge Belongie, Yuki…
Exploring Efficient Foundational Multi-modal Models for Video Summarization by Karan Samel, Apoorva Beedu, Nitish Sontakke, Irfan…
Uncovering Factor Level Preferences to Improve Human-Model Alignment by Juhyun Oh, Eunsu Kim, Jiseon Kim, Wenda…
EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment by Yifei Xing, Xiangyuan Lan, Ruiping Wang,…
Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration by Xueyang Kang, Zhaoliang Luan,…