Summary of V”mean”ba: Visual State Space Models Only Need 1 Hidden Dimension, by Tien-yu Chi et al.
V“Mean”ba: Visual State Space Models only need 1 hidden dimensionby Tien-Yu Chi, Hung-Yueh Chiang, Chi-Chih…
V“Mean”ba: Visual State Space Models only need 1 hidden dimensionby Tien-Yu Chi, Hung-Yueh Chiang, Chi-Chih…
Do Multimodal Language Models Really Understand Direction? A Benchmark for Compass Direction Reasoningby Hang Yin,…
Automated Bleeding Detection and Classification in Wireless Capsule Endoscopy with YOLOv8-Xby Pavan C Shekar, Vivek…
A Systems Thinking Approach to Algorithmic Fairnessby Chris LamFirst submitted to arxiv on: 21 Dec…
TimeRAG: BOOSTING LLM Time Series Forecasting via Retrieval-Augmented Generationby Silin Yang, Dong Wang, Haoqi Zheng,…
L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compressionby Junxuan Zhang, Zhengxue Cheng, Yan Zhao,…
Internalized Self-Correction for Large Language Modelsby Nishanth Upadhyaya, Raghavendra SridharamurthyFirst submitted to arxiv on: 21…
PB-UAP: Hybrid Universal Adversarial Attack For Image Segmentationby Yufei Song, Ziqi Zhou, Minghui Li, Xianlong…
Generalizable Articulated Object Perception with Superpointsby Qiaojun Yu, Ce Hao, Xibin Yuan, Li Zhang, Liu…
Adversarial Attack Against Images Classification based on Generative Adversarial Networksby Yahe YangFirst submitted to arxiv…