Summary of B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens, by Zhuqiang Lu et al.
B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens by Zhuqiang Lu, Zhenfei Yin, Mengwei…
Enhancing Nursing and Elderly Care with Large Language Models: An AI-Driven Framework by Qiao Sun, Jiexin…
SUMI-IFL: An Information-Theoretic Framework for Image Forgery Localization with Sufficiency and Minimality Constraints by Ziqi Sheng,…
Small Language Model as Data Prospector for Large Language Model by Shiwen Ni, Haihong Wu, Di…
Visual Object Tracking across Diverse Data Modalities: A Review by Mengmeng Wang, Teli Ma, Shuo Xin,…
TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views by Liang Zhao, Zehan Bao, Yi…
Large Action Models: From Inception to Implementation by Lu Wang, Fangkai Yang, Chaoyun Zhang, Junting Lu,…
Data Pruning Can Do More: A Comprehensive Data Pruning Approach for Object Re-identification by Zi Yang,…
GAOKAO-Eval: Does high scores truly reflect strong capabilities in LLMs? by Zhikai Lei, Tianyi Liang, Hanglei…
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector by Zhensheng Wang, Wenmian…