Summary of Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs, by Kai Han et al.
Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs by Kai Han, Jianyuan Guo, Yehui…
Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting by Yifan Luo, Zhennan Zhou, Meitan Wang, Bin Dong First…
ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization by Jiawei Liu, Fanrui…
The Future of Learning in the Age of Generative AI: Automated Question Generation and Assessment…
Recent advancements in LLM Red-Teaming: Techniques, Defenses, and Ethical Considerations by Tarun Raheja, Nilay Pochhi, F.D.C.M.…
VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking by Zekun Qian, Ruize Han, Junhui…
Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image Segmentation by Zhe Dong, Yuzhe Sun, Yanfeng…
KV Prediction for Improved Time to First Token by Maxwell Horton, Qingqing Cao, Chenfan Sun, Yanzi…
Self-Boosting Large Language Models with Synthetic Preference Data by Qingxiu Dong, Li Dong, Xingxing Zhang, Zhifang…
PAR: Prompt-Aware Token Reduction Method for Efficient Large Multimodal Models by Yingen Liu, Fan Wu, Ruihui…