Summary of Videoqa-sc: Adaptive Semantic Communication For Video Question Answering, by Jiangyuan Guo et al.
VideoQA-SC: Adaptive Semantic Communication for Video Question Answeringby Jiangyuan Guo, Wei Chen, Yuxuan Sun, Jialong…
VideoQA-SC: Adaptive Semantic Communication for Video Question Answeringby Jiangyuan Guo, Wei Chen, Yuxuan Sun, Jialong…
Disentangling Knowledge-based and Visual Reasoning by Question Decomposition in KB-VQAby Elham J. Barezi, Parisa KordjamshidiFirst…
Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and…
Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QAby Minzheng Wang, Longze Chen,…
PKU-SafeRLHF: Towards Multi-Level Safety Alignment for LLMs with Human Preferenceby Jiaming Ji, Donghai Hong, Borong…
TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation Learningby Nemin Wu, Qian Cao,…
HCQA @ Ego4D EgoSchema Challenge 2024by Haoyu Zhang, Yuquan Xie, Yisen Feng, Zaijing Li, Meng…
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMsby Ziyan Jiang, Xueguang Ma, Wenhu ChenFirst submitted to…
Towards Retrieval Augmented Generation over Large Video Librariesby Yannis Tevissen, Khalil Guetari, Frédéric PetitpontFirst submitted…
A Learn-Then-Reason Model Towards Generalization in Knowledge Base Question Answeringby Lingxi Zhang, Jing Zhang, Yanling…