Summary of HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly, by Howard Yen et al.
HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly by Howard Yen, Tianyu Gao, Minmin…
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations by Hadas Orgad,…
SteerDiff: Steering towards Safe Text-to-Image Diffusion Models by Hongxiang Zhang, Yifeng He, Hao Chen. First submitted to…
Curvature Diversity-Driven Deformation and Domain Alignment for Point Cloud by Mengxi Wu, Hao Huang, Yi Fang,…
Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization by Ryan C. Barron, Ves…
Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge by Jiayi Ye, Yanbo Wang, Yue Huang, Dongping Chen,…
Unified Multimodal Interleaved Document Representation for Retrieval by Jaewoo Lee, Joonho Ko, Jinheon Baek, Soyeong Jeong,…
AVG-LLaVA: A Large Multimodal Model with Adaptive Visual Granularity by Zhibin Lan, Liqiang Niu, Fandong Meng,…
BoViLA: Bootstrapping Video-Language Alignment via LLM-Based Self-Questioning and Answering by Jin Chen, Kaijing Ma, Haojian Huang,…
FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models by Zhipei Xu, Xuanyu…