Recall – Page 13 – GrooveSquid.com

July 13, 2025

Generative Retrieval with Large Language Models by Ye Wang, Xinrun Xu, Rui Xie, Wenxin Hu, Wei…

July 13, 2025

Memory GAPS: Would LLMs pass the Tulving Test? by Jean-Marie Chauvet. First submitted to arXiv on: 26…

July 13, 2025

RefuteBench: Evaluating Refuting Instruction-Following for Large Language Models by Jianhao Yan, Yun Luo, Yue Zhang. First submitted…

July 13, 2025

Identifying Semantic Induction Heads to Understand In-Context Learning by Jie Ren, Qipeng Guo, Hang Yan, Dongrui…

July 13, 2025

Where is the answer? Investigating Positional Bias in Language Model Knowledge Extraction by Kuniaki Saito, Kihyuk…

July 13, 2025

Navigating the Dual Facets: A Comprehensive Evaluation of Sequential Memory Editing in Large Language Models by…

July 13, 2025

GenRES: Rethinking Evaluation for Generative Relation Extraction in the Era of Large Language Models by Pengcheng…

July 13, 2025

HGOT: Hierarchical Graph of Thoughts for Retrieval-Augmented In-Context Learning in Factuality Evaluation by Yihao Fang, Stephen…

July 13, 2025

Comparing Knowledge Sources for Open-Domain Scientific Claim Verification by Juraj Vladika, Florian Matthes. First submitted to arXiv…

July 13, 2025

MULTI: Multimodal Understanding Leaderboard with Text and Images by Zichen Zhu, Yang Xu, Lu Chen, Jingkai…