Summary of Project SHADOW: Symbolic Higher-order Associative Deductive reasoning On Wikidata using LM probing, by Hanna Abi Akl
Project SHADOW: Symbolic Higher-order Associative Deductive reasoning On Wikidata using LM probing, by Hanna Abi Akl. First…
DHP Benchmark: Are LLMs Good NLG Evaluators? by Yicheng Wang, Jiayi Yuan, Yu-Neng Chuang, Zhuoer Wang,…
LLMs are Superior Feedback Providers: Bootstrapping Reasoning for Lie Detection with Self-Generated Feedback, by Tanushree Banerjee,…
No Dataset Needed for Downstream Knowledge Benchmarking: Response Dispersion Inversely Correlates with Accuracy on Domain-specific…
SimBench: A Rule-Based Multi-Turn Interaction Benchmark for Evaluating an LLM’s Ability to Generate Digital Twins, by…
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model, by Chunting Zhou, Lili…
MEGen: Generative Backdoor in Large Language Models via Model Editing, by Jiyang Qiu, Xinbei Ma, Zhuosheng…
QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning, by Yilun Kong, Hangyu Mao, Qi Zhao,…
Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken Dialogue Systems to Low-Resource User…
Importance Weighting Can Help Large Language Models Self-Improve, by Chunyang Jiang, Chi-min Chan, Wei Xue, Qifeng…