Summary of On the Robustness Of Language Models For Tabular Question Answering, by Kushal Raj Bhandari et al.
On the Robustness of Language Models for Tabular Question Answeringby Kushal Raj Bhandari, Sixue Xing,…
On the Robustness of Language Models for Tabular Question Answeringby Kushal Raj Bhandari, Sixue Xing,…
Problem-Solving in Language Model Networksby Ciaran Regan, Alexandre Gournail, Mizuki OkaFirst submitted to arxiv on:…
MedCalc-Bench: Evaluating Large Language Models for Medical Calculationsby Nikhil Khandekar, Qiao Jin, Guangzhi Xiong, Soren…
Program Synthesis Benchmark for Visual Programming in XLogoOnline Environmentby Chao Wen, Jacqueline Staub, Adish SinglaFirst…
TRACE the Evidence: Constructing Knowledge-Grounded Reasoning Chains for Retrieval-Augmented Generationby Jinyuan Fang, Zaiqiao Meng, Craig…
Context Graphby Chengjin Xu, Muzhi Li, Cehao Yang, Xuhui Jiang, Lumingyuan Tang, Yiyan Qi, Jian…
Balancing Rigor and Utility: Mitigating Cognitive Biases in Large Language Models for Multiple-Choice Questionsby Liman…
HiddenTables & PyQTax: A Cooperative Game and Dataset For TableQA to Ensure Scale and Data…
CHiSafetyBench: A Chinese Hierarchical Safety Benchmark for Large Language Modelsby Wenjing Zhang, Xuejiao Lei, Zhaoxiang…
Efficient Prompting for LLM-based Generative Internet of Thingsby Bin Xiao, Burak Kantarci, Jiawen Kang, Dusit…