Summary of Large Language Model Benchmarks in Medical Tasks, by Lawrence K.q. Yan et al.
Large Language Model Benchmarks in Medical Tasksby Lawrence K.Q. Yan, Qian Niu, Ming Li, Yichao…
Large Language Model Benchmarks in Medical Tasksby Lawrence K.Q. Yan, Qian Niu, Ming Li, Yichao…
LLMCBench: Benchmarking Large Language Model Compression for Efficient Deploymentby Ge Yang, Changyi He, Jinyang Guo,…
Causal Interventions on Causal Paths: Mapping GPT-2’s Reasoning From Syntax to Semanticsby Isabelle Lee, Joshua…
CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chartby Bowen Zhao, Tianhao Cheng, Yuejie…
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion modelsby Yaopei Zeng, Yuanpu Cao, Bochuan Cao, Yurui…
Large Language Models for Manufacturingby Yiwei Li, Huaqin Zhao, Hanqi Jiang, Yi Pan, Zhengliang Liu,…
Estimating Causal Effects of Text Interventions Leveraging LLMsby Siyi Guo, Myrl G. Marmarelis, Fred Morstatter,…
Can Large Language Models Act as Symbolic Reasoners?by Rob Sullivan, Nelly ElsayedFirst submitted to arxiv…
Efficient Training of Sparse Autoencoders for Large Language Models via Layer Groupsby Davide Ghilardi, Federico…
Going Beyond H&E and Oncology: How Do Histopathology Foundation Models Perform for Multi-stain IHC and…