Summary of Linear Transformers with Learnable Kernel Functions Are Better In-Context Models, by Yaroslav Aksenov et al.
Linear Transformers with Learnable Kernel Functions are Better In-Context Models, by Yaroslav Aksenov, Nikita Balagansky, Sofia…
LoRA-drop: Efficient LoRA Parameter Pruning based on Output Evaluation, by Hongyun Zhou, Xiangyu Lu, Wang Xu,…
Learn To be Efficient: Build Structured Sparsity in Large Language Models, by Haizhong Zheng, Xiaoyan Bai,…
BlackMamba: Mixture of Experts for State-Space Models, by Quentin Anthony, Yury Tokpanov, Paolo Glorioso, Beren Millidge. First…
Engineering A Large Language Model From Scratch, by Abiodun Finbarrs Oketunji. First submitted to arxiv on: 30…
Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models, by Erik Arakelyan, Zhaoqi Liu,…
How Can Large Language Models Understand Spatial-Temporal Data? by Lei Liu, Shuo Yu, Runze Wang, Zhenxun…
Learning Shortcuts: On the Misleading Promise of NLU in Language Models, by Geetanjali Bihani, Julia Taylor…
Machine Translation with Large Language Models: Prompt Engineering for Persian, English, and Russian Directions, by Nooshin…
We Need to Talk About Classification Evaluation Metrics in NLP, by Peter Vickers, Loïc Barrault, Emilio…