Summary of Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers, by Sukjun Hwang et al.
Hydra: Bidirectional State Space Models Through Generalized Matrix Mixersby Sukjun Hwang, Aakash Lahoti, Tri Dao,…
Hydra: Bidirectional State Space Models Through Generalized Matrix Mixersby Sukjun Hwang, Aakash Lahoti, Tri Dao,…
Identification of emotions on Twitter during the 2022 electoral process in Colombiaby Juan Jose Iguaran…
Convolutional vs Large Language Models for Software Log Classification in Edge-Deployable Cellular Network Testingby Achintha…
QET: Enhancing Quantized LLM Parameters and KV cache Compression through Element Substitution and Residual Clusteringby…
MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Modelsby Ying Zhang, Ziheng Yang, Shufan JiFirst submitted…
Croppable Knowledge Graph Embeddingby Yushan Zhu, Wen Zhang, Zhiqiang Liu, Mingyang Chen, Lei Liang, Huajun…
Extracting and Encoding: Leveraging Large Language Models and Medical Knowledge to Enhance Radiological Text Representationby…
Analyzing Persuasive Strategies in Meme Texts: A Fusion of Language Models with Paraphrase Enrichmentby Kota…
BISeizuRe: BERT-Inspired Seizure Data Representation to Improve Epilepsy Monitoringby Luca Benfenati, Thorir Mar Ingolfsson, Andrea…
ResumeAtlas: Revisiting Resume Classification with Large-Scale Datasets and Large Language Modelsby Ahmed Heakl, Youssef Mohamed,…