Summary of Graph Neural Networks with Configuration Cross-attention For Tensor Compilers, by Dmitrii Khizbullin et al.
Graph neural networks with configuration cross-attention for tensor compilersby Dmitrii Khizbullin, Eduardo Rocha de Andrade,…
Graph neural networks with configuration cross-attention for tensor compilersby Dmitrii Khizbullin, Eduardo Rocha de Andrade,…
A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Expertsby Mohammed Nowaz Rabbani Chowdhury,…
Zamba: A Compact 7B SSM Hybrid Modelby Paolo Glorioso, Quentin Anthony, Yury Tokpanov, James Whittington,…
Variance-Reducing Couplings for Random Featuresby Isaac Reid, Stratis Markou, Krzysztof Choromanski, Richard E. Turner, Adrian…
Efficient Probabilistic Modeling of Crystallization at Mesoscopic Scaleby Pol Timmer, Koen Minartz, Vlado MenkovskiFirst submitted…
Node Identifiers: Compact, Discrete Representations for Efficient Graph Learningby Yuankai Luo, Hongkang Li, Qijiong Liu,…
A Slices Perspective for Incremental Nonparametric Inference in High Dimensional State Spacesby Moshe Shienman, Ohad…
BOLD: Boolean Logic Deep Learningby Van Minh Nguyen, Cristian Ocampo, Aymen Askri, Louis Leconte, Ba-Hien…
SLoPe: Double-Pruned Sparse Plus Lazy Low-Rank Adapter Pretraining of LLMsby Mohammad Mozaffari, Amir Yazdanbakhsh, Zhao…
Noisy Data Meets Privacy: Training Local Models with Post-Processed Remote Queriesby Kexin Li, Aastha Mehta,…