Summary of Cnn Mixture-of-depths, by Rinor Cakaj et al.
CNN Mixture-of-Depthsby Rinor Cakaj, Jens Mehnert, Bin YangFirst submitted to arxiv on: 25 Sep 2024CategoriesMain:…
CNN Mixture-of-Depthsby Rinor Cakaj, Jens Mehnert, Bin YangFirst submitted to arxiv on: 25 Sep 2024CategoriesMain:…
A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithmsby Ruihao Gong, Yifu Ding,…
Accelerating TinyML Inference on Microcontrollers through Approximate Kernelsby Giorgos Armeniakos, Georgios Mentzos, Dimitrios SoudrisFirst submitted…
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observabilityby Carlos E. Luis,…
A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge Intelligenceby Xin Yuan, Ning Li, Quan…
AlignedKV: Reducing Memory Access of KV-Cache with Precision-Aligned Quantizationby Yifan Tan, Haoze Wang, Chao Yan,…
Functional Stochastic Gradient MCMC for Bayesian Neural Networksby Mengjing Wu, Junyu Xuan, Jie LuFirst submitted…
Towards Representation Learning for Weighting Problems in Design-Based Causal Inferenceby Oscar Clivio, Avi Feller, Chris…
Edge-device Collaborative Computing for Multi-view Classificationby Marco Palena, Tania Cerquitelli, Carla Fabiana ChiasseriniFirst submitted to…
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Expertsby Xiaoming Shi, Shiyu Wang, Yuqi…