Summary of On the Role Of Activation Functions in Eeg-to-text Decoder, by Zenon Lamprou et al.
On the Role of Activation Functions in EEG-To-Text Decoderby Zenon Lamprou, Iakovos Tenedios, Yashar MoshfeghiFirst…
On the Role of Activation Functions in EEG-To-Text Decoderby Zenon Lamprou, Iakovos Tenedios, Yashar MoshfeghiFirst…
The Persian Rug: solving toy models of superposition using large-scale symmetriesby Aditya Cowsik, Kfir Dolev,…
ActNAS : Generating Efficient YOLO Models using Activation NASby Sudhakar Sah, Ravish Kumar, Darshan C.…
Non-convergence to global minimizers in data driven supervised deep learning: Adam and stochastic gradient descent…
Feature Averaging: An Implicit Bias of Gradient Descent Leading to Non-Robustness in Neural Networksby Binghui…
ReLU’s Revival: On the Entropic Overload in Normalization-Free Large Language Modelsby Nandan Kumar Jha, Brandon…
Looped ReLU MLPs May Be All You Need as Practical Programmable Computersby Yingyu Liang, Zhizhou…
Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Databy…
Provable Privacy Attacks on Trained Shallow Neural Networksby Guy Smorodinsky, Gal Vardi, Itay SafranFirst submitted…
On the Expressiveness of Multi-Neuron Convex Relaxationsby Yuhao Mao, Yani Zhang, Martin VechevFirst submitted to…