Summary of How Does Data Diversity Shape the Weight Landscape of Neural Networks?, by Yang Ba et al.
How Does Data Diversity Shape the Weight Landscape of Neural Networks? by Yang Ba, Michelle V.…
ALLoRA: Adaptive Learning Rate Mitigates LoRA Fatal Flaws by Hai Huang, Randall Balestriero First submitted to arxiv…
Efficient Hyperparameter Importance Assessment for CNNs by Ruinan Wang, Ian Nabney, Mohammad Golbabaee First submitted to arxiv…
DeltaDQ: Ultra-High Delta Compression for Fine-Tuned LLMs via Group-wise Dropout and Separate Quantization by Yanfeng Jiang,…
Uncertainty estimation via ensembles of deep learning models and dropout layers for seismic traces by Giovanni…
Evaluating the Generalization Ability of Spatiotemporal Model in Urban Scenario by Hongjun Wang, Jiyuan Chen, Tong…
House of Cards: Massive Weights in LLMs by Jaehoon Oh, Seungjun Shin, Dokwan Oh First submitted to…
Investigating the Synergistic Effects of Dropout and Residual Connections on Language Model Training by Qingyang Li,…
Spectral Wavelet Dropout: Regularization in the Wavelet Domain by Rinor Cakaj, Jens Mehnert, Bin Yang First submitted…
Stochastic Subsampling With Average Pooling by Bum Jun Kim, Sang Woo Kim First submitted to arxiv on:…