Summary of Bilateral Sharpness-Aware Minimization for Flatter Minima, by Jiaxin Deng et al.
Bilateral Sharpness-Aware Minimization for Flatter Minima by Jiaxin Deng, Junbiao Pang, Baochang Zhang, Qingming Huang. First submitted…
ConvLSTMTransNet: A Hybrid Deep Learning Approach for Internet Traffic Telemetry by Sajal Saha, Saikat Das, Glaucio…
Diabetica: Adapting Large Language Model to Enhance Multiple Medical Tasks in Diabetes Care and Management by…
How the (Tensor-) Brain uses Embeddings and Embodiment to Encode Senses and Symbols by Volker Tresp,…
Enhancing E-commerce Product Title Translation with Retrieval-Augmented Generation and Large Language Models by Bryan Zhang, Taichi…
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization by Mohammad Samragh, Iman Mirzadeh,…
Evaluating Defences against Unsafe Feedback in RLHF by Domenic Rosati, Giles Edkins, Harsh Raj, David Atanasov,…
Universal approximation theorem for neural networks with inputs from a topological vector space by Vugar Ismailov. First…
Training Language Models to Self-Correct via Reinforcement Learning by Aviral Kumar, Vincent Zhuang, Rishabh Agarwal, Yi…
Revisiting Semi-supervised Adversarial Robustness via Noise-aware Online Robust Distillation by Tsung-Han Wu, Hung-Ting Su, Shang-Tse Chen,…