Summary of Simplified and Generalized Masked Diffusion For Discrete Data, by Jiaxin Shi et al.
Simplified and Generalized Masked Diffusion for Discrete Data by Jiaxin Shi, Kehang Han, Zhe Wang, Arnaud…
Predictive Uncertainty Quantification for Bird’s Eye View Segmentation: A Benchmark and Novel Loss Function by Linlin…
Understanding Encoder-Decoder Structures in Machine Learning Using Information Measures by Jorge F. Silva, Victor Faraggi, Camilo…
Hierarchical Classification Auxiliary Network for Time Series Forecasting by Yanru Sun, Zongxia Xie, Dongyue Chen, Emadeldeen…
Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales by Ju-Seung Byun,…
Understanding Linear Probing then Fine-tuning Language Models from NTK Perspective by Akiyoshi Tomihari, Issei Sato. First submitted…
A unified law of robustness for Bregman divergence losses by Santanu Das, Jatin Batra, Piyush Srivastava. First…
Online Self-Preferring Language Models by Yuanzhao Zhai, Zhuo Zhang, Kele Xu, Hanyang Peng, Yue Yu, Dawei…
Progress Measures for Grokking on Real-world Tasks by Satvik Golechha. First submitted to arXiv on: 21 May…
Alternators For Sequence Modeling by Mohammad Reza Rezaei, Adji Bousso Dieng. First submitted to arXiv on: 20…