Summary of Regularized Best-of-n Sampling with Minimum Bayes Risk Objective For Language Model Alignment, by Yuu Jinnai et al.
Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignmentby Yuu Jinnai, Tetsuro…
Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignmentby Yuu Jinnai, Tetsuro…
Configurable Safety Tuning of Language Models with Synthetic Preference Databy Victor GallegoFirst submitted to arxiv…
Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstructionby Qiuhong Shen, Zike Wu,…
Improving Attributed Text Generation of Large Language Models via Preference Learningby Dongfang Li, Zetian Sun,…
Prioritized League Reinforcement Learning for Large-Scale Heterogeneous Multiagent Systemsby Qingxu Fu, Zhiqiang Pu, Min Chen,…
Towards a FAIR Documentation of Workflows and Models in Applied Mathematicsby Marco Reidelbach, Björn Schembera,…
InternLM2 Technical Reportby Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen,…
Learning Traffic Signal Control via Genetic Programmingby Xiao-Cheng Liao, Yi Mei, Mengjie ZhangFirst submitted to…
Explainable Graph Neural Networks for Observation Impact Analysis in Atmospheric State Estimationby Hyeon-Ju Jeon, Jeon-Ho…
An Open-source End-to-End Logic Optimization Framework for Large-scale Boolean Network with Reinforcement Learningby Zhen Li,…