Summary of OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset, by Shubham Toshniwal et al.
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset by Shubham Toshniwal, Ivan Moshkov, Sean Narenthiran, Daria…
Large Scale Constrained Clustering With Reinforcement Learning by Benedikt Schesch, Marco Caserta. First submitted to arXiv on:…
Reward Generalization in RLHF: A Topological Perspective by Tianyi Qiu, Fanzhi Zeng, Jiaming Ji, Dong Yan,…
Crafting a Good Prompt or Providing Exemplary Dialogues? A Study of In-Context Learning for Persona-based…
Why are Sensitive Functions Hard for Transformers? by Michael Hahn, Mark Rofin. First submitted to arXiv on:…
Hierarchy Representation of Data in Machine Learnings by Han Yegang, Park Minjun, Byun Duwon, Park Inkyu. First…
Accelerating Parallel Sampling of Diffusion Models by Zhiwei Tang, Jiasheng Tang, Hao Luo, Fan Wang, Tsung-Hui…
Data Augmentation and Transfer Learning Approaches Applied to Facial Expressions Recognition by Enrico Randellini, Leonardo Rigutini,…
Fast Vocabulary Transfer for Language Model Compression by Leonidas Gee, Andrea Zugarini, Leonardo Rigutini, Paolo Torroni. First…
Symmetry-Breaking Augmentations for Ad Hoc Teamwork by Ravi Hammond, Dustin Craggs, Mingyu Guo, Jakob Foerster, Ian…