Summary of The Power Of Active Multi-task Learning in Reinforcement Learning From Human Feedback, by Ruitao Chen et al.
The Power of Active Multi-Task Learning in Reinforcement Learning from Human Feedbackby Ruitao Chen, Liwei…
The Power of Active Multi-Task Learning in Reinforcement Learning from Human Feedbackby Ruitao Chen, Liwei…
OTLP: Output Thresholding Using Mixed Integer Linear Programmingby Baran Koseoglu, Luca Traverso, Mohammed Topiwalla, Egor…
SimAD: A Simple Dissimilarity-based Approach for Time Series Anomaly Detectionby Zhijie Zhong, Zhiwen Yu, Xing…
WisPerMed at “Discharge Me!”: Advancing Text Generation in Healthcare with Large Language Models, Dynamic Expert…
Submodular Information Selection for Hypothesis Testing with Misclassification Penaltiesby Jayanth Bhargav, Mahsa Ghasemi, Shreyas SundaramFirst…
The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networksby Lucius Bushnaq,…
Observational Scaling Laws and the Predictability of Language Model Performanceby Yangjun Ruan, Chris J. Maddison,…
DINO as a von Mises-Fisher mixture modelby Hariprasath Govindarajan, Per Sidén, Jacob Roll, Fredrik LindstenFirst…
Untargeted Adversarial Attack on Knowledge Graph Embeddingsby Tianzhe Zhao, Jiaoyan Chen, Yanchi Ru, Qika Lin,…
Block Selective Reprogramming for On-device Training of Vision Transformersby Sreetama Sarkar, Souvik Kundu, Kai Zheng,…