Summary of Supervised Fine-tuning As Inverse Reinforcement Learning, by Hao Sun
Supervised Fine-Tuning as Inverse Reinforcement Learning, by Hao Sun. First submitted to arXiv on: 18 Mar 2024. Categories: Main:…
VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models, by Junlin Han, Filippos Kokkinos, Philip…
One-Step Image Translation with Text-to-Image Models, by Gaurav Parmar, Taesung Park, Srinivasa Narasimhan, Jun-Yan Zhu. First submitted…
SuperLoRA: Parameter-Efficient Unified Adaptation of Multi-Layer Attention Modules, by Xiangyu Chen, Jing Liu, Ye Wang, Pu…
Linguacodus: A Synergistic Framework for Transformative Code Generation in Machine Learning Pipelines, by Ekaterina Trofimova, Emil…
JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented Fine-Tuning, by Anique Tahir, Lu Cheng, Huan Liu. First…
Model Reprogramming Outperforms Fine-tuning on Out-of-distribution Data in Text-Image Encoders, by Andrew Geng, Pin-Yu Chen. First submitted…
Enhancing Out-of-Distribution Detection with Multitesting-based Layer-wise Feature Fusion, by Jiawei Li, Sitong Li, Shanshan Wang, Yicheng…
Parameter Efficient Reinforcement Learning from Human Feedback, by Hakim Sidahmed, Samrat Phatale, Alex Hutcheson, Zhuonan Lin,…
MoPE: Mixture of Prompt Experts for Parameter-Efficient and Scalable Multimodal Fusion, by Ruixiang Jiang, Lingbo Liu,…