Summary of Gradient Weight-normalized Low-rank Projection for Efficient LLM Training, by Jia-Hong Huang et al.
Gradient Weight-normalized Low-rank Projection for Efficient LLM Training by Jia-Hong Huang, Yixian Shen, Hongyi Zhu, Stevan…
Assessing Pre-Trained Models for Transfer Learning Through Distribution of Spectral Components by Tengxue Zhang, Yang Shu,…
Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL by Qin-Wen Luo, Ming-Kun Xie, Ye-Wen…
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs by Junying Chen, Zhenyang Cai, Ke Ji, Xidong Wang,…
Optimizing Large Language Models with an Enhanced LoRA Fine-Tuning Algorithm for Efficiency and Robustness in…
Torque-Aware Momentum by Pranshu Malviya, Goncalo Mordido, Aristide Baratin, Reza Babanezhad Harikandeh, Gintare Karolina Dziugaite, Razvan…
AgreeMate: Teaching LLMs to Haggle by Ainesh Chatterjee, Samuel Miller, Nithin Parepally. First submitted to arXiv on:…
FameBias: Embedding Manipulation Bias Attack in Text-to-Image Models by Jaechul Roh, Andrew Yuan, Jinsong Mao. First submitted…
MMFactory: A Universal Solution Search Engine for Vision-Language Tasks by Wan-Cyuan Fan, Tanzila Rahman, Leonid Sigal. First…
Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence by Yinbin Han, Meisam Razaviyayn, Renyuan…