Fine tuning – Page 228 – GrooveSquid.com

July 13, 2025

Reflect-RL: Two-Player Online RL Fine-Tuning for LMs by Runlong Zhou, Simon S. Du, Beibin Li. First submitted…

July 13, 2025

EBFT: Effective and Block-Wise Fine-Tuning for Sparse LLMs by Song Guo, Fan Wu, Lei Zhang, Xiawu…

July 13, 2025

Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models by Christian Schlarmann,…

July 13, 2025

A Critical Evaluation of AI Feedback for Aligning Large Language Models by Archit Sharma, Sedrick Keh,…

July 13, 2025

Mafin: Enhancing Black-Box Embeddings with Model Augmented Fine-Tuning by Mingtian Zhang, Shawn Lan, Peter Hayes, David…

July 13, 2025

Uncertainty quantification in fine-tuned LLMs using LoRA ensembles by Oleksandr Balabanov, Hampus Linander. First submitted to arXiv…

July 13, 2025

Remember This Event That Year? Assessing Temporal Information and Reasoning in Large Language Models by Himanshu…

July 13, 2025

Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding by Hanling Yi,…

July 13, 2025

LoRA Training in the NTK Regime has No Spurious Local Minima by Uijeong Jang, Jason D.…

July 13, 2025

Invertible Fourier Neural Operators for Tackling Both Forward and Inverse Problems by Da Long, Shandian Zhe. First…