Summary of The Dark Side of Human Feedback: Poisoning Large Language Models via User Inputs, by Bocheng Chen et al.
The Dark Side of Human Feedback: Poisoning Large Language Models via User Inputs, by Bocheng Chen,…
Chatting Up Attachment: Using LLMs to Predict Adult Bonds, by Paulo Soares, Sean McCurdy, Andrew J.…
Does Alignment Tuning Really Break LLMs’ Internal Confidence? by Hongseok Oh, Wonseok Hwang. First submitted to arXiv…
Non-instructional Fine-tuning: Enabling Instruction-Following Capabilities in Pre-trained Language Models without Instruction-Following Data, by Juncheng Xie, Shensian…
Iterative Graph Alignment, by Fangyuan Yu, Hardeep Singh Arora, Matt Johnson. First submitted to arXiv on: 29…
Learning Harmonized Representations for Speculative Sampling, by Lefan Zhang, Xiaodan Wang, Yanhua Huang, Ruiwen Xu. First submitted…
Boosting Lossless Speculative Decoding via Feature Sampling and Partial Alignment Distillation, by Lujun Gui, Bin Xiao,…
UNA: Unifying Alignments of RLHF/PPO, DPO and KTO by a Generalized Implicit Reward Function, by Zhichao…
SurGen: Text-Guided Diffusion Model for Surgical Video Generation, by Joseph Cho, Samuel Schmidgall, Cyril Zakka, Mrudang…
Selective Preference Optimization via Token-Level Reward Function Estimation, by Kailai Yang, Zhiwei Liu, Qianqian Xie, Jimin…