Alignment – Page 107 – GrooveSquid.com

Loading Now

July 13, 2025

Summary of Reinforcement Learning Based Escape Route Generation in Low Visibility Environments, by Hari Srikanth

Reinforcement Learning Based Escape Route Generation in Low Visibility Environmentsby Hari SrikanthFirst submitted to arxiv…

July 13, 2025

Summary of 3d-properties: Identifying Challenges in Dpo and Charting a Path Forward, by Yuzi Yan et al.

3D-Properties: Identifying Challenges in DPO and Charting a Path Forwardby Yuzi Yan, Yibo Miao, Jialian…

July 13, 2025

Summary of Failures Are Fated, but Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-scale Vision and Language Models, by Som Sagar et al.

Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision…

July 13, 2025

Summary of Paircfr: Enhancing Model Training on Paired Counterfactually Augmented Data Through Contrastive Learning, by Xiaoqi Qiu et al.

PairCFR: Enhancing Model Training on Paired Counterfactually Augmented Data through Contrastive Learningby Xiaoqi Qiu, Yongjie…

July 13, 2025

Summary of Dualtime: a Dual-adapter Multimodal Language Model For Time Series Representation, by Weiqi Zhang et al.

DualTime: A Dual-Adapter Multimodal Language Model for Time Series Representationby Weiqi Zhang, Jiexia Ye, Ziyue…

July 13, 2025

Summary of Understanding Attention-based Encoder-decoder Networks: a Case Study with Chess Scoresheet Recognition, by Sergio Y. Hayashi et al.

Understanding attention-based encoder-decoder networks: a case study with chess scoresheet recognitionby Sergio Y. Hayashi, Nina…

July 13, 2025

Summary of Vcr: Visual Caption Restoration, by Tianyu Zhang et al.

VCR: Visual Caption Restorationby Tianyu Zhang, Suyuchen Wang, Lu Li, Ge Zhang, Perouz Taslakian, Sai…

July 13, 2025

Summary of Diffusion-rpo: Aligning Diffusion Models Through Relative Preference Optimization, by Yi Gu et al.

Diffusion-RPO: Aligning Diffusion Models through Relative Preference Optimizationby Yi Gu, Zhendong Wang, Yueqin Yin, Yujia…

July 13, 2025

Summary of Aligning Large Language Models with Representation Editing: a Control Perspective, by Lingkai Kong et al.

Aligning Large Language Models with Representation Editing: A Control Perspectiveby Lingkai Kong, Haorui Wang, Wenhao…

July 13, 2025

Summary of Distributional Preference Alignment Of Llms Via Optimal Transport, by Igor Melnyk et al.

Distributional Preference Alignment of LLMs via Optimal Transportby Igor Melnyk, Youssef Mroueh, Brian Belgodere, Mattia…