Summary of Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment, by Keming Lu et al.