Summary of Multi-round Jailbreak Attack on Large Language Models, by Yihua Zhou et al.
Multi-round jailbreak attack on large language models, by Yihua Zhou, Xiaochuan Shi. First submitted to arxiv on:…
Improving Data Efficiency via Curating LLM-Driven Rating Systems, by Jinlong Pang, Jiaheng Wei, Ankit Parag Shah,…
Improving the Language Understanding Capabilities of Large Language Models Using Reinforcement Learning, by Bokai Hu, Sai…
Can Structured Data Reduce Epistemic Uncertainty? by Shriram M S, Sushmitha S, Gayathri K S, Shahina…
Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective, by Xiangru Zhu, Penglei Sun, Yaoxian Song,…
Thinking LLMs: General Instruction Following with Thought Generation, by Tianhao Wu, Janice Lan, Weizhe Yuan, Jiantao…
Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues, by Qibing Ren, Hao Li, Dongrui Liu,…
ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization, by Jiawei Liu, Fanrui…
Improving Semantic Understanding in Speech Language Models via Brain-tuning, by Omer Moussa, Dietrich Klakow, Mariya Toneva. First submitted to arxiv on:…
Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements, by Jingyu Zhang, Ahmed Elgohary, Ahmed Magooda,…