Summary of Enhancing Multiple Dimensions Of Trustworthiness in Llms Via Sparse Activation Control, by Yuxin Xiao et al.
Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation Controlby Yuxin Xiao, Chaoqun Wan,…
Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation Controlby Yuxin Xiao, Chaoqun Wan,…
Detect an Object At Once without Fine-tuningby Junyu Hao, Jianheng Liu, Yongjia Zhao, Zuofan Chen,…
Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generationby Zhenbin Wang, Lei Zhang,…
A Simple and Effective Temporal Grounding Pipeline for Basketball Broadcast Footageby Levi HarrisFirst submitted to…
IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselvesby Ruofan Wang, Juncheng Li, Yixu Wang,…
Evolving Alignment via Asymmetric Self-Playby Ziyu Ye, Rishabh Agarwal, Tianqi Liu, Rishabh Joshi, Sarmishta Velury,…
Democratizing Reward Design for Personal and Representative Value-Alignmentby Carter Blair, Kate Larson, Edith LawFirst submitted…
Do Large Language Models Align with Core Mental Health Counseling Competencies?by Viet Cuong Nguyen, Mohammad…
From Explicit Rules to Implicit Reasoning in an Interpretable Violence Monitoring Systemby Wen-Dong Jiang, Chih-Yung…
Multi-path Exploration and Feedback Adjustment for Text-to-Image Person Retrievalby Bin Kang, Bin Chen, Junjie Wang,…