Summary of Truth or Deceit? a Bayesian Decoding Game Enhances Consistency and Reliability, by Weitong Zhang et al.
Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and Reliabilityby Weitong Zhang, Chengqi Zang,…
Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and Reliabilityby Weitong Zhang, Chengqi Zang,…
Possible principles for aligned structure learning agentsby Lancelot Da Costa, Tomáš Gavenčiak, David Hyland, Mandana…
Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentationby Kun Yuan, Vinkle Srivastav, Nassir Navab, Nicolas…
Maia-2: A Unified Model for Human-AI Alignment in Chessby Zhenwei Tang, Difan Jiao, Reid McIlroy-Young,…
Grounding 3D Scene Affordance From Egocentric Interactionsby Cuiyu Liu, Wei Zhai, Yuhang Yang, Hongchen Luo,…
Designing Domain-Specific Large Language Models: The Critical Role of Fine-Tuning in Public Opinion Simulationby Haocheng…
Systematic Characterization of the Effectiveness of Alignment in Large Language Models for Categorical Decisionsby Isaac…
Elephant in the Room: Unveiling the Impact of Reward Model Quality in Alignmentby Yan Liu,…
LLMs4Synthesis: Leveraging Large Language Models for Scientific Synthesisby Hamed Babaei Giglou, Jennifer D'Souza, Sören AuerFirst…
Self-supervised Preference Optimization: Enhance Your Language Model with Preference Degree Awarenessby Jian Li, Haojing Huang,…