Summary of Measuring and Reducing Llm Hallucination Without Gold-standard Answers, by Jiaheng Wei et al.
Measuring and Reducing LLM Hallucination without Gold-Standard Answersby Jiaheng Wei, Yuanshun Yao, Jean-Francois Ton, Hongyi…
Measuring and Reducing LLM Hallucination without Gold-Standard Answersby Jiaheng Wei, Yuanshun Yao, Jean-Francois Ton, Hongyi…
Transductive Learning Is Compactby Julian Asilis, Siddartha Devic, Shaddin Dughmi, Vatsal Sharan, Shang-Hua TengFirst submitted…
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustmentby Rui Yang, Xiaoman Pan, Feng…
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generationby Huizhuo Yuan, Zixiang Chen, Kaixuan Ji, Quanquan…
RS-DPO: A Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language…
Node Duplication Improves Cold-start Link Predictionby Zhichun Guo, Tong Zhao, Yozen Liu, Kaiwen Dong, William…
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learningby Michael Lanier, Ying Xu,…
Comparing supervised learning dynamics: Deep neural networks match human data efficiency but show a generalisation…
Embracing the black box: Heading towards foundation models for causal discovery from time series databy…
Weakly Supervised Segmentation of Vertebral Bodies with Iterative Slice-propagationby Shiqi Peng, Bolin Lai, Guangyu Yao,…