Summary of Step-level Value Preference Optimization For Mathematical Reasoning, by Guoxin Chen et al.
Step-level Value Preference Optimization for Mathematical Reasoningby Guoxin Chen, Minpeng Liao, Chengxi Li, Kai FanFirst…
Step-level Value Preference Optimization for Mathematical Reasoningby Guoxin Chen, Minpeng Liao, Chengxi Li, Kai FanFirst…
Exploring the Zero-Shot Capabilities of LLMs Handling Multiple Problems at onceby Zhengxiang Wang, Jordan Kodner,…
FZI-WIM at SemEval-2024 Task 2: Self-Consistent CoT for Complex NLI in Biomedical Domainby Jin Liu,…
GPT-ology, Computational Models, Silicon Sampling: How should we think about LLMs in Cognitive Science?by Desmond…
Surprise! Using Physiological Stress for Allostatic Regulation Under the Active Inference Framework [Pre-Print]by Imran Khan,…
RMem: Restricted Memory Banks Improve Video Object Segmentationby Junbao Zhou, Ziqi Pang, Yu-Xiong WangFirst submitted…
Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generationby Yiwei Li, Fei Mi, Yitong Li, Yasheng…
Legend: Leveraging Representation Engineering to Annotate Safety Margin for Preference Datasetsby Duanyu Feng, Bowen Qin,…
Structured Active Inference (Extended Abstract)by Toby St Clere SmitheFirst submitted to arxiv on: 7 Jun…
MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Modelsby Tianle Gu, Zeyang Zhou,…