Summary of Process Reward Model with Q-value Rankings, by Wendi Li et al.
Process Reward Model with Q-Value Rankings, by Wendi Li, Yixuan Li. First submitted to arXiv on: 15…
Preserve or Modify? Context-Aware Evaluation for Balancing Preservation and Modification in Text-Guided Image Editing, by Yoonjeon…
Applying Refusal-Vector Ablation to Llama 3.1 70B Agents, by Simon Lermen, Mateusz Dziemian, Govind Pimpale. First submitted…
Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search, by Chenglin Li, Qianglong Chen,…
KBLaM: Knowledge Base augmented Language Model, by Xi Wang, Taketomo Isazawa, Liana Mikaelyan, James Hensman. First submitted…
EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs, by Yijie Li, Yuan Sun. First submitted…
Surgical-LLaVA: Toward Surgical Scenario Understanding via Large Language and Vision Models, by Juseong Jin, Chang Wook…
SimpleStrat: Diversifying Language Model Generation with Stratification, by Justin Wong, Yury Orlovskiy, Michael Luo, Sanjit A.…
AutoEval: Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks, by Rushang Karia, Daniel Bramblett,…
Baichuan-Omni Technical Report, by Yadong Li, Haoze Sun, Mingan Lin, Tianpeng Li, Guosheng Dong, Tao Zhang,…