Summary of Rethinking Data Synthesis: a Teacher Model Training Recipe with Interpretation, by Yifang Chen et al.
Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretationby Yifang Chen, David Zhu, Simon…
Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretationby Yifang Chen, David Zhu, Simon…
Paved or unpaved? A Deep Learning derived Road Surface Global Dataset from Mapillary Street-View Imageryby…
Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Modelsby Danqing Wang, Zhuorui Ye, Fei…
Learning from Response not Preference: A Stackelberg Approach for LLM Detoxification using Non-parallel Databy Xinhong…
Effective Instruction Parsing Plugin for Complex Logical Query Answering on Knowledge Graphsby Xingrui Zhuo, Jiapu…
2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervisionby Shilong Li, Yancheng He, Hui Huang, Xingyuan…
Improving Small-Scale Large Language Models Function Calling for Reasoning Tasksby Graziano A. Manduzio, Federico A.…
PRACT: Optimizing Principled Reasoning and Acting of LLM Agentby Zhiwei Liu, Weiran Yao, Jianguo Zhang,…
LOGO – Long cOntext aliGnment via efficient preference Optimizationby Zecheng Tang, Zechen Sun, Juntao Li,…
Little Giants: Synthesizing High-Quality Embedding Data at Scaleby Haonan Chen, Liang Wang, Nan Yang, Yutao…