Summary of Drt: Deep Reasoning Translation Via Long Chain-of-thought, by Jiaan Wang et al.
DRT: Deep Reasoning Translation via Long Chain-of-Thoughtby Jiaan Wang, Fandong Meng, Yunlong Liang, Jie ZhouFirst…
DRT: Deep Reasoning Translation via Long Chain-of-Thoughtby Jiaan Wang, Fandong Meng, Yunlong Liang, Jie ZhouFirst…
On the Feasibility of Vision-Language Models for Time-Series Classificationby Vinay Prithyani, Mohsin Mohammed, Richa Gadgil,…
Boosting LLM via Learning from Data Iteratively and Selectivelyby Qi Jia, Siyu Ren, Ziheng Qin,…
Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinementby Hyeonjin Kim, Jaejun YooFirst…
Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior…
GAS: Generative Auto-bidding with Post-training Searchby Yewen Li, Shuai Mao, Jingtong Gao, Nan Jiang, Yunjian…
Survey on Abstractive Text Summarization: Dataset, Models, and Metricsby Gospel Ozioma Nnadi, Flavio BertiniFirst submitted…
Online Learning from Strategic Human Feedback in LLM Fine-Tuningby Shugang Hao, Lingjie DuanFirst submitted to…
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuningby Yuxiang Zhang, Yuqi Yang,…
System-2 Mathematical Reasoning via Enriched Instruction Tuningby Huanqia Cai, Yijun Yang, Zhifeng LiFirst submitted to…