Summary of Patentgpt: a Large Language Model For Patent Drafting Using Knowledge-based Fine-tuning Method, by Runtao Ren et al.
PatentGPT: A Large Language Model for Patent Drafting Using Knowledge-based Fine-tuning Methodby Runtao Ren, Jian…
PatentGPT: A Large Language Model for Patent Drafting Using Knowledge-based Fine-tuning Methodby Runtao Ren, Jian…
Minor SFT loss for LLM fine-tune to increase performance and reduce model deviationby Shiming Xie,…
A Comparison of LLM Finetuning Methods & Evaluation Metrics with Travel Chatbot Use Caseby Sonia…
Model Surgery: Modulating LLM’s Behavior Via Simple Parameter Editingby Huanqian Wang, Yang Yue, Rui Lu,…
Towards Comprehensive Preference Data Collection for Reward Modelingby Yulan Hu, Qingyang Li, Sheng Ouyang, Ge…
Aqulia-Med LLM: Pioneering Full-Process Open-Source Medical Language Modelsby Lulu Zhao, Weihao Zeng, Xiaofeng Shi, Hua…
Toward Optimal LLM Alignments Using Two-Player Gamesby Rui Zheng, Hongyi Guo, Zhihan Liu, Xiaoying Zhang,…
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMsby Rui Yang, Ruomeng Ding, Yong…
Creativity Has Left the Chat: The Price of Debiasing Language Modelsby Behnam MohammadiFirst submitted to…
Optimizing Autonomous Driving for Safety: A Human-Centric Approach with LLM-Enhanced RLHFby Yuan Sun, Navid Salami…