Summary of Qpo: Query-dependent Prompt Optimization Via Multi-loop Offline Reinforcement Learning, by Yilun Kong et al.
QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learningby Yilun Kong, Hangyu Mao, Qi Zhao,…
QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learningby Yilun Kong, Hangyu Mao, Qi Zhao,…
How to Make the Most of LLMs’ Grammatical Knowledge for Acceptability Judgmentsby Yusuke Ide, Yuto…
Paired Completion: Flexible Quantification of Issue-framing at Scale with LLMsby Simon D Angus, Lachlan O'NeillFirst…
Concept Distillation from Strong to Weak Models via Hypotheses-to-Theories Promptingby Emmanuel Aboah Boateng, Cassiano O.…
Chinese Metaphor Recognition Using a Multi-stage Prompting Large Language Modelby Jie Wang, Jin Wang, Xuejie…
Evaluating the Evaluator: Measuring LLMs’ Adherence to Task Evaluation Instructionsby Bhuvanashree Murugadoss, Christian Poelitz, Ian…
Reasoning Beyond Bias: A Study on Counterfactual Prompting and Chain of Thought Reasoningby Kyle Moore,…
Large Language Models Prompting With Episodic Memoryby Dai Do, Quan Tran, Svetha Venkatesh, Hung LeFirst…
Vision Language Model for Interpretable and Fine-grained Detection of Safety Compliance in Diverse Workplacesby Zhiling…
Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalizationby Yuhang Zang, Hanlin Goh, Josh…