Summary of Few-shot Steerable Alignment: Adapting Rewards and Llm Policies with Neural Processes, by Katarzyna Kobalczyk et al.
Few-shot Steerable Alignment: Adapting Rewards and LLM Policies with Neural Processesby Katarzyna Kobalczyk, Claudio Fanconi,…