Summary of Exploration by Running Away From the Past, By Paul-antoine Le Tolguenec et al.
Exploration by Running Away from the Pastby Paul-Antoine Le Tolguenec, Yann Besse, Florent Teichteil-Koenigsbuch, Dennis…
Exploration by Running Away from the Pastby Paul-Antoine Le Tolguenec, Yann Besse, Florent Teichteil-Koenigsbuch, Dennis…
GNN-MultiFix: Addressing the pitfalls for GNNs for multi-label node classificationby Tianqi Zhao, Megha KhoslaFirst submitted…
Umbrella Reinforcement Learning – computationally efficient tool for hard non-linear problemsby Egor E. Nuzhin, Nikolai…
GASP: Efficient Black-Box Generation of Adversarial Suffixes for Jailbreaking LLMsby Advik Raj Basani, Xiao ZhangFirst…
ComfyGI: Automatic Improvement of Image Generation Workflowsby Dominik Sobania, Martin Briesch, Franz RothlaufFirst submitted to…
Revised Regularization for Efficient Continual Learning through Correlation-Based Parameter Update in Bayesian Neural Networksby Sanchar…
Evaluating the Robustness of Analogical Reasoning in Large Language Modelsby Martha Lewis, Melanie MitchellFirst submitted…
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMsby Akari Asai, Jacqueline He, Rulin Shao, Weijia Shi,…
Natural Language Reinforcement Learningby Xidong Feng, Ziyu Wan, Haotian Fu, Bo Liu, Mengyue Yang, Girish…
Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Modelsby Javier Ferrando, Oscar…