Summary of Assessing the Zero-shot Capabilities Of Llms For Action Evaluation in Rl, by Eduardo Pignatelli et al.
Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RLby Eduardo Pignatelli, Johan Ferret,…
Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RLby Eduardo Pignatelli, Johan Ferret,…
Zero-to-Strong Generalization: Eliciting Strong Capabilities of Large Language Models Iteratively without Gold Labelsby Chaoqun Liu,…
Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Databy Jiaming Zhou, Abbas Ghaddar,…
A Controlled Study on Long Context Extension and Generalization in LLMsby Yi Lu, Jing Nathan…
Democratizing MLLMs in Healthcare: TinyLLaVA-Med for Efficient Healthcare Diagnostics in Resource-Constrained Settingsby Aya El Mir,…
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvementby An Yang, Beichen Zhang, Binyuan Hui,…
Finetuning Language Models to Emit Linguistic Expressions of Uncertaintyby Arslan Chaudhry, Sridhar Thiagarajan, Dilan GorurFirst…
LPT++: Efficient Training on Mixture of Long-tailed Expertsby Bowen Dong, Pan Zhou, Wangmeng ZuoFirst submitted…
Beyond LoRA: Exploring Efficient Fine-Tuning Techniques for Time Series Foundational Modelsby Divij Gupta, Anubhav Bhatti,…
Cross-lingual transfer of multilingual models on low resource African Languagesby Harish Thangaraj, Ananya Chenat, Jaskaran…