Summary of On the Self-verification Limitations Of Large Language Models on Reasoning and Planning Tasks, by Kaya Stechly et al.
On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasksby Kaya Stechly,…
On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasksby Kaya Stechly,…
Secret Collusion among Generative AI Agentsby Sumeet Ramesh Motwani, Mikhail Baranchuk, Martin Strohmeier, Vijay Bolina,…
FinLLM-B: When Large Language Models Meet Financial Breakout Tradingby Kang Zhang, Osamu Yoshie, Lichao Sun,…
Large Language Models “Ad Referendum”: How Good Are They at Machine Translation in the Legal…
CyberMetric: A Benchmark Dataset based on Retrieval-Augmented Generation for Evaluating LLMs in Cybersecurity Knowledgeby Norbert…
Lissard: Long and Simple Sequential Reasoning Datasetsby Mirelle Bueno, Roberto Lotufo, Rodrigo NogueiraFirst submitted to…
Enhancing Multi-Criteria Decision Analysis with AI: Integrating Analytic Hierarchy Process and GPT-4 for Automated Decision…
ChemLLM: A Chemical Large Language Modelby Di Zhang, Wei Liu, Qian Tan, Jingdan Chen, Hang…
UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Constructionby Yansong Ning,…
Large Language Models: A Surveyby Shervin Minaee, Tomas Mikolov, Narjes Nikzad, Meysam Chenaghlu, Richard Socher,…