Summary of Plancraft: An Evaluation Dataset for Planning with LLM Agents, by Gautier Dagan et al.
Plancraft: an evaluation dataset for planning with LLM agents, by Gautier Dagan, Frank Keller, Alex Lascarides. First…
A Comprehensive Framework for Reliable Legal AI: Combining Specialized Expert Systems and Adaptive Refinement, by Sidra…
GeAR: Graph-enhanced Agent for Retrieval-augmented Generation, by Zhili Shen, Chenxin Diao, Pavlos Vougiouklis, Pascual Merita, Shriram…
Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases, by Christian Di Maio, Cristian…
Contrato360 2.0: A Document and Database-Driven Question-Answer System using Large Language Models and Agents, by Antony…
LLM Agent for Fire Dynamics Simulations, by Leidong Xu, Danyal Mohaddes, Yi Wang. First submitted to arXiv…
Formal Language Knowledge Corpus for Retrieval Augmented Generation, by Majd Zayyad, Yossi Adi. First submitted to arXiv…
TimeRAG: BOOSTING LLM Time Series Forecasting via Retrieval-Augmented Generation, by Silin Yang, Dong Wang, Haoqi Zheng,…
On the Suitability of pre-trained foundational LLMs for Analysis in German Legal Education, by Lorenz Wendlinger,…
Context-DPO: Aligning Language Models for Context-Faithfulness, by Baolong Bi, Shaohan Huang, Yiwei Wang, Tianchi Yang, Zihan…