Summary of Systematic Evaluation Of Long-context Llms on Financial Concepts, by Lavanya Gupta et al.
Systematic Evaluation of Long-Context LLMs on Financial Conceptsby Lavanya Gupta, Saket Sharma, Yiyun ZhaoFirst submitted…
Systematic Evaluation of Long-Context LLMs on Financial Conceptsby Lavanya Gupta, Saket Sharma, Yiyun ZhaoFirst submitted…
ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Studyby…
Relational Programming with Foundation Modelsby Ziyang Li, Jiani Huang, Jason Liu, Felix Zhu, Eric Zhao,…
How good is GPT at writing political speeches for the White House?by Jacques SavoyFirst submitted…
GLIDER: Grading LLM Interactions and Decisions using Explainable Rankingby Darshan Deshpande, Selvan Sunitha Ravi, Sky…
G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4oby Tony Cheng Tong,…
ROMAS: A Role-Based Multi-Agent System for Database monitoring and Planningby Yi Huang, Fangyin Cheng, Fan…
GIRAFFE: Design Choices for Extending the Context Length of Visual Language Modelsby Mukai Li, Lei…
Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Modelsby YiFan Zhang, Shanglin Lei, Runqi Qiao,…
OpenReviewer: A Specialized Large Language Model for Generating Critical Scientific Paper Reviewsby Maximilian Idahl, Zahra…