Summary of Self-Taught Evaluators, by Tianlu Wang et al.
Self-Taught Evaluators by Tianlu Wang, Ilia Kulikov, Olga Golovneva, Ping Yu, Weizhe Yuan, Jane Dwivedi-Yu, Richard…
DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models by Bowen Wang, Jiuyang Chang, Yiming…
ML-EAT: A Multilevel Embedding Association Test for Interpretable and Transparent Social Science by Robert Wolfe, Alexis…
High-Throughput Phenotyping of Clinical Text Using Large Language Models by Daniel B. Hier, S. Ilyas Munzir,…
Granting GPT-4 License and Opportunity: Enhancing Accuracy and Confidence Estimation for Few-Shot Event Detection by Steven…
A new approach for encoding code and assisting code understanding by Mengdan Fan, Wei Zhang, Haiyan…
Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses by Gabriele Sarti,…
Closing the gap between open-source and commercial large language models for medical evidence summarization by Gongbo…
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities by Weihao Yu,…