Summary of Evaluating the Translation Performance Of Large Language Models Based on Euas-20, by Yan Huang et al.
Evaluating the Translation Performance of Large Language Models Based on Euas-20by Yan Huang, Wei LiuFirst…
Evaluating the Translation Performance of Large Language Models Based on Euas-20by Yan Huang, Wei LiuFirst…
DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Modelsby Bowen Wang, Jiuyang Chang, Yiming…
ML-EAT: A Multilevel Embedding Association Test for Interpretable and Transparent Social Scienceby Robert Wolfe, Alexis…
High-Throughput Phenotyping of Clinical Text Using Large Language Modelsby Daniel B. Hier, S. Ilyas Munzir,…
A new approach for encoding code and assisting code understandingby Mengdan Fan, Wei Zhang, Haiyan…
Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebusesby Gabriele Sarti,…
Closing the gap between open-source and commercial large language models for medical evidence summarizationby Gongbo…
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilitiesby Weihao Yu,…
Granting GPT-4 License and Opportunity: Enhancing Accuracy and Confidence Estimation for Few-Shot Event Detectionby Steven…
Improving Faithfulness of Large Language Models in Summarization via Sliding Generation and Self-Consistencyby Taiji Li,…