Summary of Benchmark Transparency: Measuring the Impact Of Data on Evaluation, by Venelin Kovatchev and Matthew Lease
Benchmark Transparency: Measuring the Impact of Data on Evaluationby Venelin Kovatchev, Matthew LeaseFirst submitted to…
Benchmark Transparency: Measuring the Impact of Data on Evaluationby Venelin Kovatchev, Matthew LeaseFirst submitted to…
A Benchmark Evaluation of Clinical Named Entity Recognition in Frenchby Nesrine Bannour, Christophe Servan, Aurélie…
Uncovering Misattributed Suicide Causes through Annotation Inconsistency Detection in Death Investigation Notesby Song Wang, Yiliang…
Reshaping Free-Text Radiology Notes Into Structured Reports With Generative Transformersby Laura Bergomi, Tommaso M. Buonocore,…
mALBERT: Is a Compact Multilingual BERT Model Still Worth It?by Christophe Servan, Sahar Ghannay, Sophie…
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Textby Elliot Bolton, Abhinav Venigalla, Michihiro…
Neural Architecture Search for Sentence Classification with BERTby Philip Kenneweg, Sarah Schröder, Barbara HammerFirst submitted…
Harnessing the power of LLMs for normative reasoning in MASsby Bastin Tony Roy Savarimuthu, Surangika…
PE: A Poincare Explanation Method for Fast Text Hierarchy Generationby Qian Chen, Dongyang Li, Xiaofeng…
Qibo: A Large Language Model for Traditional Chinese Medicineby Heyi Zhang, Xin Wang, Zhaopeng Meng,…