Summary of Scicode: a Research Coding Benchmark Curated by Scientists, By Minyang Tian et al.
SciCode: A Research Coding Benchmark Curated by Scientistsby Minyang Tian, Luyu Gao, Shizhuo Dylan Zhang,…
SciCode: A Research Coding Benchmark Curated by Scientistsby Minyang Tian, Luyu Gao, Shizhuo Dylan Zhang,…
Unified-EGformer: Exposure Guided Lightweight Transformer for Mixed-Exposure Image Enhancementby Eashan Adhikarla, Kai Zhang, Rosaura G.…
NODER: Image Sequence Regression Based on Neural Ordinary Differential Equationsby Hao Bai, Yi HongFirst submitted…
WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Modelsby Kangyun Ning, Yisong Su,…
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Modelsby Xavier Suau, Pieter Delobelle, Katherine…
Assessing the Effectiveness of GPT-4o in Climate Change Evidence Synthesis and Systematic Assessments: Preliminary Insightsby…
Regurgitative Training: The Value of Real Data in Training Large Language Modelsby Jinghui Zhang, Dandan…
Why Does New Knowledge Create Messy Ripple Effects in LLMs?by Jiaxin Qin, Zixuan Zhang, Chi…
Black-box Model Ensembling for Textual and Visual Question Answering via Information Fusionby Yuxi Xia, Kilm…
MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Productionby Jian Ma, Wenguan Wang, Yi Yang, Feng…