Summary of AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs, by Basel Mousi et al.
AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs by Basel Mousi, Nadir Durrani, Fatema Ahmad,…
GOSt-MT: A Knowledge Graph for Occupation-related Gender Biases in Machine Translation by Orfeas Menis Mastromichalakis, Giorgos…
RoMath: A Mathematical Reasoning Benchmark in Romanian by Adrian Cosma, Ana-Maria Bucur, Emilian Radoi. First submitted to…
Task Arithmetic for Language Expansion in Speech Translation by Yao-Fei Cheng, Hayato Futami, Yosuke Kashiwagi, Emiru…
360PanT: Training-Free Text-Driven 360-Degree Panorama-to-Panorama Translation by Hai Wang, Jing-Hao Xue. First submitted to arXiv on: 12…
An Unsupervised Dialogue Topic Segmentation Model Based on Utterance Rewriting by Xia Hou, Qifeng Li, Tongliang…
Evaluation of Google Translate for Mandarin Chinese translation using sentiment and semantic analysis by Xuechun Wang,…
From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks by Andreas Stephan, Dawei Zhu,…
N-gram Prediction and Word Difference Representations for Language Modeling by DongNyeong Heo, Daniela Noemi Rim, Heeyoul…
How Much Data is Enough Data? Fine-Tuning Large Language Models for In-House Translation: Performance Evaluation…