Summary of Igot: Information Gain Optimized Tokenizer on Domain Adaptive Pretraining, by Dawei Feng et al.
IGOT: Information Gain Optimized Tokenizer on Domain Adaptive Pretrainingby Dawei Feng, Yihai Zhang, Zhixuan XuFirst…
IGOT: Information Gain Optimized Tokenizer on Domain Adaptive Pretrainingby Dawei Feng, Yihai Zhang, Zhixuan XuFirst…
SciQAG: A Framework for Auto-Generated Science Question Answering Dataset with Fine-grained Evaluationby Yuwei Wan, Yixuan…
GPT-3.5 for Grammatical Error Correctionby Anisia Katinskaia, Roman YangarberFirst submitted to arxiv on: 14 May…
MoVL:Exploring Fusion Strategies for the Domain-Adaptive Application of Pretrained Models in Medical Imaging Tasksby Haijiang…
LlamaTurk: Adapting Open-Source Generative Large Language Models for Low-Resource Languageby Cagri ToramanFirst submitted to arxiv…
FreeVA: Offline MLLM as Training-Free Video Assistantby Wenhao WuFirst submitted to arxiv on: 13 May…
PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetitionby Ziyang Zhang, Qizhen Zhang, Jakob…
CANTONMT: Investigating Back-Translation and Model-Switch Mechanisms for Cantonese-English Neural Machine Translationby Kung Yin Hong, Lifeng…
Advanced Natural-based interaction for the ITAlian language: LLaMAntino-3-ANITAby Marco Polignano, Pierpaolo Basile, Giovanni SemeraroFirst submitted…
InsightNet: Structured Insight Mining from Customer Feedbackby Sandeep Sricharan Mukku, Manan Soni, Jitenkumar Rana, Chetan…