Summary of Qalam : a Multimodal Llm For Arabic Optical Character and Handwriting Recognition, by Gagan Bhatia et al.
Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognitionby Gagan Bhatia, El…
Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognitionby Gagan Bhatia, El…
PLANTS: A Novel Problem and Dataset for Summarization of Planning-Like (PL) Tasksby Vishal Pallagani, Biplav…
Training-free Composite Scene Generation for Layout-to-Image Synthesisby Jiaqi Liu, Tao Huang, Chang XuFirst submitted to…
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabulariesby Chaofan Tao, Qian Liu, Longxu Dou,…
A Comparative Study on Automatic Coding of Medical Letters with Explainabilityby Jamie Glen, Lifeng Han,…
Weak-to-Strong Reasoningby Yuqing Yang, Yan Ma, Pengfei LiuFirst submitted to arxiv on: 18 Jul 2024CategoriesMain:…
HPix: Generating Vector Maps from Satellite Imagesby Aditya Taparia, Keshab NathFirst submitted to arxiv on:…
DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solvingby Yuxuan Tong, Xiwen Zhang, Rui Wang, Ruidong Wu,…
Cross-Task Attack: A Self-Supervision Generative Framework Based on Attention Shiftby Qingyuan Zeng, Yunpeng Gong, Min…
Scaling Granite Code Models to 128K Contextby Matt Stallone, Vaibhav Saxena, Leonid Karlinsky, Bridget McGinn,…