Summary of Federated Document Visual Question Answering: a Pilot Study, by Khanh Nguyen and Dimosthenis Karatzas
Federated Document Visual Question Answering: A Pilot Studyby Khanh Nguyen, Dimosthenis KaratzasFirst submitted to arxiv…
Federated Document Visual Question Answering: A Pilot Studyby Khanh Nguyen, Dimosthenis KaratzasFirst submitted to arxiv…
HMT: Hierarchical Memory Transformer for Efficient Long Context Language Processingby Zifan He, Yingqi Cao, Zongyue…
Switchable Decision: Dynamic Neural Generation Networksby Shujian Zhang, Korawat Tanwisuth, Chengyue Gong, Pengcheng He, Mingyuan…
ERATTA: Extreme RAG for Table To Answers with Large Language Modelsby Sohini Roychowdhury, Marko Krema,…
Advancing Multimodal Medical Capabilities of Geminiby Lin Yang, Shawn Xu, Andrew Sellergren, Timo Kohlberger, Yuchen…
Mitigating LLM Hallucinations via Conformal Abstentionby Yasin Abbasi Yadkori, Ilja Kuzborskij, David Stutz, András György,…
MediFact at MEDIQA-M3G 2024: Medical Question Answering in Dermatology with Multimodal Learningby Nadia SaeedFirst submitted…
UQA: Corpus for Urdu Question Answeringby Samee Arif, Sualeha Farid, Awais Athar, Agha Ali RazaFirst…
Beyond Human Vision: The Role of Large Vision Language Models in Microscope Image Analysisby Prateek…
Evaluating Tool-Augmented Agents in Remote Sensing Platformsby Simranjit Singh, Michael Fore, Dimitrios StamoulisFirst submitted to…