Summary of Exploring the Capabilities Of Large Multimodal Models on Dense Text, by Shuo Zhang et al.
Exploring the Capabilities of Large Multimodal Models on Dense Textby Shuo Zhang, Biao Yang, Zhang…
Exploring the Capabilities of Large Multimodal Models on Dense Textby Shuo Zhang, Biao Yang, Zhang…
Digital Diagnostics: The Potential Of Large Language Models In Recognizing Symptoms Of Common Illnessesby Gaurav…
VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Contextby Yunxin Li, Baotian…
A Fourth Wave of Open Data? Exploring the Spectrum of Scenarios for Open Data and…
LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Modelsby Mihir Parmar, Nisarg…
TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language Modelsby Ya-Qi Yu, Minghui Liao, Jihao…
DesignQA: A Multimodal Benchmark for Evaluating Large Language Models’ Understanding of Engineering Documentationby Anna C.…
VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?by Junpeng Liu,…
Survey of Bias In Text-to-Image Generation: Definition, Evaluation, and Mitigationby Yixin Wan, Arjun Subramonian, Anaelia…
IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representationsby Deqing Fu, Ruohao Guo, Ghazal Khalighinejad, Ollie…