Summary of Creating a Lens Of Chinese Culture: a Multimodal Dataset For Chinese Pun Rebus Art Understanding, by Tuo Zhang et al.
Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understandingby…
Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understandingby…
Efficient Prompting for LLM-based Generative Internet of Thingsby Bin Xiao, Burak Kantarci, Jiawen Kang, Dusit…
What is the Visual Cognition Gap between Humans and Multimodal LLMs?by Xu Cao, Bolin Lai,…
Localizing Events in Videos with Multimodal Queriesby Gengyuan Zhang, Mang Ling Ada Fok, Jialu Ma,…
Details Make a Difference: Object State-Sensitive Neurorobotic Task Planningby Xiaowen Sun, Xufeng Zhao, Jae Hee…
Tilt and Average : Geometric Adjustment of the Last Layer for Recalibrationby Gyusang Cho, Chan-Hyun…
First Multi-Dimensional Evaluation of Flowchart Comprehension for Multimodal Large Language Modelsby Enming Zhang, Ruobing Yao,…
FZI-WIM at SemEval-2024 Task 2: Self-Consistent CoT for Complex NLI in Biomedical Domainby Jin Liu,…
SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understandingby Junwei Luo,…
Exploration by Learning Diverse Skills through Successor State Measuresby Paul-Antoine Le Tolguenec, Yann Besse, Florent…