Summary of Temporal Grounding Of Activities Using Multimodal Large Language Models, by Young Chol Song
Temporal Grounding of Activities using Multimodal Large Language Models by Young Chol Song. First submitted to arXiv…
Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision by Orr Zohar, Xiaohan Wang, Yonatan Bitton,…
InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instruct by Yutong Wu, Di Huang, Wenxuan Shi, Wei Wang,…
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild by Ahmed Masry, Megh Thakkar, Aayush Bajaj,…
Raw Text is All you Need: Knowledge-intensive Multi-turn Instruction Tuning for Large Language Model by Xia…
MMedAgent: Learning to Use Medical Tools with Multi-modal Agent by Binxu Li, Tiankai Yan, Yuanting Pan,…
GraphArena: Evaluating and Exploring Large Language Models on Graph Computation by Jianheng Tang, Qifan Zhang, Yuhan…
YuLan: An Open-source Large Language Model by Yutao Zhu, Kun Zhou, Kelong Mao, Wentong Chen, Yiding…
Methodology of Adapting Large English Language Models for Specific Cultural Contexts by Wenjing Zhang, Siqi Xiao,…
Optimizing Psychological Counseling with Instruction-Tuned Large Language Models by Wenjie Li, Tianyu Sun, Kun Qian, Wenhong…