Summary of TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning, by Xiangyu Zeng et al.
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning by Xiangyu Zeng, Kunchang Li, Chenting…
2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision by Shilong Li, Yancheng He, Hui Huang, Xingyuan…
Counting Ability of Large Language Models and Impact of Tokenization by Xiang Zhang, Juntai Cao, Chenyu…
Integrating Reasoning Systems for Trustworthy AI, Proceedings of the 4th Workshop on Logic and Practice…
The Potential and Value of AI Chatbot in Personalized Cognitive Training by Zilong Wang, Nan Chen,…
A SAM based Tool for Semi-Automatic Food Annotation by Lubnaa Abdur Rahman, Ioannis Papathanail, Lorenzo Brigato,…
Movie Trailer Genre Classification Using Multimodal Pretrained Features by Serkan Sulun, Paula Viana, Matthew E. P.…
PINNing Cerebral Blood Flow: Analysis of Perfusion MRI in Infants using Physics-Informed Neural Networks by Christoforos…
Reliable, Routable, and Reproducible: Collection of Pedestrian Pathways at Statewide Scale by Yuxiang Zhang, Bill Howe,…
LocateBench: Evaluating the Locating Ability of Vision Language Models by Ting-Rui Chiang, Joshua Robinson, Xinyan Velocity…