Summary of Llava-surg: Towards Multimodal Surgical Assistant Via Structured Surgical Video Learning, by Jiajie Li et al.
LLaVA-Surg: Towards Multimodal Surgical Assistant via Structured Surgical Video Learningby Jiajie Li, Garrett Skinner, Gene…
LLaVA-Surg: Towards Multimodal Surgical Assistant via Structured Surgical Video Learningby Jiajie Li, Garrett Skinner, Gene…
IIU: Independent Inference Units for Knowledge-based Visual Question Answeringby Yili Li, Jing Yu, Keke Gai,…
DIVE: Towards Descriptive and Diverse Visual Commonsense Generationby Jun-Hyung Park, Hyuntae Park, Youjin Kang, Eojin…
Abstract Operations Research Modeling Using Natural Language Inputsby Junxuan Li, Ryan Wickman, Sahil Bhatnagar, Raj…
On-the-fly Synthesis for LTL over Finite Traces: An Efficient Approach that Countsby Shengping Xiao, Yongkang…
Robust Semi-supervised Multimodal Medical Image Segmentation via Cross Modality Collaborationby Xiaogen Zhou, Yiyou Sun, Min…
RTAT: A Robust Two-stage Association Tracker for Multi-Object Trackingby Song Guo, Rujie Liu, Narishige AbeFirst…
Do GPT Language Models Suffer From Split Personality Disorder? The Advent Of Substrate-Free Psychometricsby Peter…
A Quantum-Inspired Analysis of Human Disambiguation Processesby Daphne WangFirst submitted to arxiv on: 14 Aug…
The Restaurant Meal Delivery Problem with Ghost Kitchensby Gal Neria, Florentin D Hildebrandt, Michal Tzur,…