Summary of Grounding Descriptions in Images Informs Zero-shot Visual Recognition, by Shaunak Halbe et al.
Grounding Descriptions in Images informs Zero-Shot Visual Recognition by Shaunak Halbe, Junjiao Tian, K J Joseph,…
VisionZip: Longer is Better but Not Necessary in Vision Language Models by Senqiao Yang, Yukang Chen,…
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy by Keru Chen, Honghao Wei, Zhigang…
Multi-Bin Batching for Increasing LLM Inference Throughput by Ozgur Guldogan, Jackson Kunde, Kangwook Lee, Ramtin Pedarsani. First…
FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning by Jiayu Liu, Yong Wang, Nianbin…
Arctic-Embed 2.0: Multilingual Retrieval Without Compromise by Puxuan Yu, Luke Merrick, Gaurav Nuti, Daniel Campos. First submitted…
Prompting Large Language Models for Clinical Temporal Relation Extraction by Jianping He, Laila Rasmy, Haifang Li,…
Leveraging Multimodal Protein Representations to Predict Protein Melting Temperatures by Daiheng Zhang, Yan Zeng, Xinyu Hong,…
MageBench: Bridging Large Multimodal Models to Agents by Miaosen Zhang, Qi Dai, Yifan Yang, Jianmin Bao,…
WinTSR: A Windowed Temporal Saliency Rescaling Method for Interpreting Time Series Deep Learning Models by Md.…