Summary of CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion, by Shoubin Yu et al.
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion by Shoubin Yu, Jaehong Yoon, Mohit…
Can Large Language Model Agents Simulate Human Trust Behavior? by Chengxing Xie, Canyu Chen, Feiran Jia,…
Direct Language Model Alignment from Online AI Feedback by Shangmin Guo, Biao Zhang, Tianlin Liu, Tianqi…
KICGPT: Large Language Model with Knowledge in Context for Knowledge Graph Completion by Yanbin Wei, Qiushi…
Retrieval Augmented End-to-End Spoken Dialog Models by Mingqiu Wang, Izhak Shafran, Hagen Soltau, Wei Han, Yuan…
Enhance Reasoning for Large Language Models in the Game Werewolf by Shuang Wu, Liwen Zhu, Tao…
Hardware Phi-1.5B: A Large Language Model Encodes Hardware Domain Specific Knowledge by Weimin Fu, Shijie Li,…
K-Level Reasoning: Establishing Higher Order Beliefs in Large Language Models for Strategic Reasoning by Yadong Zhang,…
ChatGPT vs Gemini vs LLaMA on Multilingual Sentiment Analysis by Alessio Buscemi, Daniele Proverbio. First submitted to…
Executable Code Actions Elicit Better LLM Agents by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang,…