Summary of Leveraging Speech For Gesture Detection in Multimodal Communication, by Esam Ghaleb et al.
Leveraging Speech for Gesture Detection in Multimodal Communicationby Esam Ghaleb, Ilya Burenko, Marlou Rasenberg, Wim…
Leveraging Speech for Gesture Detection in Multimodal Communicationby Esam Ghaleb, Ilya Burenko, Marlou Rasenberg, Wim…
BCFPL: Binary classification ConvNet based Fast Parking space recognition with Low resolution imageby Shuo Zhang,…
UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptationby Siru Zhong, Xixuan Hao, Yibo Yan, Ying…
A Survey on Efficient Inference for Large Language Modelsby Zixuan Zhou, Xuefei Ning, Ke Hong,…
Explaining Arguments’ Strength: Unveiling the Role of Attacks and Supports (Technical Report)by Xiang Yin, Potyka…
Automatic Discovery of Visual Circuitsby Achyuta Rajaram, Neil Chowdhury, Antonio Torralba, Jacob Andreas, Sarah SchwettmannFirst…
Graphic Design with Large Multimodal Modelby Yutao Cheng, Zhao Zhang, Maoke Yang, Hui Nie, Chunyuan…
Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency Graphby Xiaochen Kev Gao, Feng…
Pre-Calc: Learning to Use the Calculator Improves Numeracy in Language Modelsby Vishruth Veerendranath, Vishwa Shah,…
Knowledge-Aware Neuron Interpretation for Scene Classificationby Yong Guan, Freddy Lecue, Jiaoyan Chen, Ru Li, Jeff…