Summary of Llava-chef: a Multi-modal Generative Model For Food Recipes, by Fnu Mohbat and Mohammed J. Zaki
LLaVA-Chef: A Multi-modal Generative Model for Food Recipesby Fnu Mohbat, Mohammed J. ZakiFirst submitted to…
LLaVA-Chef: A Multi-modal Generative Model for Food Recipesby Fnu Mohbat, Mohammed J. ZakiFirst submitted to…
Tex-ViT: A Generalizable, Robust, Texture-based dual-branch cross-attention deepfake detectorby Deepak Dagar, Dinesh Kumar VishwakarmaFirst submitted…
DLFormer: Enhancing Explainability in Multivariate Time Series Forecasting using Distributed Lag Embeddingby Younghwi Kim, Dohee…
Multitask learning for improved scour detection: A dynamic wave tank studyby Simon M. Brealy, Aidan…
TinyTNAS: GPU-Free, Time-Bound, Hardware-Aware Neural Architecture Search for TinyML Time Series Classificationby Bidyut Saha, Riya…
Adaptive Variational Continual Learning via Task-Heuristic Modellingby Fan YangFirst submitted to arxiv on: 29 Aug…
SFR-GNN: Simple and Fast Robust GNNs against Structural Attacksby Xing Ai, Guanyu Zhu, Yulin Zhu,…
SALSA: Speedy ASR-LLM Synchronous Aggregationby Ashish Mittal, Darshan Prabhu, Sunita Sarawagi, Preethi JyothiFirst submitted to…
Seeking the Sufficiency and Necessity Causal Features in Multimodal Representation Learningby Boyu Chen, Junjie Liu,…
CrisperWhisper: Accurate Timestamps on Verbatim Speech Transcriptionsby Laurin Wagner, Bernhard Thallinger, Mario ZusagFirst submitted to…