Summary of Optical Flow Representation Alignment Mamba Diffusion Model For Medical Video Generation, by Zhenbin Wang et al.
Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generationby Zhenbin Wang, Lei Zhang,…
Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generationby Zhenbin Wang, Lei Zhang,…
RAGViz: Diagnose and Visualize Retrieval-Augmented Generationby Tevin Wang, Jingyuan He, Chenyan XiongFirst submitted to arxiv…
EEG-based Multimodal Representation Learning for Emotion Recognitionby Kang Yin, Hye-Bin Shin, Dan Li, Seong-Whan LeeFirst…
Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learningby John…
STAA: Spatio-Temporal Attention Attribution for Real-Time Interpreting Transformer-based Video Modelsby Zerui Wang, Yan LiuFirst submitted…
EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketchingby Xinwang Chen, Ning Liu, Yichen…
Commonsense Knowledge Editing Based on Free-Text in LLMsby Xiusheng Huang, Yequan Wang, Jun Zhao, Kang…
Dataset Awareness is not Enough: Implementing Sample-level Tail Encouragement in Long-tailed Self-supervised Learningby Haowen Xiao,…
DiaMond: Dementia Diagnosis with Multi-Modal Vision Transformers Using MRI and PETby Yitong Li, Morteza Ghahremani,…
VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Accelerationby Dezhan Tu, Danylo…