Summary of Malt: Multi-scale Action Learning Transformer For Online Action Detection, by Zhipeng Yang et al.
MALT: Multi-scale Action Learning Transformer for Online Action Detectionby Zhipeng Yang, Ruoyu Wang, Yang Tan,…
MALT: Multi-scale Action Learning Transformer for Online Action Detectionby Zhipeng Yang, Ruoyu Wang, Yang Tan,…
Disrupting Diffusion: Token-Level Attention Erasure Attack against Diffusion-based Customizationby Yisu Liu, Jinyang An, Wanqian Zhang,…
CaLa: Complementary Association Learning for Augmenting Composed Image Retrievalby Xintong Jiang, Yaxiong Wang, Mengjian Li,…
Language Reconstruction with Brain Predictive Coding from fMRI Databy Congchi Yin, Ziyi Ye, Piji LiFirst…
Structured Click Control in Transformer-based Interactive Segmentationby Long Xu, Yongquan Chen, Rui Huang, Feng Wu,…
Obtaining Favorable Layouts for Multiple Object Generationby Barak Battash, Amit Rozner, Lior Wolf, Ofir LindenbaumFirst…
Cross-Task Multi-Branch Vision Transformer for Facial Expression and Mask Wearing Classificationby Armando Zhu, Keqin Li,…
LTOS: Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusionsby Xiaoran Zhao, Tianhao Wu, Yu Lai, Zhiliang…
AKGNet: Attribute Knowledge-Guided Unsupervised Lung-Infected Area Segmentationby Qing En, Yuhong GuoFirst submitted to arxiv on:…
TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language Modelsby Ya-Qi Yu, Minghui Liao, Jihao…