Summary of Efficient Multiscale Multimodal Bottleneck Transformer For Audio-video Classification, by Wentao Zhu
Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classificationby Wentao ZhuFirst submitted to arxiv on: 8…
Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classificationby Wentao ZhuFirst submitted to arxiv on: 8…
Plug-and-Play Transformer Modules for Test-Time Adaptationby Xiangyu Chang, Sk Miraj Ahmed, Srikanth V. Krishnamurthy, Basak…
Global-Aware Enhanced Spatial-Temporal Graph Recurrent Networks: A New Framework For Traffic Flow Predictionby Haiyang Liu,…
Attention and Autoencoder Hybrid Model for Unsupervised Online Anomaly Detectionby Seyed Amirhossein Najafi, Mohammad Hassan…
Graph2Tac: Online Representation Learning of Formal Math Conceptsby Lasse Blaauwbroek, Miroslav Olšák, Jason Rute, Fidel…
UnetTSF: A Better Performance Linear Complexity Time Series Prediction Modelby Li Chu, Xiao Bingjia, Yuan…
Geometric-Facilitated Denoising Diffusion Model for 3D Molecule Generationby Can Xu, Haosen Wang, Weigang Wang, Pengfei…
A Cost-Efficient FPGA Implementation of Tiny Transformer Model using Neural ODEby Ikumi Okubo, Keisuke Sugiura,…
Multi-Source Domain Adaptation with Transformer-based Feature Generation for Subject-Independent EEG-based Emotion Recognitionby Shadi Sartipi, Mujdat…
ODIN: A Single Model for 2D and 3D Segmentationby Ayush Jain, Pushkal Katara, Nikolaos Gkanatsios,…