Summary of On the Efficacy Of Text-based Input Modalities For Action Anticipation, by Apoorva Beedu et al.
On the Efficacy of Text-Based Input Modalities for Action Anticipationby Apoorva Beedu, Harish Haresamudram, Karan…
On the Efficacy of Text-Based Input Modalities for Action Anticipationby Apoorva Beedu, Harish Haresamudram, Karan…
Dynamic Layer Tying for Parameter-Efficient Transformersby Tamir David Hay, Lior WolfFirst submitted to arxiv on:…
P2DT: Mitigating Forgetting in task-incremental Learning with progressive prompt Decision Transformerby Zhiyuan Wang, Xiaoyang Qu,…
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformersby Katherine Crowson, Stefan Andreas Baumann, Alex…
Freely Long-Thinking Transformer (FraiLT)by Akbay TabakFirst submitted to arxiv on: 21 Jan 2024CategoriesMain: Machine Learning…
Language Models as Hierarchy Encodersby Yuan He, Zhangdie Yuan, Jiaoyan Chen, Ian HorrocksFirst submitted to…
VONet: Unsupervised Video Object Learning With Parallel U-Net Attention and Object-wise Sequential VAEby Haonan Yu,…
Cross-Task Affinity Learning for Multitask Dense Scene Predictionsby Dimitrios Sinodinos, Narges ArmanfardFirst submitted to arxiv…
Recent Advances in Named Entity Recognition: A Comprehensive Survey and Comparative Studyby Imed Keraghel, Stanislas…
Understanding Video Transformers via Universal Concept Discoveryby Matthew Kowal, Achal Dave, Rares Ambrus, Adrien Gaidon,…