Summary of Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection, by Kyle Min et al.
Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection by Kyle Min, Sourya Roy, Subarna Tripathi, Tanaya…
Class-incremental Novel Class Discovery by Subhankar Roy, Mingxuan Liu, Zhun Zhong, Nicu Sebe, Elisa Ricci. First submitted…
GRIT: Faster and Better Image captioning Transformer Using Dual Visual Features by Van-Quang Nguyen, Masanori Suganuma,…
Towards Compatible Fine-tuning for Vision-Language Model Updates by Zhengbo Wang, Jian Liang, Lijun Sheng, Ran He,…
DDIM sampling for Generative AIBIM, a faster intelligent structural design framework by Zhili He, Yu-Hsing Wang. First…
Generalizing in Net-Zero Microgrids: A Study with Federated PPO and TRPO by Nicolas M Cuadrado Avila,…
Uncertainty-Aware Out-of-Distribution Detection with Gaussian Processes by Yang Chen, Chih-Li Sung, Arpan Kusari, Xiaoyang Song, Wenbo…
Conservation-informed Graph Learning for Spatiotemporal Dynamics Prediction by Yuan Mi, Pu Ren, Hongteng Xu, Hongsheng Liu,…
AlignAb: Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies by Yibo Wen, Chenwei Xu, Jerry Yao-Chieh Hu,…
RobustBlack: Challenging Black-Box Adversarial Attacks on State-of-the-Art Defenses by Mohamed Djilani, Salah Ghamizi, Maxime Cordy. First submitted…