Summary of Vilco-bench: Video Language Continual Learning Benchmark, by Tianqi Tang et al.
ViLCo-Bench: VIdeo Language COntinual learning Benchmarkby Tianqi Tang, Shohreh Deldari, Hao Xue, Celso De Melo,…
ViLCo-Bench: VIdeo Language COntinual learning Benchmarkby Tianqi Tang, Shohreh Deldari, Hao Xue, Celso De Melo,…
Cycle-Correspondence Loss: Learning Dense View-Invariant Visual Features from Unlabeled and Unordered RGB Imagesby David B.…
DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Featuresby…
ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shiftsby Samar Khanna, Medhanie Irgau,…
SSTFB: Leveraging self-supervised pretext learning and temporal self-attention with feature branching for real-time video polyp…
Fine-Grained Urban Flow Inference with Multi-scale Representation Learningby Shilu Yuan, Dongfeng Li, Wei Liu, Xinxin…
Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn’tby Chihiro Taguchi, David…
Exploring Self-Supervised Multi-view Contrastive Learning for Speech Emotion Recognition with Limited Annotationsby Bulat Khaertdinov, Pedro…
BrainChat: Decoding Semantic Information from fMRI using Vision-language Pretrained Modelsby Wanaiu HuangFirst submitted to arxiv…
Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Trainingby Ke Niu, Haiyang Yu, Xuelin…