Summary of VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking, by Zekun Qian et al.
VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking by Zekun Qian, Ruize Han, Junhui…
Channel-Aware Throughput Maximization for Cooperative Data Fusion in CAV by Haonan An, Zhengru Fang, Yuang Zhang,…
SyllableLM: Learning Coarse Semantic Units for Speech Language Models by Alan Baade, Puyuan Peng, David Harwath. First…
SynCo: Synthetic Hard Negatives for Contrastive Visual Representation Learning by Nikolaos Giakoumoglou, Tania Stathaki. First submitted to…
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation by Liang Chen,…
Finetuning Pre-trained Model with Limited Data for LiDAR-based 3D Object Detection by Bridging Domain Gaps by…
ProMerge: Prompt and Merge for Unsupervised Instance Segmentation by Dylan Li, Gyungin Shin. First submitted to arXiv…
SC-Phi2: A Fine-tuned Small Language Model for StarCraft II Macromanagement Tasks by Muhammad Junaid Khan, Gita…
Learning from Pattern Completion: Self-supervised Controllable Generation by Zhiqiang Chen, Guofan Fan, Jinying Gao, Lei Ma,…
Self-supervised Preference Optimization: Enhance Your Language Model with Preference Degree Awareness by Jian Li, Haojing Huang,…