Summary of TC-LLaVA: Rethinking the Transfer from Image to Video Understanding with Temporal Considerations, by Mingze Gao et al.
TC-LLaVA: Rethinking the Transfer from Image to Video Understanding with Temporal Considerations by Mingze Gao, Jingyu…
Boosting Generalizability towards Zero-Shot Cross-Dataset Single-Image Indoor Depth by Meta-Initialization by Cho-Ying Wu, Yiqi Zhong, Junying…
AdaComp: Extractive Context Compression with Adaptive Predictor for Retrieval-Augmented Large Language Models by Qianchi Zhang, Hainan…
Path-Consistency: Prefix Enhancement for Efficient Inference in LLM by Jiace Zhu, Yingtao Shen, Jie Zhao, An…
ViRED: Prediction of Visual Relations in Engineering Drawings by Chao Gu, Ke Lin, Yiyang Luo, Jiahui…
Learning in Hybrid Active Inference Models by Poppy Collis, Ryan Singh, Paul F Kinghorn, Christopher L…
ConCSE: Unified Contrastive Learning and Augmentation for Code-Switched Embeddings by Jangyeong Jeon, Sangyeon Cho, Minuk Ma,…
Enhancing Natural Language Inference Performance with Knowledge Graph for COVID-19 Automated Fact-Checking in Indonesian Language by…
Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling by Hritik Bansal, Arian Hosseini, Rishabh…
VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers by Juncan Deng, Shuaiting Li, Zeyu Wang, Hong…