Summary of Unleash the Potential Of Clip For Video Highlight Detection, by Donghoon Han et al.
Unleash the Potential of CLIP for Video Highlight Detectionby Donghoon Han, Seunghyeon Seo, Eunhwan Park,…
Unleash the Potential of CLIP for Video Highlight Detectionby Donghoon Han, Seunghyeon Seo, Eunhwan Park,…
A Review of Multi-Modal Large Language and Vision Modelsby Kilian Carolan, Laura Fennelly, Alan F.…
Towards Safety and Helpfulness Balanced Responses via Controllable Large Language Modelsby Yi-Lin Tuan, Xilun Chen,…
Bailong: Bilingual Transfer Learning based on QLoRA and Zip-tie Embeddingby Lung-Chuan Chen, Zong-Ru LiFirst submitted…
LLaMA-Excitor: General Instruction Tuning via Indirect Feature Interactionby Bo Zou, Chao Yang, Yu Qiao, Chengbin…
Configurable Safety Tuning of Language Models with Synthetic Preference Databy Victor GallegoFirst submitted to arxiv…
Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognitionby Yash Jain, David Chan, Pranav Dheram, Aparna Khare,…
Is Synthetic Image Useful for Transfer Learning? An Investigation into Data Generation, Volume, and Utilizationby…
Classification of Diabetic Retinopathy using Pre-Trained Deep Learning Modelsby Inas Al-Kamachy, Reza Hassanpour, Roya ChoupaniFirst…
“Sorry, Come Again?” Prompting – Enhancing Comprehension and Diminishing Hallucination with [PAUSE]-injected Optimal Paraphrasingby Vipula…