Summary of Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm, by Jinrong Zhang et al.
Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm by Jinrong Zhang, Penghui Wang,…
Optimizing Few-Step Sampler for Diffusion Probabilistic Model by Jen-Yuan Huang. First submitted to arXiv on: 14 Dec…
Enhance Vision-Language Alignment with Noise by Sida Huang, Hongyuan Zhang, Xuelong Li. First submitted to arXiv on:…
RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone by Mustafa Munir, Md Mostafijur Rahman, Radu Marculescu. First submitted…
Geo-LLaVA: A Large Multi-Modal Model for Solving Geometry Math Problems with Meta In-Context Learning by Shihao…
On Adversarial Robustness and Out-of-Distribution Robustness of Large Language Models by April Yang, Jordan Tab, Parth…
Active Inference for Self-Organizing Multi-LLM Systems: A Bayesian Thermodynamic Approach to Adaptation by Rithvik Prakki. First submitted…
SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Models by Hung Nguyen, Quang Qui-Vinh Nguyen,…
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding by Zhiyu Wu, Xiaokang Chen, Zizheng Pan, Xingchao…
From Noise to Nuance: Advances in Deep Generative Image Models by Benji Peng, Chia Xin Liang,…