Summary of Empowering Embodied Visual Tracking with Visual Foundation Models and Offline Rl, by Fangwei Zhong et al.
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RLby Fangwei Zhong, Kui Wu,…
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RLby Fangwei Zhong, Kui Wu,…
Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Modelsby Peifei Zhu, Tsubasa Takahashi, Hirokatsu KataokaFirst…
Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Modelsby Matteo Pennisi, Giovanni Bellitto, Simone Palazzo, Mubarak…
QueryAgent: A Reliable and Efficient Reasoning Framework with Environmental Feedback-based Self-Correctionby Xiang Huang, Sitao Cheng,…
From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenariosby Guoshan Liu,…
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generatorsby Indraneil Paul, Goran Glavaš, Iryna…
SAR-AE-SFP: SAR Imagery Adversarial Example in Real Physics domain with Target Scattering Feature Parametersby Jiahao…
How Do Humans Write Code? Large Models Do It the Same Way Tooby Long Li,…
Flexible Physical Camouflage Generation Based on a Differential Approachby Yang Li, Wenyi Tan, Tingrui Wang,…
Boosting Adversarial Transferability across Model Genus by Deformation-Constrained Warpingby Qinliang Lin, Cheng Luo, Zenghao Niu,…