Summary of Hpe-cogvlm: Advancing Vision Language Models with a Head Pose Grounding Task, by Yu Tian et al.
HPE-CogVLM: Advancing Vision Language Models with a Head Pose Grounding Taskby Yu Tian, Tianqi Shao,…
HPE-CogVLM: Advancing Vision Language Models with a Head Pose Grounding Taskby Yu Tian, Tianqi Shao,…
Research on the Application of Computer Vision Based on Deep Learning in Autonomous Driving Technologyby…
Kolmogorov-Arnold Network for Satellite Image Classification in Remote Sensingby Minjong CheonFirst submitted to arxiv on:…
An Effective Weight Initialization Method for Deep Learning: Application to Satellite Image Classificationby Wadii Boulila,…
Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification Benchmark Studyby…
DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wildby Honghao Fu, Yufei…
MDS-ViTNet: Improving saliency prediction for Eye-Tracking with Vision Transformerby Polezhaev Ignat, Goncharenko Igor, Iurina NatalyaFirst…
A Deep Convolutional Neural Network-based Model for Aspect and Polarity Classification in Hausa Movie Reviewsby…
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attentionby Bencheng Liao, Xinggang Wang, Lianghui Zhu,…
A Review and Implementation of Object Detection Models and Optimizations for Real-time Medical Mask Detection…