Summary of Simplifying Clip: Unleashing the Power Of Large-scale Models on Consumer-level Computers, by Hongbo Liu
Simplifying CLIP: Unleashing the Power of Large-Scale Models on Consumer-level Computersby Hongbo LiuFirst submitted to…
Simplifying CLIP: Unleashing the Power of Large-Scale Models on Consumer-level Computersby Hongbo LiuFirst submitted to…
High-Resolution Image Synthesis via Next-Token Predictionby Dengsheng Chen, Jie Hu, Tiezhu Yue, Xiaoming Wei, Enhua…
VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Spaceby Armani Rodriguez, Silvija…
Multiset Transformer: Advancing Representation Learning in Persistence Diagramsby Minghua Wang, Ziyun Huang, Jinhui XuFirst submitted…
FLARE: FP-Less PTQ and Low-ENOB ADC Based AMS-PiM for Error-Resilient, Fast, and Efficient Transformer Accelerationby…
AI Tailoring: Evaluating Influence of Image Features on Fashion Product Popularityby Xiaomin Li, Junyi ShaFirst…
Point Cloud Understanding via Attention-Driven Contrastive Learningby Yi Wang, Jiaze Wang, Ziyu Guo, Renrui Zhang,…
FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformersby Zehua Pei, Hui-Ling Zhen, Xianzhi Yu, Sinno…
Stable Flow: Vital Layers for Training-Free Image Editingby Omri Avrahami, Or Patashnik, Ohad Fried, Egor…
Generative Fuzzy System for Sequence Generationby Hailong Yang, Zhaohong Deng, Wei Zhang, Zhuangzhuang Zhao, Guanjin…